Google Kubernetes Engine, Tailored for Faster AI Workloads
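A major GKE upgrade: the new Gateway API Inference Extension routes LLM traffic intelligently, raising throughput by 40% and cutting tail latency by 60%. Cluster Director adds AI supercomputing capabilities, with a single job scaling to 65,000 GPUs/TPUs. Gemini Clou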
google kubern googlekubernetes 2025-04-11 09:15 4