
Cloud Run GPUs, now GA, makes running AI workloads easier for everyone
created: June 4, 2025, 8:28 a.m. | updated: June 5, 2025, 12:51 a.m.
Developers love Cloud Run, Google Cloud’s serverless runtime, for its simplicity, flexibility, and scalability.
And today, we’re thrilled to announce that NVIDIA GPU support for Cloud Run is now generally available, offering a powerful runtime for a variety of use cases that’s also remarkably cost-efficient.
Scale to zero : Cloud Run automatically scales your GPU instances down to zero when no requests are received, eliminating idle costs.
Support for GPUs in Cloud Run is a significant milestone, underscoring our leadership in making GPU-accelerated applications simpler, faster, and more cost-effective than ever before.
With seamless access to NVIDIA L4 GPUs, developers can now bring AI applications to production faster and more cost-effectively than ever before.” - Dave Salvator, director of accelerated computing products, NVIDIA
4 days, 3 hours ago: Hacker News