
Benchmarking ZeRO-Inference on the NVIDIA GH200 Grace Hopper Superchip
This blog explores the synergy of DeepSpeed’s ZeRO-Inference, a technology designed to make large AI model inference more accessible and cost-effective, with ...
This blog explores the synergy of DeepSpeed’s ZeRO-Inference, a technology designed to make large AI model inference more accessible and cost-effective, with ...
Published on by Chuan Li
Persistent storage for Lambda Cloud recently exited beta and became available in the majority of regions. We are excited to announce that filesystems are now ...
Published on by Kathy Bui
In this blog, Lambda showcases the capabilities of NVIDIA’s Transformer Engine, a cutting-edge library that accelerates the performance of transformer models ...
Published on by Chuan Li
Create a cloud account instantly to spin up GPUs today or contact us to secure a long-term contract for thousands of GPUs