The Essential Guide to GPUs for AI: Training and Inference
Graphics Processing Units (GPUs) were originally designed to handle computer graphics, like making video games look realistic or helping Netflix ...
Published by Jessica Nicholson
Lambda’s 1-Click Clusters (1CC) provide AI teams with streamlined access to scalable, multi-node GPU clusters, cutting through the complexity of distributed ...
Published by dstack
Lambda + dstack: Empowering your ML team with rock-solid infrastructure for distributed reasoning agent training
Published by Anket Sah
AI doesn’t wait, and neither should real-time insights into your infrastructure!
Published by Anket Sah
Introducing Managed Slurm (Early Preview) on Lambda: Your AI Cluster’s New Best Friend. Think of Slurm as the air-traffic controller for your GPU fleet that ...
Published by Thomas Bordes
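As a companion to the Slurm teaser above, here is a minimal sketch of what submitting a multi-node training job to a Slurm-managed GPU cluster can look like from Python. The resource directives, GPU counts, and training command are illustrative assumptions, not Lambda Managed Slurm defaults; the only requirement assumed is that `sbatch` is available on the login node.

```python
# Minimal sketch: queueing a two-node GPU training job on a Slurm cluster.
# Resource counts and the training command are illustrative assumptions.
import subprocess
from pathlib import Path

batch_script = """#!/bin/bash
#SBATCH --job-name=llm-train        # name shown by squeue
#SBATCH --nodes=2                   # two GPU nodes
#SBATCH --gpus-per-node=8           # 8 GPUs requested on each node
#SBATCH --ntasks-per-node=1         # one launcher process per node
#SBATCH --time=04:00:00             # wall-clock limit

# srun fans the command out across the allocated nodes.
srun python train.py --config config.yaml
"""

script_path = Path("train_job.sbatch")
script_path.write_text(batch_script)

# sbatch queues the job; Slurm decides when and where it runs.
result = subprocess.run(
    ["sbatch", str(script_path)],
    capture_output=True, text=True, check=True,
)
print(result.stdout.strip())  # e.g. "Submitted batch job 1234"
```

The same script could just as well be submitted by hand with `sbatch train_job.sbatch`; the Python wrapper is only there to keep the example self-contained.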
Blackwell is coming… so is ARM computing. 2025 is just around the corner, and with it comes the highly anticipated launch of NVIDIA's revolutionary Blackwell ...
Published by Mitesh Agrawal
Opening up options: higher-end GPUs in smaller chunks. We're excited to announce the launch of new 1x, 2x, and 4x NVIDIA H100 SXM Tensor Core GPU instances in ...
Published by Robert Brooks IV
A Golden Ticket to an extraordinary prize. We’re excited to introduce Lambda’s Golden Ticket prize draw, offering you and your team the chance to win full-time ...
Published by Mitesh Agrawal
Try Hermes 3 for free with the new Lambda Chat Completions API and Lambda Chat. Introducing Hermes 3: A new era for Llama fine-tuning. We are thrilled to ...
Published by Mitesh Agrawal
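Since the post above centers on the Lambda Chat Completions API, the sketch below shows the general shape of a call to an OpenAI-compatible chat completions endpoint. The `base_url` and model identifier are assumptions for illustration only; check Lambda's API documentation for the actual endpoint and model names, and export your API key as `LAMBDA_API_KEY`.

```python
# Hedged sketch of an OpenAI-compatible chat completions call.
# The base_url and model name are assumptions; verify them against Lambda's docs.
import os
from openai import OpenAI  # pip install openai

client = OpenAI(
    api_key=os.environ["LAMBDA_API_KEY"],       # your Lambda API key
    base_url="https://api.lambdalabs.com/v1",   # assumed endpoint
)

response = client.chat.completions.create(
    model="hermes-3-llama-3.1-405b",            # hypothetical model identifier
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What does Hermes 3 add on top of Llama?"},
    ],
)
print(response.choices[0].message.content)
```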
Introducing Lambda 1-Click Clusters: 16 to 512 interconnected NVIDIA H100 Tensor Core GPUs. Available on-demand. No long-term contracts required. Spinning up a ...
Published by Chuan Li
This blog explores the synergy of DeepSpeed’s ZeRO-Inference, a technology designed to make large AI model inference more accessible and cost-effective, with ...
Published by Kathy Bui
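To make the ZeRO-Inference teaser above more concrete, here is a rough sketch of the usual pattern: build a ZeRO stage 3 DeepSpeed config that offloads parameters to CPU memory, wrap a Hugging Face model with `deepspeed.initialize`, and generate from the wrapped module. The model name, dtype, and config values are illustrative assumptions rather than the settings used in the post, and the script is normally started with the `deepspeed` launcher.

```python
# Rough sketch of DeepSpeed ZeRO-Inference: ZeRO stage 3 with CPU parameter offload.
# Model name and config values are illustrative; typically run via `deepspeed this_script.py`.
import deepspeed
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "facebook/opt-1.3b"  # placeholder model for illustration

ds_config = {
    "train_micro_batch_size_per_gpu": 1,      # required by DeepSpeed even for inference use
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,                           # partition parameters (ZeRO stage 3)
        "offload_param": {"device": "cpu", "pin_memory": True},  # keep weights in CPU RAM
    },
}

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)

# deepspeed.initialize partitions and offloads parameters according to ds_config.
engine, *_ = deepspeed.initialize(model=model, config=ds_config)
engine.module.eval()

inputs = tokenizer("Large-model inference on a single GPU", return_tensors="pt").to("cuda")
with torch.no_grad():
    output_ids = engine.module.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```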
Persistent storage for Lambda Cloud recently exited beta and became available in the majority of regions. We are excited to announce that filesystems are now ...
Published by Chuan Li
In this blog, Lambda showcases the capabilities of NVIDIA’s Transformer Engine, a cutting-edge library that accelerates the performance of transformer models ...
Published by Chuan Li
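Because the teaser above is about NVIDIA's Transformer Engine, here is a minimal sketch of its FP8 usage pattern on an H100-class GPU: swap a PyTorch layer for its Transformer Engine counterpart and run the forward pass inside `fp8_autocast`. The layer sizes and recipe settings are illustrative assumptions, not the configuration benchmarked in the post.

```python
# Minimal sketch of FP8 execution with NVIDIA Transformer Engine (Hopper or newer GPU).
import torch
import transformer_engine.pytorch as te
from transformer_engine.common import recipe

in_features, out_features, tokens = 4096, 4096, 512   # illustrative sizes

# te.Linear is a drop-in replacement for torch.nn.Linear with FP8 support.
layer = te.Linear(in_features, out_features, bias=True).cuda()

# A delayed-scaling recipe controls how FP8 scaling factors are maintained.
fp8_recipe = recipe.DelayedScaling(margin=0, fp8_format=recipe.Format.HYBRID)

x = torch.randn(tokens, in_features, device="cuda")

# Inside fp8_autocast, supported GEMMs execute in FP8 on the Tensor Cores.
with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
    y = layer(x)

print(y.shape)  # torch.Size([512, 4096])
```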
GPU benchmarks comparing Lambda’s NVIDIA H100 SXM5 and NVIDIA A100 SXM4 offerings, using DeepSpeed Chat’s 3-step training example.
Published by Maxx Garrison
Lambda has launched a new Hyperplane server with NVIDIA H100 GPUs and AMD EPYC 9004 series CPUs. The new AI server combines the fastest GPU type on the market, ...