Kimi K2 Thinking: what 200+ tool calls mean for production
TL;DR: Kimi K2 Thinking is Moonshot AI's open-source reasoning model, scoring 44.9% on Humanity's Last Exam with the ability to chain 200-300 sequential tool ...
Published by Lea Alcantara
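The entry above describes a model that chains 200-300 sequential tool calls. As a rough illustration of what that means structurally (this is a hypothetical sketch, not Moonshot's API: `fake_model`, `TOOLS`, and `add_one` are invented stand-ins), an agentic loop boils down to the model repeatedly choosing the next tool call from prior results until it decides it is done:

```python
# Minimal sketch of a sequential tool-calling loop. In a real agent,
# fake_model would be an LLM call that returns either a tool invocation
# or a final answer; here it is a deterministic stand-in.

def fake_model(state):
    """Stand-in for a model choosing the next tool call from prior results."""
    if state["calls"] < 3:
        return {"tool": "add_one", "arg": state["value"]}
    return None  # the model decides it is finished

TOOLS = {"add_one": lambda x: x + 1}  # hypothetical tool registry

def run_agent(max_calls=300):
    # max_calls caps the chain, mirroring the 200-300 call budget above
    state = {"calls": 0, "value": 0}
    while state["calls"] < max_calls:
        action = fake_model(state)
        if action is None:
            break
        state["value"] = TOOLS[action["tool"]](action["arg"])
        state["calls"] += 1
    return state

print(run_agent())  # {'calls': 3, 'value': 3}
```

The point of the sketch is that "200+ tool calls" is a property of how long the model can sustain this loop coherently, not a different control structure.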

TL;DR: SkyPilot is an open-source orchestration tool that automates ML job deployment on Lambda Cloud. This tutorial covers installation, configuration, and ...
Published by Cody Brownstein

2025 was a year of momentum in AI: intelligence progressed through new methods, open-source communities released competitive models, and research labs ...
Published by Lea Alcantara

This guide demonstrates how to scale JAX-based LLM training from a single GPU to multi-node clusters on NVIDIA Blackwell infrastructure. We present a ...
Published by Jessica Nicholson

When your model doesn’t fit on a single GPU, you suddenly need to target multiple GPUs on a single machine, configure a serving stack that actually uses all ...
Published by Zach Mueller

JAX unlocks distinct advantages on GPUs: automatic kernel fusion via XLA, composable transformations, and hardware-agnostic code that moves between ...
Published by Jessica Nicholson

Inference at scale is still too slow. Large models often stall under real-world load, burning time, compute, and user trust. That’s the problem we set out to ...
Published by Anket Sah

Graphics Processing Units (GPUs) were originally designed to handle computer graphics, like making video games look realistic or helping Netflix ...
Published by Jessica Nicholson

NVIDIA Blackwell GPUs are now available as 8x Lambda Instances On-Demand, featuring the powerful NVIDIA HGX™ B200 in addition to our trusted lineup.
Published by Anket Sah

If you've been anywhere near LLMs lately, you've probably heard the word "reasoning" thrown around more than a frisbee at a college campus. GPT-4 can "reason" ...
Published by Jessica Nicholson

Lambda’s 1-Click Clusters (1CC) provide AI teams with streamlined access to scalable, multi-node GPU clusters, cutting through the complexity of distributed ...
Published by Anket Sah

As AI models grow in complexity and size, the demand for efficient computation becomes paramount. FP4 (4-bit floating point) precision emerges as a ...
Published by Anket Sah
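To make the FP4 entry above concrete: the commonly used 4-bit floating-point element format (E2M1: 1 sign bit, 2 exponent bits, 1 mantissa bit) can represent only the values {±0, ±0.5, ±1, ±1.5, ±2, ±3, ±4, ±6}. A minimal reference sketch of round-to-nearest FP4 quantization, for intuition only (not a hardware kernel, and omitting the per-block scale factors real FP4 schemes apply):

```python
# The 8 non-negative E2M1 code points; negating them gives the full set.
FP4_POS = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
FP4_GRID = sorted({s * v for v in FP4_POS for s in (1.0, -1.0)})

def quantize_fp4(x):
    """Round a float to the nearest representable FP4 (E2M1) value."""
    return min(FP4_GRID, key=lambda g: abs(g - x))

print([quantize_fp4(v) for v in (0.2, 1.7, 2.4, 10.0)])
# -> [0.0, 1.5, 2.0, 6.0]
```

The coarse grid is why production FP4 schemes pair each small block of values with a higher-precision scale: the scale stretches the 16 code points over that block's dynamic range.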

In AI, scaling doesn’t always mean “bigger.” That’s why we champion lean, efficient LLM design that maximizes performance while minimizing compute cost and ...
Published by Anket Sah

Lambda + dstack: Empowering your ML team with rock-solid infrastructure for distributed reasoning agent training
Published by dstack

DeepSeek has just leveled up. The latest release, DeepSeek-R1-0528, is now available on Lambda’s Inference API, delivering a formidable blend of mathematical ...
Published by Anket Sah