The Lambda Deep Learning Blog

Published on May 22, 2026 by Zach Mueller

DeepSeek V4: the most expected open-source model ever released, and the quietest landing

After 15 months of incremental updates, leaks, and rumored leaks, DeepSeek released version 4. It arrived without the fanfare R1 and R1-preview commanded in ...

Published on April 30, 2026 by Zach Mueller

Creating highly efficient agents: 450M tool-calling tokens distilled for post-training from top open-source models

Harnesses If you've used Claude Code or Codex, you've used a harness. A harness is the infrastructure layer that wraps an AI coding agent and decides how it ...

Published on March 5, 2026 by Lambda

Open model, open metrics: How Lambda and the Olmo team trained Olmo Hybrid

Open-source models are now one of the main engines of progress in AI. When strong models like Nemotron, Llama 3.1, and Qwen3-Next are released openly, the ...

Published on August 15, 2024 by Mitesh Agrawal

Unveiling Hermes 3: The First Full-Parameter Fine-Tuned Llama 3.1 405B Model is on Lambda’s Cloud

Try Hermes 3 for free with the New Lambda Chat Completions API and Lambda Chat. Introducing Hermes 3: A new era for Llama fine-tuning We are thrilled to ...

Published on February 13, 2024 by David Hartmann

ShadeRunner: Chrome plugin for enhanced on-page research

In today's digital era, accessing information efficiently is crucial. Our new Chrome plugin, ShadeRunner, aims to simplify this process by offering a range of ...

Published on September 14, 2023 by Xi Tian

Exploring AI's Role in Summarizing Scientific Reviews

Published on July 24, 2023 by Xi Tian

Chat with a PDF using Falcon: Unleashing the Power of Open-Source LLMs!

Unlock the potential of open-source LLMs by hosting your very own langchain+Falcon+Chroma application! Now, you can upload a PDF and engage in captivating ...

Published on July 20, 2023 by Corey Lowman

Fine tuning Meta's LLaMA 2 on Lambda GPU Cloud

This blog post provides instructions on how to fine tune Llama 2 models on Lambda Cloud using a $0.60/hr A10 GPU.

Published on July 13, 2023 by David Hall

Considerations for Large-Scale NVIDIA H100 Cluster Deployments

Published on June 29, 2023 by Xi Tian

Fine-tuning Falcon LLM 7B/40B

This guide walks you through how to fine-tune Falcon LLM 7B/40B on a single GPU with LoRA and quantization, enabling data parallelism for linear scaling across ...