Getting Started Guide — Lambda Cloud GPU Instances
This guide will walk you through the process of launching a Lambda Cloud GPU instance and using SSH to log in. While we offer both a Web Terminal and Jupyter ...
Published on by Remy Guercio
This guide will walk you through the process of launching a Lambda Cloud GPU instance and using SSH to log in. While we offer both a Web Terminal and Jupyter ...
Published on by Stephen Balaban
In this post we'll walk through using our Total Cost of Ownership (TCO) calculator to examine the cost of a variety of Lambda Hyperplane-16 clusters. We have ...
Published on by Michael Balaban
State-of-the-art (SOTA) deep learning models have massive memory footprints. Many GPUs don't have enough VRAM to train them. In this post, we determine which ...
Published on by Chuan Li
by Chuan Li, PhD
Published on by Stephen Balaban
The desired end-state of this tutorial, a running subnet manager on your switch. This tutorial will walk you through the steps required to set up a Mellanox ...
Published on by Chuan Li
TensorFlow 2 is now live! This tutorial walks you through the process of building a simple CIFAR-10 image classifier using deep learning. In this tutorial, we ...
Published on by Chuan Li
This blog will walk you through the steps of setting up a Horovod + Keras environment for multi-GPU training.
Published on by Chuan Li
One of the most asked questions we get at Lambda Labs is, “how do I track resource utilization for deep learning jobs?” Resource utilization tracking can help ...
Published on by Chuan Li
Distributed training allows scaling up deep learning task so bigger models can be learned or training can be conducted at a faster pace. In a previous ...
Published on by Chuan Li
During training, weights in the neural networks are updated so that the model performs better on the training data. For a while, improvements on the training ...
Published on by Chuan Li
This tutorial combines two items from previous tutorials: saving models and callbacks. Checkpoints are saved model states that occur during training. With ...
Published on by Chuan Li
This tutorial shows you how to perform transfer learning using TensorFlow 2.0. We will cover:
Published on by Stephen Balaban
This presentation is a high-level overview of the different types of training regimes that you'll encounter as you move from single GPU to multi GPU to multi ...
Published on by Michael Balaban
In this post, Lambda Labs benchmarks the Titan V's Deep Learning / Machine Learning performance and compares it to other commonly used GPUs. We use the Titan V ...
Published on by Stephen Balaban
by Chuan Li, PhD