The Lambda Deep Learning Blog (12)

Published on June 7, 2019 by Chuan Li

TensorFlow 2.0 Tutorial 05: Distributed Training across Multiple Nodes

Distributed training allows scaling up deep learning tasks so bigger models can be learned or training can be conducted at a faster pace. In a previous ...

Published on June 6, 2019 by Chuan Li

TensorFlow 2.0 Tutorial 04: Early Stopping

During training, weights in the neural networks are updated so that the model performs better on the training data. For a while, improvements on the training ...

Published on June 6, 2019 by Chuan Li

TensorFlow 2.0 Tutorial 03: Saving Checkpoints

This tutorial combines two items from previous tutorials: saving models and callbacks. Checkpoints are model states saved during training. With ...

Published on June 5, 2019 by Chuan Li

TensorFlow 2.0 Tutorial 02: Transfer Learning

This tutorial shows you how to perform transfer learning using TensorFlow 2.0. We will cover:

Published on May 31, 2019 by Stephen Balaban

A Gentle Introduction to Multi GPU and Multi Node Distributed Training

This presentation is a high-level overview of the different types of training regimes that you'll encounter as you move from single GPU to multi GPU to multi ...

Published on March 12, 2019 by Michael Balaban

Titan V Deep Learning Benchmarks with TensorFlow

In this post, Lambda Labs benchmarks the Titan V's Deep Learning / Machine Learning performance and compares it to other commonly used GPUs. We use the Titan V ...

Published on March 4, 2019 by Stephen Balaban

RTX 2080 Ti Deep Learning Benchmarks with TensorFlow

by Chuan Li, PhD

Published on February 17, 2019 by Stephen Balaban

Perform GPU, CPU, and I/O stress testing on Linux

CPU, GPU, and I/O utilization monitoring using tmux, htop, iotop, and nvidia-smi. This stress test is running on a Lambda GPU Cloud 4x GPU instance. Often ...

Published on February 16, 2019 by Stephen Balaban

How to Run OpenAI's GPT-2 Text Generator on Your Computer

Update June 5th 2020: OpenAI has announced a successor to GPT-2 in a newly published paper. Check out our GPT-3 model overview.

Published on February 11, 2019 by Chuan Li

V100 server on-prem vs AWS p3 instance cost comparison

Deep Learning requires GPUs, which are very expensive to rent in the cloud. In this post, we compare the cost of buying vs. renting a cloud GPU server. We use ...

Published on February 10, 2019 by Stephen Balaban

Install CUDA 10 on Ubuntu 18.04

You were probably thinking that this was going to be a long post. You're in luck. All you need to do is to install Ubuntu 18.04 and then Lambda Stack. Here's ...

Published on February 10, 2019 by Stephen Balaban

Set up a GPU accelerated Docker container using Lambda Stack + Lambda Stack Dockerfiles on Ubuntu 20.04 LTS

Or, how Lambda Stack + Lambda Stack Dockerfiles = GPU accelerated deep learning containers

Published on February 8, 2019 by Chuan Li

Text Generation: Char-RNN Data preparation and TensorFlow implementation

This tutorial is about making a character-based text generator using a simple two-layer LSTM. It will walk you through the data preparation and the network ...

Published on February 6, 2019 by Chuan Li

Multi-GPU enabled BERT using Horovod

BERT is Google's method for pre-training language representations, which obtained state-of-the-art results on a wide range of Natural Language Processing tasks. ...

Published on January 25, 2019 by Stephen Balaban

On-prem GPU Training Infrastructure for Deep Learning - Slides

These slides are from my talk at Rework Deep Learning Summit 2019.
