We open sourced the benchmarking code we use at Lambda Labs so that anybody can reproduce the benchmarks that we publish or run their own. We encourage people to email us with their results and will continue to publish those results here. You can run the code and email benchmarks@lambdalabs.com or tweet @LambdaAPI. This is the official page for all Lambda Community Benchmarks.
How to get your results published here
Component | Version
-----------------|------------
CPU | $(cat /proc/cpuinfo | grep 'model name' | uniq | awk -F: '{ print $2 }')
Distro | $(lsb_release -d)
Kernel Version | $(uname -r)
Kernel Arch | $(uname -m)
GPU | $(sudo lspci | grep VGA\ compat | head -n1)
Tensorflow | $(python -c 'import tensorflow;print(tensorflow.__version__)' 2> /dev/null)
NVIDIA Driver | $(head -n1 /proc/driver/nvidia/version | awk '{ print $8 }')
CUDA | $(nvcc --version | tail -n 1 | grep Cuda | awk '{ print $6 }')
cuDNN | $(cat /usr/include/cudnn.h | grep -P 'define\ CUDNN_MAJOR|define\ CUDNN_MINOR|define\ CUDNN_PATCHLEVEL' | awk '{ print $3 }' | sed ':a;N;$!ba;s/\n/./g')
Python | $(python --version 2>&1)
Copy the above and paste into template.txt. Then run the code below to output your table.
IFS=' '
cat > template.txt
CTRL-V (paste in)
CTRL-D (end file)
(for line in $(cat template.txt); do eval "echo \"$line\""; done) > specs-table.txt
Crowd Sourced Results
Here are the results that have been submitted to us by third parties.
Summary (Stanislav Brizitsky)
| Component |
Version |
| CPU |
AMD Phenom(tm) II X6 1075T Processor |
| Distro |
Ubuntu 19.04 |
| Kernel Version |
5.0.0-31-generic |
| Kernel Arch |
x86_64 |
| GPU |
04:00.0 VGA compatible controller: NVIDIA Corporation TU102 GeForce RTX 2080 Ti (rev a1) |
| Tensorflow |
1.14.0 |
| NVIDIA Driver |
430.50 |
| CUDA |
V10.1.105 |
| cuDNN |
7.6.4 |
| Python |
Python 3.7.3 |
Summary
| model |
input size |
param mem |
feat. mem |
flops |
| resnet-50 |
224 x 224 |
98 MB |
103 MB |
4 BFLOPs |
| resnet-152 |
224 x 224 |
230 MB |
219 MB |
11 BFLOPs |
| inception-v3 |
299 x 299 |
91 MB |
89 MB |
6 BFLOPs |
| vgg-vd-19 |
224 x 224 |
548 MB |
63 MB |
20 BFLOPs |
| alexnet |
227 x 227 |
233 MB |
3 MB |
1.5 BFLOPs |
| ssd-300 |
300 x 300 |
100 MB |
116 MB |
31 GFLOPS |
syn-replicated-fp32-1gpus
| Config |
X6-GeForce_RTX_2080_Ti |
| resnet50 |
294.45 |
| resnet152 |
107.72 |
| inception3 |
193.16 |
| inception4 |
76.11 |
| vgg16 |
176.88 |
| alexnet |
3665.52 |
| ssd300 |
150.39 |
syn-parameter_server-fp32-1gpus
| Config |
X6-GeForce_RTX_2080_Ti |
| resnet50 |
291.69 |
| resnet152 |
107.40 |
| inception3 |
192.78 |
| inception4 |
76.10 |
| vgg16 |
176.89 |
| alexnet |
3673.53 |
| ssd300 |
150.30 |
syn-replicated-fp16-1gpus
| Config |
X6-GeForce_RTX_2080_Ti |
| resnet50 |
462.98 |
| resnet152 |
172.64 |
| inception3 |
284.54 |
| inception4 |
104.68 |
| vgg16 |
261.92 |
| alexnet |
4755.02 |
| ssd300 |
194.10 |
syn-parameter_server-fp16-1gpus
| Config |
X6-GeForce_RTX_2080_Ti |
| resnet50 |
468.89 |
| resnet152 |
176.30 |
| inception3 |
287.66 |
| inception4 |
108.32 |
| vgg16 |
266.71 |
| alexnet |
4856.88 |
| ssd300 |
197.04 |
| Attachments area |
|
Summary (Antonio Marin)
RTX2080Ti benchmark
Specifications
| Component |
Version |
| CPU |
Intel(R) Core(TM) i9-7900X CPU @ 3.30GHz |
| Distro |
Description: Ubuntu 16.04.6 LTS |
| Kernel Version |
4.15.0-52-generic |
| Kernel Arch |
x86_64 |
| GPU |
65:00.0 VGA compatible controller: NVIDIA Corporation GV102 (rev a1) |
| Tensorflow |
1.12.0 |
| NVIDIA Driver |
418.56 |
| CUDA |
V7.5.17 |
| cuDNN |
7.3.1 |
| Python |
Python 3.6.8 :: Anaconda, Inc. |
Benchmark results
| model |
input size |
param mem |
feat. mem |
flops |
| resnet-50 |
224 x 224 |
98 MB |
103 MB |
4 BFLOPs |
| resnet-152 |
224 x 224 |
230 MB |
219 MB |
11 BFLOPs |
| inception-v3 |
299 x 299 |
91 MB |
89 MB |
6 BFLOPs |
| vgg-vd-19 |
224 x 224 |
548 MB |
63 MB |
20 BFLOPs |
| alexnet |
227 x 227 |
233 MB |
3 MB |
1.5 BFLOPs |
| ssd-300 |
300 x 300 |
100 MB |
116 MB |
31 GFLOPS |
syn-replicated-fp32-1gpus
| Config |
i9-7900X-GeForce_RTX_2080_Ti |
| resnet50 |
318.45 |
| resnet152 |
121.54 |
| inception3 |
210.28 |
| inception4 |
88.72 |
| vgg16 |
186.87 |
| alexnet |
3877.75 |
| ssd300 |
162.28 |
syn-parameter_server-fp32-1gpus
| Config |
i9-7900X-GeForce_RTX_2080_Ti |
| resnet50 |
316.52 |
| resnet152 |
122.22 |
| inception3 |
211.87 |
| inception4 |
88.26 |
| vgg16 |
186.70 |
| alexnet |
3868.16 |
| ssd300 |
162.23 |
syn-replicated-fp16-1gpus
| Config |
i9-7900X-GeForce_RTX_2080_Ti |
| resnet50 |
448.98 |
| resnet152 |
159.09 |
| inception3 |
261.64 |
| inception4 |
96.25 |
| vgg16 |
215.97 |
| alexnet |
4507.86 |
| ssd300 |
186.27 |
syn-parameter_server-fp16-1gpus
| Config |
i9-7900X-GeForce_RTX_2080_Ti |
| resnet50 |
454.84 |
| resnet152 |
162.12 |
| inception3 |
259.83 |
| inception4 |
98.24 |
| vgg16 |
220.16 |
| alexnet |
4566.05 |
| ssd300 |
187.44 |
Summary - Mike Metral - 1080 Ti
| model |
input size |
param mem |
feat. mem |
flops |
| resnet-50 |
224 x 224 |
98 MB |
103 MB |
4 BFLOPs |
| resnet-152 |
224 x 224 |
230 MB |
219 MB |
11 BFLOPs |
| inception-v3 |
299 x 299 |
91 MB |
89 MB |
6 BFLOPs |
| vgg-vd-19 |
224 x 224 |
548 MB |
63 MB |
20 BFLOPs |
| alexnet |
227 x 227 |
233 MB |
3 MB |
1.5 BFLOPs |
| ssd-300 |
300 x 300 |
100 MB |
116 MB |
31 GFLOPS |
syn-replicated-fp32-1gpus
| Config |
v2-GeForce_GTX_1080_Ti |
| resnet50 |
221.33 |
| resnet152 |
84.99 |
| inception3 |
142.51 |
| inception4 |
60.11 |
| vgg16 |
142.39 |
| alexnet |
2868.88 |
| ssd300 |
112.22 |
syn-parameter_server-fp32-1gpus
| Config |
v2-GeForce_GTX_1080_Ti |
| resnet50 |
221.24 |
| resnet152 |
85.04 |
| inception3 |
142.39 |
| inception4 |
60.12 |
| vgg16 |
142.17 |
| alexnet |
2870.47 |
| ssd300 |
112.14 |
syn-replicated-fp16-1gpus
| Config |
v2-GeForce_GTX_1080_Ti |
| resnet50 |
275.24 |
| resnet152 |
99.76 |
| inception3 |
161.39 |
| inception4 |
64.63 |
| vgg16 |
153.03 |
| alexnet |
2981.33 |
| ssd300 |
126.42 |
syn-parameter_server-fp16-1gpus
| Config |
v2-GeForce_GTX_1080_Ti |
| resnet50 |
275.78 |
| resnet152 |
100.20 |
| inception3 |
160.48 |
| inception4 |
65.22 |
| vgg16 |
156.34 |
| alexnet |
3022.28 |
| ssd300 |
127.33 |
Hardware / Software
| Component |
Version |
| Distro |
Ubuntu 18.04.1 |
| Kernel |
4.18.5 x86_64 |
| GPU / Compute Capacity |
NVIDIA GeForce GTX 1080 TI - 6.1 |
| Tensorflow |
v1.11.0 |
| NVIDIA |
410.57 |
| CUDA |
10.0.130_410.48 |
| cuDNN |
7.3.0.29 |
| NCCL |
2.3.5 |
| GCC Ubuntu |
6.4.0-17ubuntu1 |
| Python |
3.6.6 |
| Bazel |
0.16.1 |