Benchmarking TPU, GPU, and CPU Platforms for Deep Learning

07/24/2019
by   Gu-Yeon Wei, et al.
31

Training deep learning models is compute-intensive and there is an industry-wide trend towards hardware specialization to improve performance. To systematically benchmark deep learning platforms, we introduce ParaDnn, a parameterized benchmark suite for deep learning that generates end-to-end models for fully connected (FC), convolutional (CNN), and recurrent (RNN) neural networks. Along with six real-world models, we benchmark Google's Cloud TPU v2/v3, NVIDIA's V100 GPU, and an Intel Skylake CPU platform. We take a deep dive into TPU architecture, reveal its bottlenecks, and highlight valuable lessons learned for future specialized system design. We also provide a thorough comparison of the platforms and find that each has unique strengths for some types of models. Finally, we quantify the rapid performance improvements that specialized software stacks provide for the TPU and GPU platforms.

READ FULL TEXT

page 5

page 8

research
09/11/2018

Comparing Computing Platforms for Deep Learning on a Humanoid Robot

The goal of this study is to test two different computing platforms with...
research
09/15/2019

Performance and Power Evaluation of AI Accelerators for Training Deep Learning Models

Deep neural networks (DNNs) have become widely used in many AI applicati...
research
04/16/2015

Caffe con Troll: Shallow Ideas to Speed Up Deep Learning

We present Caffe con Troll (CcT), a fully compatible end-to-end version ...
research
11/09/2017

Performance Evaluation of Deep Learning Tools in Docker Containers

With the success of deep learning techniques in a broad range of applica...
research
07/25/2022

An Empirical Deep Dive into Deep Learning's Driving Dynamics

We present an empirical dataset surveying the deep learning phenomenon o...
research
06/21/2023

Edge Devices Inference Performance Comparison

In this work, we investigate the inference time of the MobileNet family,...
research
02/27/2023

Predicting the Performance of a Computing System with Deep Networks

Predicting the performance and energy consumption of computing hardware ...

Please sign up or login with your details

Forgot password? Click here to reset