DOI: 10.1145/3061639.3072944

Accelerator Design for Deep Learning Training: Extended Abstract: Invited

Published: 18 June 2017

ABSTRACT

Deep Neural Networks (DNNs) have emerged as a powerful and versatile set of techniques, showing successes on challenging artificial intelligence (AI) problems. Applications in domains such as image/video processing, autonomous cars, natural language processing, speech synthesis and recognition, genomics, and many others have embraced deep learning as their foundation. DNNs achieve superior accuracy for these applications at the cost of high computational complexity, using very large models that require hundreds of megabytes of data storage, exaops of computation, and high bandwidth for data movement. Despite these impressive advances, it still takes days to weeks to train state-of-the-art deep networks on large datasets, which directly limits the pace of innovation and adoption. In this paper, we present a multi-pronged approach to address the challenges of meeting both the throughput and the energy-efficiency goals for DNN training.
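To put the scale claims in the abstract in perspective, the short sketch below is a rough back-of-envelope estimate, not taken from the paper: the model size, per-image FLOP count, epoch count, and sustained accelerator throughput are all assumed, ResNet-50/ImageNet-class values chosen only for illustration.

```python
# Illustrative back-of-envelope estimate of DNN training cost.
# All numbers below are assumptions (ResNet-50/ImageNet-class), not figures from the paper.

params = 25e6                 # ~25M parameters (assumption)
bytes_per_param = 4           # FP32 weights
flops_fwd_per_image = 4e9     # ~4 GFLOPs for one forward pass (assumption)
flops_train_per_image = 3 * flops_fwd_per_image  # forward + backward ~ 3x forward (rule of thumb)

images = 1.28e6               # ImageNet-scale training set
epochs = 90                   # typical training schedule (assumption)

total_flops = flops_train_per_image * images * epochs
model_mb = params * bytes_per_param / 1e6

sustained_flops = 5e12        # assume one accelerator sustaining 5 TFLOP/s
seconds = total_flops / sustained_flops

print(f"Model size:       {model_mb:.0f} MB of weights")
print(f"Training compute: {total_flops/1e18:.2f} exaFLOPs")
print(f"Time on one device at 5 TFLOP/s sustained: {seconds/86400:.1f} days")
```

Under these assumptions the weights alone occupy about 100 MB, a single 90-epoch run costs roughly 1.4 exaFLOPs, and one accelerator sustaining 5 TFLOP/s would need about three days; larger models, larger datasets, or longer schedules push this into weeks, which is the training-time gap the paper's throughput and energy-efficiency goals target.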


  • Published in

    DAC '17: Proceedings of the 54th Annual Design Automation Conference 2017
    June 2017, 533 pages
    ISBN: 9781450349277
    DOI: 10.1145/3061639
    Copyright © 2017 ACM


    Publisher

    Association for Computing Machinery, New York, NY, United States



    Acceptance Rates

    Overall Acceptance Rate: 1,770 of 5,499 submissions, 32%

