Abstract
Deep convolutional neural networks (CNN) have shown their good performances in many computer vision tasks. However, the high computational complexity of CNN involves a huge amount of data movements between the computational processor core and memory hierarchy which occupies the major of the power consumption. This paper presents Chain-NN, a novel energy-efficient 1D chain architecture for accelerating deep CNNs. Chain-NN consists of the dedicated dual-channel process engines (PE). In Chain-NN, convolutions are done by the 1D systolic primitives composed of a group of adjacent PEs. These systolic primitives, together with the proposed column-wise scan input pattern, can fully reuse input operand to reduce the memory bandwidth requirement for energy saving. Moreover, the 1D chain architecture allows the systolic primitives to be easily reconfigured according to specific CNN parameters with fewer design complexity. The synthesis and layout of Chain-NN is under TSMC 28nm process. It costs 3751k logic gates and 352KB on-chip memory. The results show a 576-PE Chain-NN can be scaled up to 700MHz. This achieves a peak throughput of 806.4GOPS with 567.5mW and is able to accelerate the five convolutional layers in AlexNet at a frame rate of 326.2fps. 1421.0GOPS/W power efficiency is at least 2.5 to 4.1x times better than the state-of-the-art works.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2017 Design, Automation and Test in Europe, DATE 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1032-1037 |
Number of pages | 6 |
ISBN (Electronic) | 9783981537093 |
DOIs | |
Publication status | Published - 2017 May 11 |
Event | 20th Design, Automation and Test in Europe, DATE 2017 - Swisstech, Lausanne, Switzerland Duration: 2017 Mar 27 → 2017 Mar 31 |
Other
Other | 20th Design, Automation and Test in Europe, DATE 2017 |
---|---|
Country/Territory | Switzerland |
City | Swisstech, Lausanne |
Period | 17/3/27 → 17/3/31 |
Keywords
- Accelerator
- ASIC
- CNN
- Convolutional neural networks
- Memory bandwidth
- Power efficiency
ASJC Scopus subject areas
- Computer Networks and Communications
- Hardware and Architecture
- Safety, Risk, Reliability and Quality