Systolic Array Based Accelerator and Algorithm Mapping for Deep Learning Algorithms

Zhijie Yang; Lei Wang; Dong Ding; Xiangyu Zhang; Yu Deng; Shiming Li; Qiang Dou

doi:10.1007/978-3-030-05677-3_16

Conference paper. Year: 2018

Zhijie Yang

Function : Author

College of Computer Science [Changsha]

Lei Wang

Function : Author
PersonId : 1053385

College of Computer Science [Changsha]

Dong Ding

Function : Author
PersonId : 796947
ORCID : 0000-0003-2623-5046

College of Computer Science [Changsha]

Xiangyu Zhang

Function : Author

College of Computer Science [Changsha]

Yu Deng

Function : Author

College of Computer Science [Changsha]

Shiming Li

Function : Author

College of Computer Science [Changsha]

Qiang Dou

Function : Author

College of Computer Science [Changsha]

Abstract

As DNNs grow deeper, DNN computation places increasing demands on the storage capacity and computing power of the underlying platform. In this work, we implement an accelerator on FPGA for deep learning algorithms (CNN and RNN). The core computing module of the accelerator is a 32 × 32 systolic array of processing elements (PEs). We propose a mapping method for CNN and RNN algorithms of variable size. The experimental results show that the maximum power consumption of the accelerator is 7.5 W at 100 MHz, the peak performance is 0.2 Tops/s, and the measured performance is 7.6 Mops at 100 MHz when running the first layer of LeNet-5.
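The peak figure quoted in the abstract can be cross-checked with a rough calculation, and the systolic dataflow can be illustrated with a small simulation. The sketch below is illustrative only and is not the authors' design: it assumes that each PE completes one multiply-accumulate per cycle counted as two operations, and it assumes an output-stationary dataflow with skewed operand arrival. Neither assumption is stated in this record. The function names `peak_ops_per_second` and `systolic_matmul` are hypothetical.

```python
def peak_ops_per_second(rows, cols, clock_hz, ops_per_mac=2):
    """Peak throughput of a rows x cols PE array.

    Assumption (not from the source): every PE finishes one
    multiply-accumulate per cycle, counted as ops_per_mac operations.
    """
    return rows * cols * ops_per_mac * clock_hz


def systolic_matmul(a, b):
    """Cycle-level model of an output-stationary systolic array.

    Assumed dataflow: row i of `a` enters from the left delayed by i
    cycles, column j of `b` enters from the top delayed by j cycles,
    so PE(i, j) sees the operand pair with index s = t - i - j at
    cycle t and accumulates their product locally.
    Returns the product matrix and the number of cycles used.
    """
    n, k, m = len(a), len(b), len(b[0])
    acc = [[0] * m for _ in range(n)]
    # The last PE (n-1, m-1) receives its last operand pair at
    # cycle (k-1) + (n-1) + (m-1), hence n + m + k - 2 cycles in total.
    cycles = n + m + k - 2
    for t in range(cycles):
        for i in range(n):
            for j in range(m):
                s = t - i - j
                if 0 <= s < k:  # a valid operand pair has reached this PE
                    acc[i][j] += a[i][s] * b[s][j]
    return acc, cycles


if __name__ == "__main__":
    # 32 x 32 PEs at 100 MHz, 2 ops per MAC (assumed)
    print(peak_ops_per_second(32, 32, 100_000_000))
    print(systolic_matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

Under these assumptions, 32 × 32 PEs at 100 MHz give 204.8 × 10^9 operations per second, which is consistent with the roughly 0.2 Tops/s peak stated in the abstract.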

Keywords

Accelerator; Systolic array; DNN; Data mapping

Domains

Computer Science [cs]

Main file

477597_1_En_16_Chapter.pdf (489)

Origin : Files produced by the author(s)

https://inria.hal.science/hal-02279547

Submitted on : Thursday, September 5, 2019-1:30:54 PM

Last modification on : Thursday, September 5, 2019-1:35:35 PM

Long-term archiving on : Thursday, February 6, 2020-1:43:30 AM

Dates and versions

hal-02279547 , version 1 (05-09-2019)

Licence

Attribution

Identifiers

HAL Id : hal-02279547 , version 1
DOI : 10.1007/978-3-030-05677-3_16

Cite

Zhijie Yang, Lei Wang, Dong Ding, Xiangyu Zhang, Yu Deng, et al. Systolic Array Based Accelerator and Algorithm Mapping for Deep Learning Algorithms. 15th IFIP International Conference on Network and Parallel Computing (NPC), Nov 2018, Muroran, Japan. pp.153-158, ⟨10.1007/978-3-030-05677-3_16⟩. ⟨hal-02279547⟩

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC10 IFIP-NPC IFIP-WG10-3 IFIP-LNCS-11276
