Hardware-aware Softmax Approximation for Deep Neural Networks

Page view(s)

Checked on

Please use this identifier to cite or link to this item: https://astaroar.ripplewerkz.co/communities-collections/articles/14257

Title:

Hardware-aware Softmax Approximation for Deep Neural Networks

Journal Title:

Asian Conference on Computer Vision 2018

DOI:

OA Status:

Publication URL:

Authors:

Xue Geng, Jie Lin, Bin Zhao, Anmin Kong, Mohamed M. Sabry Aly, Vijay Chandrasekhar

Keywords:

Softmax

Publication Date:

02 December 2018

Citation:

Abstract:

There has been a rapid development of custom hardware for accelerating the inference speed of deep neural networks (DNNs), by explicitly incorporating hardware metrics (e.g., area and energy) as additional constraints, in addition to application accuracy. Recent efforts mainly focused on linear functions (matrix multiplication) in convolutional (Conv) or fully connected (FC) layers, while there is no publicly available study on optimizing the inference of non-linear functions in DNNs, with hardware constraints. In this paper, we address the problem of cost-efficient inference for Softmax, a popular non-linear function in DNNs. We introduce a hardware-aware linear approximation framework by algorithm and hardware co-optimization, with the goal of minimizing the cost in terms of area and energy, without incurring significant loss in application accuracy. This is achieved by simultaneously reducing the operand bit-width and approximating cost-intensive operations in Softmax (e.g. exponential and division) with cost-effective operations (e.g. addition and bit shifts). We designed and synthesized a hardware unit for our approximation approach, to estimate the area and energy consumption. In addition, we introduce a training method to further save area and energy cost, by reduced precision. Our approach reduces area cost by 13× and energy consumption by 2× with 11-bit operand width, compared to baseline at 19-bit for VOC2007 dataset in Faster R-CNN.

License type:

PublisherCopyrights

Funding Info:

Hardware-Software Co-optimization for Deep Learning (Project No.A1892b0026)

Description:

URI:

https://astaroar.ripplewerkz.co/communities-collections/articles/14257

ISBN:

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

File	Size	Format	Action
0421.pdf	499.45 KB	PDF	Open