Resource-Constrained Implementation and Optimization of a Deep Neural Network for Vehicle Classification
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Standard
Resource-Constrained Implementation and Optimization of a Deep Neural Network for Vehicle Classification. / Xie, Renjie; Huttunen, Heikki; Lin, Shuoxin; Bhattacharyya, Shuvra; Takala, Jarmo.
2016 24th European Signal Processing Conference (EUSIPCO). IEEE, 2016. p. 1862-1866.Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review
Harvard
APA
Vancouver
Author
Bibtex - Download
}
RIS (suitable for import to EndNote) - Download
TY - GEN
T1 - Resource-Constrained Implementation and Optimization of a Deep Neural Network for Vehicle Classification
AU - Xie, Renjie
AU - Huttunen, Heikki
AU - Lin, Shuoxin
AU - Bhattacharyya, Shuvra
AU - Takala, Jarmo
N1 - INT=tie,"Xie, Renjie"
PY - 2016/9
Y1 - 2016/9
N2 - Deep learning has attracted great research interest in recent years in many signal processing application areas. However, investigation of deep learning implementations in highly resource-constrained contexts has been relatively unexplored due to the large computational requirements involved. In this paper, we investigate the implementation of a deep learning application for vehicle classification on multicore platforms with limited numbers of available processor cores. We apply model-based design methods based on signal processing oriented dataflow models of computation, and using the resulting dataflow representations, we apply various design optimizations to derive efficient implementations on three different multicore platforms. Using model-based design techniques throughout the design process, we demonstrate the ability to flexibly experiment with optimizing design transformations, and alternative multicore target platforms to achieve efficient implementations that are tailored to the resource constraints of these platforms.
AB - Deep learning has attracted great research interest in recent years in many signal processing application areas. However, investigation of deep learning implementations in highly resource-constrained contexts has been relatively unexplored due to the large computational requirements involved. In this paper, we investigate the implementation of a deep learning application for vehicle classification on multicore platforms with limited numbers of available processor cores. We apply model-based design methods based on signal processing oriented dataflow models of computation, and using the resulting dataflow representations, we apply various design optimizations to derive efficient implementations on three different multicore platforms. Using model-based design techniques throughout the design process, we demonstrate the ability to flexibly experiment with optimizing design transformations, and alternative multicore target platforms to achieve efficient implementations that are tailored to the resource constraints of these platforms.
M3 - Conference contribution
SP - 1862
EP - 1866
BT - 2016 24th European Signal Processing Conference (EUSIPCO)
PB - IEEE
ER -