Journal of Real Time Image Processing, Volume 17, Issue 5, Pages 1487-1503 , 01/10/2020

Speeding up inference on deep neural networks for object detection by performing partial convolution

Wattanapong Kurdthongmee

Abstract

Real-time object detection is an expected application of deep neural networks (DNNs). It can be achieved by employing graphic processing units (GPUs) or dedicated hardware accelerators. Alternatively, in this work, we present a software scheme to accelerate the inference stage of DNNs designed for object detection. The scheme relies on partial processing within the consecutive convolution layers of a DNN. It makes use of different relationships between the locations of the components of an input feature, an intermediate feature representation, and an output feature to effectively identify the modified components. This downsizes the matrix multiplicand to cover only those modified components. Therefore, matrix multiplication is accelerated within a convolution layer. In addition, the aforementioned relationships can also be employed to signal the next consecutive convolution layer regarding the modified components. This further helps reduce the overhead of the comparison on a member-by-member basis to identify the modified components. The proposed scheme has been experimentally benchmarked against a similar concept approach, namely, CBinfer, and against the original Darknet on the Tiny-You Only Look Once network. The experiments were conducted on a personal computer with dual CPU running at 3.5 GHz without GPU acceleration upon video data sets from YouTube. The results show that improvement ratios of 1.56 and 13.10 in terms of detection frame rate over CBinfer and Darknet, respectively, are attainable on average. Our scheme was also extended to exploit GPU-assisted acceleration. The experimental results of NVIDIA Jetson TX2 reached a detection frame rate of 28.12 frames per second (1.25× with respect to CBinfer). The accuracy of detection of all experiments was preserved at 90% of the original Darknet.

Document Type

Article

Source Type

Journal

Keywords

ConvolutionDeep neural networksDNNs object detectionInference acceleration

ASJC Subject Area

Computer Science : Information Systems

Funding Agency

Thailand Research Fund


Bibliography


& Kurdthongmee, W. (2020). Speeding up inference on deep neural networks for object detection by performing partial convolution. Journal of Real Time Image Processing, 17(5) 1487-1503. doi:10.1007/s11554-019-00906-6

Copy | Save