TY - GEN
T1 - Deep Learning Video Analytics on Edge Computing Devices
AU - Tan, Tianxiang
AU - Cao, Guohong
N1 - Funding Information:
This work was supported in part by the National Science Foundation (NSF) under grant CNS-1815465.
Publisher Copyright:
© 2021 IEEE.
PY - 2021/7/6
Y1 - 2021/7/6
N2 - The rapid progress of deep learning-based techniques such as Convolutional Neural Network (CNN) has enabled many emerging applications related to video analytics and running them on mobile devices can help improve our daily lives in many ways. However, there are many challenges for video analytics on mobile devices using multiple CNN models. CNN models are resource hungry, and each model requires a large amount of computational power and occupies a large portion of memory space. Although video processing can be offloaded to reduce the computation time, transmitting large amount of video data is time consuming. Thus, offloading is not always the best option. Moreover, different CNN models have different memory usage and processing time, making the scheduling problem more complex. As a result, besides deciding which task to be offloaded, we must decide which CNN model should reside in the memory and for how long, and which CNN model should be switched out due to memory constraint. In this paper, we propose resource aware scheduling algorithms to address these challenges. We identify the task scheduling problem for running multiple CNN models on mobile devices under resource constraints and formulate it as an integer programming problem. We propose resource-Aware scheduling algorithms which combine offloading and local processing methods to minimize the completion time of video processing. We implement the proposed scheduling algorithms on Android-based smartphones and demonstrate its effectiveness through extensive experiments.
AB - The rapid progress of deep learning-based techniques such as Convolutional Neural Network (CNN) has enabled many emerging applications related to video analytics and running them on mobile devices can help improve our daily lives in many ways. However, there are many challenges for video analytics on mobile devices using multiple CNN models. CNN models are resource hungry, and each model requires a large amount of computational power and occupies a large portion of memory space. Although video processing can be offloaded to reduce the computation time, transmitting large amount of video data is time consuming. Thus, offloading is not always the best option. Moreover, different CNN models have different memory usage and processing time, making the scheduling problem more complex. As a result, besides deciding which task to be offloaded, we must decide which CNN model should reside in the memory and for how long, and which CNN model should be switched out due to memory constraint. In this paper, we propose resource aware scheduling algorithms to address these challenges. We identify the task scheduling problem for running multiple CNN models on mobile devices under resource constraints and formulate it as an integer programming problem. We propose resource-Aware scheduling algorithms which combine offloading and local processing methods to minimize the completion time of video processing. We implement the proposed scheduling algorithms on Android-based smartphones and demonstrate its effectiveness through extensive experiments.
UR - http://www.scopus.com/inward/record.url?scp=85111742783&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85111742783&partnerID=8YFLogxK
U2 - 10.1109/SECON52354.2021.9491614
DO - 10.1109/SECON52354.2021.9491614
M3 - Conference contribution
AN - SCOPUS:85111742783
T3 - Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks workshops
BT - 2021 18th IEEE International Conference on Sensing, Communication and Networking, SECON 2021
PB - IEEE Computer Society
T2 - 18th IEEE International Conference on Sensing, Communication and Networking, SECON 2021
Y2 - 6 July 2021 through 9 July 2021
ER -