TY - GEN
T1 - CCIED
T2 - 22nd IEEE International Conference on High Performance Computing and Communications, 18th IEEE International Conference on Smart City and 6th IEEE International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020
AU - Hu, Chuanwen
AU - Bai, Yuebin
AU - Wang, Rui
AU - Liu, Chang
AU - Wang, Xiaolin
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12
Y1 - 2020/12
N2 - Recently, deep learning technology has shined in the fields of computer vision, natural language processing and speech recognition, and related products have sprung up like mushrooms. Due to the storage and calculation of deep neural network (DNN) models are relatively large and mobile edge devices are often resource-constrained, how to efficiently deploy DNN models on resource-constrained edge devices has attracted great attention from academia and industry. There's strength in numbers, so we propose CCIED, a framework which lets edge devices cooperate with each other to complete DNN inference tasks. Due to task inputs in the mobile edge computing scenarios usually have great similarities, the outputs of the middle layer of the neural network and the corresponding labels are cached. When a similar input already exists in the cache, the device does not need to perform the remaining calculations, but directly returns the cached results. One of the challenges of collaborative inference is that the communication overhead associated with transferring intermediate data can be significant. We therefore perform weight pruning only on the layer that obtains the intermediate results, which can greatly reduce the redundant parameters of the intermediate results, thereby reducing the time for transferring data between devices, and basically does not reduce the complexity of the model. Experimental results show that CCIED can efficiently deploy the DNN model on edge devices with almost no loss of precision, and can significantly reduce the total latency during cache hits.
AB - Recently, deep learning technology has shined in the fields of computer vision, natural language processing and speech recognition, and related products have sprung up like mushrooms. Due to the storage and calculation of deep neural network (DNN) models are relatively large and mobile edge devices are often resource-constrained, how to efficiently deploy DNN models on resource-constrained edge devices has attracted great attention from academia and industry. There's strength in numbers, so we propose CCIED, a framework which lets edge devices cooperate with each other to complete DNN inference tasks. Due to task inputs in the mobile edge computing scenarios usually have great similarities, the outputs of the middle layer of the neural network and the corresponding labels are cached. When a similar input already exists in the cache, the device does not need to perform the remaining calculations, but directly returns the cached results. One of the challenges of collaborative inference is that the communication overhead associated with transferring intermediate data can be significant. We therefore perform weight pruning only on the layer that obtains the intermediate results, which can greatly reduce the redundant parameters of the intermediate results, thereby reducing the time for transferring data between devices, and basically does not reduce the complexity of the model. Experimental results show that CCIED can efficiently deploy the DNN model on edge devices with almost no loss of precision, and can significantly reduce the total latency during cache hits.
KW - Cache.
KW - Collaborative inference
KW - Deep neural networks
KW - Edge computing
UR - https://www.scopus.com/pages/publications/85105290906
U2 - 10.1109/HPCC-SmartCity-DSS50907.2020.00086
DO - 10.1109/HPCC-SmartCity-DSS50907.2020.00086
M3 - 会议稿件
AN - SCOPUS:85105290906
T3 - Proceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020
SP - 668
EP - 673
BT - Proceedings - 2020 IEEE 22nd International Conference on High Performance Computing and Communications, IEEE 18th International Conference on Smart City and IEEE 6th International Conference on Data Science and Systems, HPCC-SmartCity-DSS 2020
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 14 December 2020 through 16 December 2020
ER -