TY - GEN
T1 - Building your private cloud storage on public cloud service using embedded GPUs
AU - Cheng, Wangzhao
AU - Zheng, Fangyu
AU - Pan, Wuqiong
AU - Lin, Jingqiang
AU - Li, Huorong
AU - Li, Bingyu
N1 - Publisher Copyright:
© ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2018.
PY - 2018
Y1 - 2018
N2 - When the public cloud provides infrastructure as a service (IaaS), the customer can outsource its data to the public cloud and release itself from the burden of storing data locally. At this point, the customer can not guarantee the security of the data in the public cloud. Encrypting data before using cloud storage is a viable solution, but frequent data encryption operations cause the original limited local computing resources to be even more stretched. In this paper, we used Jetson TX1 to build a client-side data encryption device that proxies the customer’s data encryption and decryption operations. Firstly, a GPU-based SM4 implementation is carefully scheduled in the integrated GPU on Jetson TX1, including instruction-level optimization and variable improvement for data arrangement. Secondly, using zero-copy access on the device, we reduce the impact of explicit data transfer operations on overall performance. Finally, our SM4 kernel is capable of encrypting data at 30.30 Gbps on Jetson TX1, it is 26.6 times faster than the CPU-based implementation on the same platform. Furthermore, data processing throughput of the device reaches 30.19Gbps, a single Jetson TX1 owns sufficiently redundant computational power for the customer in 10 Gigabit fiber network environment.
AB - When the public cloud provides infrastructure as a service (IaaS), the customer can outsource its data to the public cloud and release itself from the burden of storing data locally. At this point, the customer can not guarantee the security of the data in the public cloud. Encrypting data before using cloud storage is a viable solution, but frequent data encryption operations cause the original limited local computing resources to be even more stretched. In this paper, we used Jetson TX1 to build a client-side data encryption device that proxies the customer’s data encryption and decryption operations. Firstly, a GPU-based SM4 implementation is carefully scheduled in the integrated GPU on Jetson TX1, including instruction-level optimization and variable improvement for data arrangement. Secondly, using zero-copy access on the device, we reduce the impact of explicit data transfer operations on overall performance. Finally, our SM4 kernel is capable of encrypting data at 30.30 Gbps on Jetson TX1, it is 26.6 times faster than the CPU-based implementation on the same platform. Furthermore, data processing throughput of the device reaches 30.19Gbps, a single Jetson TX1 owns sufficiently redundant computational power for the customer in 10 Gigabit fiber network environment.
KW - CUDA
KW - Jetson TX1
KW - SM4 implementation
KW - Symmetric cryptographic algorithm
KW - Virtual private cloud storage
UR - https://www.scopus.com/pages/publications/85059687945
U2 - 10.1007/978-3-030-01701-9_28
DO - 10.1007/978-3-030-01701-9_28
M3 - 会议稿件
AN - SCOPUS:85059687945
SN - 9783030017002
T3 - Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
SP - 512
EP - 528
BT - Security and Privacy in Communication Networks - 14th International Conference, SecureComm 2018, Proceedings
A2 - Li, Yingjiu
A2 - Chang, Bing
A2 - Zhu, Sencun
A2 - Beyah, Raheem
PB - Springer Verlag
T2 - 14th International EAI Conference on Security and Privacy in Communication Networks, SecureComm 2018
Y2 - 8 August 2018 through 10 August 2018
ER -