TY - GEN
T1 - Research on the accuracy of single precision on graphics processing unit
AU - Yuan, Tao
AU - Mingfa, Zhu
AU - Limin, Xiao
AU - Li, Ruan
AU - Guan, Dongyi
AU - Chen, Siming
AU - Yi, Ding
PY - 2011
Y1 - 2011
N2 - The single precision in the computer is composed of two parts: the mantissa and the exponent. which are expressed by the limited binary bits. During adding on the single precision, the smaller one should shift to line up the decimal points, If the mantissa of the smaller one exceeds the range of registers, truncating or rounding off will be executed and cause losing precision. As far as the serials of GTX200 is concerned, the length of the register storing intermediate results is the same as the one for the final results. The accuracy problem is very prominent while the length of the mantissa exceeds the range of register after lining up the decimal points during the single precision adding. In this paper, we use the partial sum algorithm to improve the accuracy of single precision adding, and verify the correctness of the algorithm from the perspective of experiment by means of the matrix multiplication. Finally, we analyze the effect of partial sum algorithm on compute peak of the GPU and come to the conclusion that the partial sum algorithm has little influence on the compute peak of the GPU.
AB - The single precision in the computer is composed of two parts: the mantissa and the exponent. which are expressed by the limited binary bits. During adding on the single precision, the smaller one should shift to line up the decimal points, If the mantissa of the smaller one exceeds the range of registers, truncating or rounding off will be executed and cause losing precision. As far as the serials of GTX200 is concerned, the length of the register storing intermediate results is the same as the one for the final results. The accuracy problem is very prominent while the length of the mantissa exceeds the range of register after lining up the decimal points during the single precision adding. In this paper, we use the partial sum algorithm to improve the accuracy of single precision adding, and verify the correctness of the algorithm from the perspective of experiment by means of the matrix multiplication. Finally, we analyze the effect of partial sum algorithm on compute peak of the GPU and come to the conclusion that the partial sum algorithm has little influence on the compute peak of the GPU.
KW - Compute accuracy
KW - Partial summation
KW - Single precision float
UR - https://www.scopus.com/pages/publications/80051883644
U2 - 10.1109/CMSP.2011.103
DO - 10.1109/CMSP.2011.103
M3 - 会议稿件
AN - SCOPUS:80051883644
SN - 9780769543567
T3 - Proceedings - 2011 International Conference on Multimedia and Signal Processing, CMSP 2011
SP - 57
EP - 60
BT - Proceedings - 2011 International Conference on Multimedia and Signal Processing, CMSP 2011
T2 - 2011 International Conference on Multimedia and Signal Processing, CMSP 2011
Y2 - 14 May 2011 through 15 May 2011
ER -