跳到主要导航 跳到搜索 跳到主要内容

Rotated binary neural network

  • Mingbao Lin
  • , Rongrong Ji*
  • , Zihan Xu
  • , Baochang Zhang
  • , Yan Wang
  • , Yongjian Wu
  • , Feiyue Huang
  • , Chia Wen Lin
  • *此作品的通讯作者
  • Xiamen University
  • Peng Cheng Laboratory
  • Pinterest
  • Tencent
  • National Tsing Hua University

科研成果: 期刊稿件会议文章同行评审

摘要

Binary Neural Network (BNN) shows its predominance in reducing the complexity of deep neural networks. However, it suffers severe performance degradation. One of the major impediments is the large quantization error between the full-precision weight vector and its binary vector. Previous works focus on compensating for the norm gap while leaving the angular bias hardly touched. In this paper, for the first time, we explore the influence of angular bias on the quantization error and then introduce a Rotated Binary Neural Network (RBNN), which considers the angle alignment between the full-precision weight vector and its binarized version. At the beginning of each training epoch, we propose to rotate the full-precision weight vector to its binary vector to reduce the angular bias. To avoid the high complexity of learning a large rotation matrix, we further introduce a bi-rotation formulation that learns two smaller rotation matrices. In the training stage, we devise an adjustable rotated weight vector for binarization to escape the potential local optimum. Our rotation leads to around 50% weight flips which maximize the information gain. Finally, we propose a training-aware approximation of the sign function for the gradient backward. Experiments on CIFAR-10 and ImageNet demonstrate the superiorities of RBNN over many state-of-the-arts. Our source code, experimental settings, training logs and binary models are available at https://github.com/lmbxmu/RBNN.

源语言英语
期刊Advances in Neural Information Processing Systems
2020-December
出版状态已出版 - 2020
活动34th Conference on Neural Information Processing Systems, NeurIPS 2020 - Virtual, Online
期限: 6 12月 202012 12月 2020

指纹

探究 'Rotated binary neural network' 的科研主题。它们共同构成独一无二的指纹。

引用此