跳到主要导航 跳到搜索 跳到主要内容

Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies

  • Beihang University
  • SenseTime Group Limited
  • Nanyang Technological University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Existing Binary Neural Networks (BNNs) mainly operate on local convolutions with binarization function. However, such simple bit operations lack the ability of modeling contextual dependencies, which is critical for learning discriminative deep representations in vision models. In this work, we tackle this issue by presenting new designs of binary neural modules, which enables BNNs to learn effective contextual dependencies. First, we propose a binary multi-layer perceptron (MLP) block as an alternative to binary convolution blocks to directly model contextual dependencies. Both short-range and long-range feature dependencies are modeled by binary MLPs, where the former provides local inductive bias and the latter breaks limited receptive field in binary convolutions. Second, to improve the robustness of binary models with contextual dependencies, we compute the contextual dynamic embeddings to determine the binarization thresholds in general binary convolutional blocks. Armed with our binary MLP blocks and improved binary convolution, we build the BNNs with explicit Contextual Dependency modeling, termed as BCDNet. On the standard ImageNet-1K classification benchmark, the BCDNet achieves 72.3% Top-1 accuracy and outperforms leading binary methods by a large margin. In particular, the proposed BCDNet exceeds the state-of-the-art ReActNet-A by 2.9% Top-1 accuracy with similar operations. Our code is available at https://github.com/Sense-GVT/BCDNet.

源语言英语
主期刊名Computer Vision – ECCV 2022 - 17th European Conference, Proceedings
编辑Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
出版商Springer Science and Business Media Deutschland GmbH
536-552
页数17
ISBN(印刷版)9783031200823
DOI
出版状态已出版 - 2022
活动17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, 以色列
期限: 23 10月 202227 10月 2022

出版系列

姓名Lecture Notes in Computer Science
13671 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议17th European Conference on Computer Vision, ECCV 2022
国家/地区以色列
Tel Aviv
时期23/10/2227/10/22

指纹

探究 'Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies' 的科研主题。它们共同构成独一无二的指纹。

引用此