DCP–NAS: Discrepant Child–Parent Neural Architecture Search for 1-bit CNNs

  • Beihang University
  • Zhongguancun Laboratory
  • Nanchang Institute of Technology
  • Universal Ubiquitous Co.

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Neural architecture search (NAS) is an effective approach for many tasks because it generates an application-adaptive neural architecture, but it is still challenged by high computational cost and memory consumption. At the same time, 1-bit convolutional neural networks (CNNs) with binary weights and activations show potential for resource-limited embedded devices. One natural approach is to use 1-bit CNNs to reduce the computation and memory cost of NAS by combining the strengths of both in a unified framework; however, searching 1-bit CNNs is more challenging because the processes involved are more complicated. In this paper, we introduce Discrepant Child–Parent Neural Architecture Search (DCP–NAS) to efficiently search 1-bit CNNs, based on a new framework that searches the 1-bit model (Child) under the supervision of a real-valued model (Parent). Specifically, we first use a Parent model to calculate a tangent direction, based on which the tangent propagation method is introduced to search the optimized 1-bit Child. We further observe a coupling relationship between the weights and the architecture parameters in such differentiable frameworks. To address this issue, we propose a decoupled optimization method to search for an optimized architecture. Extensive experiments demonstrate that DCP–NAS achieves much better results than prior art on both the CIFAR-10 and ImageNet datasets. In particular, the backbones found by DCP–NAS achieve strong generalization performance on person re-identification and object detection.
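
As a rough illustration of what "binary weights and activations" means in a 1-bit CNN, the sketch below binarizes real values to {-1, +1} with a sign function and computes a dot product by counting matching positions, which is the XNOR-and-popcount identity commonly used for 1-bit layers. It is not the DCP–NAS method and is not taken from the paper; the function names `binarize` and `binary_dot` are hypothetical, and mapping zero to +1 is an assumption made here for definiteness.

```python
def binarize(values):
    """Map real values to {-1, +1} with a sign function.

    Assumption for this sketch: zero is mapped to +1.
    """
    return [1 if v >= 0 else -1 for v in values]


def binary_dot(a_bits, w_bits):
    """Dot product of two equal-length {-1, +1} vectors.

    Each matching position contributes +1 and each mismatch -1, so the
    result equals 2 * matches - n. Counting matches corresponds to an
    XNOR followed by a popcount on bit-packed hardware.
    """
    if len(a_bits) != len(w_bits):
        raise ValueError("vectors must have the same length")
    matches = sum(1 for a, w in zip(a_bits, w_bits) if a == w)
    return 2 * matches - len(a_bits)


# Hypothetical real-valued activations and weights, for illustration only.
activations = [0.7, -1.3, 0.0, 2.1]
weights = [0.2, 0.9, -0.4, 1.5]

a_b = binarize(activations)
w_b = binarize(weights)

# The match-counting form agrees with the ordinary dot product.
assert binary_dot(a_b, w_b) == sum(a * w for a, w in zip(a_b, w_b))
```

Because every operand is a single bit, the multiply-accumulate of a real-valued layer reduces to bitwise operations and a count, which is where the computation and memory savings mentioned in the abstract come from.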

Original language: English
Pages (from-to): 2793-2815
Number of pages: 23
Journal: International Journal of Computer Vision
Volume: 131
Issue number: 11
DOI
Publication status: Published - Nov 2023
