ES-MPQ: Evolutionary Search Enabled Mixed Precision Quantization Framework for Computing-in-Memory

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Network quantization can effectively reduce the complexity without changing the network structures, which is conducive to deploying deep neural networks (DNN) on edge devices. However, most of the existing methods set the quantization precision manually and rarely consider the case that the computing array is limited, such as computing-in-memory (CIM). In this paper, we introduce a novel method named ES-MPQ, which employs evolutionary search to achieve mixed precision quantization with a small calibration dataset. The ES-MPQ can optimize multiple objectives to achieve better hardware efficiency. The experimental results for ResNet-18 on CIFAR-10 show that the proposed ES-MPQ can reduce the parameter size and energy consumption by up to 1.89x and 2.81x, respectively, compared with the fixed bit-width (8 bits) quantization, while losing only 0.59% accuracy.

Original languageEnglish
Title of host publicationProceedings - 2023 12th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages38-43
Number of pages6
ISBN (Electronic)9798350344967
DOIs
StatePublished - 2023
Event12th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2023 - Niigata, Japan
Duration: 30 Aug 20231 Sep 2023

Publication series

NameProceedings - 2023 12th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2023

Conference

Conference12th IEEE Non-Volatile Memory Systems and Applications Symposium, NVMSA 2023
Country/TerritoryJapan
CityNiigata
Period30/08/231/09/23

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Mixed precision quantization
  • computing-in-memory
  • evolutionary search

Fingerprint

Dive into the research topics of 'ES-MPQ: Evolutionary Search Enabled Mixed Precision Quantization Framework for Computing-in-Memory'. Together they form a unique fingerprint.

Cite this