跳到主要导航 跳到搜索 跳到主要内容

LMIE-BERT: A Learnable Method for Inter-Layer Ensembles to Accelerate Inference of BERT-Style Pre-trained Models

  • Weikai Qi*
  • , Xing Guo
  • , Haohua Du
  • *此作品的通讯作者
  • School of Computer Science and Technology, Anhui University
  • USTC-DEQING Alpha Innovation Institute

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Pre-trained models have brought tremendous accuracy improvements to Natural Language Processing(NLP) and Computer Vision tasks, but they suffer from slow inference speed due to the heavy model, which hinders their deployment in production. The early exit methods have been proposed to accelerate the inference speed of large pre-trained models. However, these methods will lose control of accuracy at higher speed ratios. In order to balance the trade-off between model speed and accuracy better, we propose a novel early-exit mechanism called LMIE-BERT. To achieve this, we introduce a learnable method for inter-layer ensemble strategy in the internal classifier, it trains the model to fit the information from both the previous and current layers, which enables the early exit method to get more robust results. The experimental results demonstrate that LMIE-BERT can maintain over 90% of the accuracy of the original model while achieving a 4× inference speed up in multiple tasks. Our method is ahead of other early exit methods in terms of model accuracy for the same speed ratio.

源语言英语
主期刊名Proceedings - 2023 9th International Conference on Big Data Computing and Communications, BigCom 2023
出版商Institute of Electrical and Electronics Engineers Inc.
271-277
页数7
ISBN(电子版)9798350331240
DOI
出版状态已出版 - 2023
活动9th International Conference on Big Data Computing and Communications, BigCom 2023 - Hainan, 中国
期限: 4 8月 20236 8月 2023

出版系列

姓名Proceedings - 2023 9th International Conference on Big Data Computing and Communications, BigCom 2023

会议

会议9th International Conference on Big Data Computing and Communications, BigCom 2023
国家/地区中国
Hainan
时期4/08/236/08/23

指纹

探究 'LMIE-BERT: A Learnable Method for Inter-Layer Ensembles to Accelerate Inference of BERT-Style Pre-trained Models' 的科研主题。它们共同构成独一无二的指纹。

引用此