跳到主要导航 跳到搜索 跳到主要内容

Towards Harmonized Uncertainty Estimation for Large Language Models

  • Rui Li
  • , Jing Long
  • , Muge Qi
  • , Heming Xia
  • , Lei Sha
  • , Peiyi Wang
  • , Zhifang Sui*
  • *此作品的通讯作者
  • Peking University
  • Hong Kong Polytechnic University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

To facilitate robust and trustworthy deployment of large language models (LLMs), it is essential to quantify the reliability of their generations through uncertainty estimation. While recent efforts have made significant advancements by leveraging the internal logic and linguistic features of LLMs to estimate uncertainty scores, our empirical analysis highlights the pitfalls of these methods to strike a harmonized estimation between indication, balance, and calibration, which hinders their broader capability for accurate uncertainty estimation. To address this challenge, we propose CUE (Corrector for Uncertainty Estimation): A straightforward yet effective method that employs a lightweight model trained on data aligned with the target LLM's performance to adjust uncertainty scores. Comprehensive experiments across diverse models and tasks demonstrate its effectiveness, which achieves consistent improvements of up to 60% over existing methods. Resources are available at https://github.com/O-L1RU1/Corrector4UE.

源语言英语
主期刊名Long Papers
编辑Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
出版商Association for Computational Linguistics (ACL)
22938-22953
页数16
ISBN(电子版)9798891762510
DOI
出版状态已出版 - 2025
活动63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025 - Vienna, 奥地利
期限: 27 7月 20251 8月 2025

出版系列

姓名Proceedings of the Annual Meeting of the Association for Computational Linguistics
1
ISSN(印刷版)0736-587X

会议

会议63rd Annual Meeting of the Association for Computational Linguistics, ACL 2025
国家/地区奥地利
Vienna
时期27/07/251/08/25

指纹

探究 'Towards Harmonized Uncertainty Estimation for Large Language Models' 的科研主题。它们共同构成独一无二的指纹。

引用此