跳到主要导航 跳到搜索 跳到主要内容

Selecting text classification model through maximizing posterior evidence over informative sub-space

  • Zhiwei Sun
  • , Jun Bai*
  • , Zhuofan Chen
  • , Chen Li
  • , Wenge Rong
  • , Zhang Xiong
  • *此作品的通讯作者
  • Beihang University
  • Beijing Institute for General Artificial Intelligence (BIGAI)

科研成果: 期刊稿件文章同行评审

摘要

Text classification is a pivotal task in natural language understanding, and its performance has seen remarkable advancements with the rise of Pre-trained Language Models (PLMs). Recently, the proliferation of PLMs has made it increasingly challenging to choose the most suitable model for a given dataset. Since fine-tuning the sheer number of models is impractical, Transferability Estimation (TE) has become a promising solution to efficient model selection. Unlike current TE methods that focus solely on fixed and hard class assignments to evaluate the quality of model-encoded features, our approach further takes into account the inter-sample and inter-model variations represented by soft class assignments. We achieve this by utilizing class embeddings to predict posterior class assignments, with the logarithm of the maximum posterior evidence serving as the transferability score. Moreover, we found that the informative sub-space of the dataset can lead to more accurate calculation of soft class assignments, where we achieve efficient annotation of informative samples by eliciting the powerful judging ability of large language model. The resulting posterior evidence over the informative sub-space, LogIPE, enables us to capture subtle differences between models, enhancing the accuracy of model selection and validated by extensive experiments conducted on a wide range of text classification datasets as well as candidate PLMs.

源语言英语
文章编号1912377
期刊Frontiers of Computer Science
19
12
DOI
出版状态已出版 - 12月 2025

指纹

探究 'Selecting text classification model through maximizing posterior evidence over informative sub-space' 的科研主题。它们共同构成独一无二的指纹。

引用此