跳到主要导航 跳到搜索 跳到主要内容

Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment

  • Jianfei Zhang
  • , Jun Bai
  • , Bei Li
  • , Yanmeng Wang
  • , Rumei Li
  • , Chenghua Lin
  • , Wenge Rong
  • Beihang University
  • Beijing Institute for GAI (BIGAI)
  • Meituan
  • Ping An Technology
  • University of Manchester

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Aligning Large Language Models (LLMs) with general human preferences has been proved crucial in improving the interaction quality between LLMs and human. However, human values are inherently diverse among different individuals, making it insufficient to align LLMs solely with general preferences. To address this, personalizing LLMs according to individual feedback emerges as a promising solution. Nonetheless, this approach presents challenges in terms of the efficiency of alignment algorithms. In this work, we introduce a flexible paradigm for individual preference alignment. Our method fundamentally improves efficiency by disentangling preference representation from text generation in LLMs. We validate our approach across multiple text generation tasks and demonstrate that it can produce aligned quality as well as or better than PEFT-based methods, while reducing additional training time for each new individual preference by 80% to 90% in comparison with them.

源语言英语
主期刊名Main Conference
编辑Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
出版商Association for Computational Linguistics (ACL)
4813-4839
页数27
ISBN(电子版)9798891761964
出版状态已出版 - 2025
活动31st International Conference on Computational Linguistics, COLING 2025 - Abu Dhabi, 阿拉伯联合酋长国
期限: 19 1月 202524 1月 2025

出版系列

姓名Proceedings - International Conference on Computational Linguistics, COLING
ISSN(印刷版)2951-2093

会议

会议31st International Conference on Computational Linguistics, COLING 2025
国家/地区阿拉伯联合酋长国
Abu Dhabi
时期19/01/2524/01/25

指纹

探究 'Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment' 的科研主题。它们共同构成独一无二的指纹。

引用此