
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment

  • Jianfei Zhang
  • Jun Bai
  • Bei Li
  • Yanmeng Wang
  • Rumei Li
  • Chenghua Lin
  • Wenge Rong
  • Beihang University
  • Beijing Institute for GAI (BIGAI)
  • Meituan
  • Ping An Technology
  • University of Manchester

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

Aligning Large Language Models (LLMs) with general human preferences has proven crucial for improving the quality of interaction between LLMs and humans. However, human values inherently differ across individuals, so aligning LLMs solely with general preferences is insufficient. Personalizing LLMs according to individual feedback is a promising solution, but it poses challenges for the efficiency of alignment algorithms. In this work, we introduce a flexible paradigm for individual preference alignment. Our method fundamentally improves efficiency by disentangling preference representation from text generation in LLMs. We validate our approach across multiple text generation tasks and demonstrate that it achieves alignment quality on par with or better than PEFT-based methods, while reducing the additional training time for each new individual preference by 80% to 90% compared with those methods.

Original language: English
Title of host publication: Main Conference
Editors: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Publisher: Association for Computational Linguistics (ACL)
Pages: 4813-4839
Number of pages: 27
ISBN (Electronic): 9798891761964
State: Published - 2025
Event: 31st International Conference on Computational Linguistics, COLING 2025 - Abu Dhabi, United Arab Emirates
Duration: 19 Jan 2025 to 24 Jan 2025

Publication series

Name: Proceedings - International Conference on Computational Linguistics, COLING
ISSN (Print): 2951-2093

Conference

Conference: 31st International Conference on Computational Linguistics, COLING 2025
Country/Territory: United Arab Emirates
City: Abu Dhabi
Period: 19/01/25 to 24/01/25
