Skip to main navigation Skip to search Skip to main content

Fine-Grained Controllable Generation of Latent Language Diffusion Models

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Controllable generation is a crucial technique to provide generated content that meet specific demands in practical applications. An emerging trend for this is utilizing latent variables to represent those demands and guide the generation process, such as with diffusion models. While diffusion models have made significant strides in the realm of image synthesis, applications of diffusion models in text generation remain limited. Specifically, current control signals applied to diffusion language models, such as fixed class labels and abstract syntax trees, are inadequate to capture arbitrary user intents or too abstract for most users to understand and manipulate, making it challenging to meet varied user needs. To tackle this challenge, we propose methods to automatically extract and evaluate fine-grained information in forms of keywords from raw text; on that basis, we encode those keywords into latent variables, and steer latent language models to generate text conditioned on those keywords. To ensure better generalization ability under diverse keywords, we regularize the keyword latent variables through von Mises-Fisher kernel and Principal Component Analysis. Through experiments we demonstrate that fine-grained information in the form of keywords can effectively enhance the generation quality of corresponding text, as well as the effectiveness of our keywords evaluation and regularization methods.

Original languageEnglish
Title of host publicationNeural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
EditorsMufti Mahmud, Maryam Doborjeh, Kevin Wong, Andrew Chi Sing Leung, Zohreh Doborjeh, M. Tanveer
PublisherSpringer Science and Business Media Deutschland GmbH
Pages254-267
Number of pages14
ISBN (Print)9789819670291
DOIs
StatePublished - 2025
Event31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
Duration: 2 Dec 20246 Dec 2024

Publication series

NameCommunications in Computer and Information Science
Volume2295 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
Country/TerritoryNew Zealand
CityAuckland
Period2/12/246/12/24

Keywords

  • Controllable generation
  • Diffusion models
  • Stable diffusion

Fingerprint

Dive into the research topics of 'Fine-Grained Controllable Generation of Latent Language Diffusion Models'. Together they form a unique fingerprint.

Cite this