TY - GEN
T1 - Fine-Grained Controllable Generation of Latent Language Diffusion Models
AU - Sun, Haozhe
AU - Zhang, Jianfei
AU - Li, Chen
AU - Ouyang, Yuanxin
AU - Rong, Wenge
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Controllable generation is a crucial technique to provide generated content that meet specific demands in practical applications. An emerging trend for this is utilizing latent variables to represent those demands and guide the generation process, such as with diffusion models. While diffusion models have made significant strides in the realm of image synthesis, applications of diffusion models in text generation remain limited. Specifically, current control signals applied to diffusion language models, such as fixed class labels and abstract syntax trees, are inadequate to capture arbitrary user intents or too abstract for most users to understand and manipulate, making it challenging to meet varied user needs. To tackle this challenge, we propose methods to automatically extract and evaluate fine-grained information in forms of keywords from raw text; on that basis, we encode those keywords into latent variables, and steer latent language models to generate text conditioned on those keywords. To ensure better generalization ability under diverse keywords, we regularize the keyword latent variables through von Mises-Fisher kernel and Principal Component Analysis. Through experiments we demonstrate that fine-grained information in the form of keywords can effectively enhance the generation quality of corresponding text, as well as the effectiveness of our keywords evaluation and regularization methods.
AB - Controllable generation is a crucial technique to provide generated content that meet specific demands in practical applications. An emerging trend for this is utilizing latent variables to represent those demands and guide the generation process, such as with diffusion models. While diffusion models have made significant strides in the realm of image synthesis, applications of diffusion models in text generation remain limited. Specifically, current control signals applied to diffusion language models, such as fixed class labels and abstract syntax trees, are inadequate to capture arbitrary user intents or too abstract for most users to understand and manipulate, making it challenging to meet varied user needs. To tackle this challenge, we propose methods to automatically extract and evaluate fine-grained information in forms of keywords from raw text; on that basis, we encode those keywords into latent variables, and steer latent language models to generate text conditioned on those keywords. To ensure better generalization ability under diverse keywords, we regularize the keyword latent variables through von Mises-Fisher kernel and Principal Component Analysis. Through experiments we demonstrate that fine-grained information in the form of keywords can effectively enhance the generation quality of corresponding text, as well as the effectiveness of our keywords evaluation and regularization methods.
KW - Controllable generation
KW - Diffusion models
KW - Stable diffusion
UR - https://www.scopus.com/pages/publications/105010008810
U2 - 10.1007/978-981-96-7030-7_18
DO - 10.1007/978-981-96-7030-7_18
M3 - 会议稿件
AN - SCOPUS:105010008810
SN - 9789819670291
T3 - Communications in Computer and Information Science
SP - 254
EP - 267
BT - Neural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
A2 - Mahmud, Mufti
A2 - Doborjeh, Maryam
A2 - Wong, Kevin
A2 - Leung, Andrew Chi Sing
A2 - Doborjeh, Zohreh
A2 - Tanveer, M.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 31st International Conference on Neural Information Processing, ICONIP 2024
Y2 - 2 December 2024 through 6 December 2024
ER -