Skip to main navigation Skip to search Skip to main content

Text-driven Physically Interpretable Face Editing

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a novel and physically interpretable method for face editing with arbitrary text prompts. Different from previous GAN-inversion editing methods that manipulate its latent space or diffusion methods conduct manipulation as a reverse process, we regard the face editing process as imposing vector flow fields on face images, representing the offset of spatial coordinates and color for each pixel. Under this paradigm, we represent the vector flow field in two ways: 1) explicitly represent the flow vectors with rasterized tensors, and 2) implicitly parameterize the flow vectors as continuous, smooth, and resolution-agnostic neural fields. The flow vectors are iteratively optimized under the guidance of the pre-trained CLIP model by maximizing the correlation between the edited image and the text prompt. We also propose a learning-based one-shot face editing framework, which is fast and adaptable to any text prompt input. Compared with SOTA text-driven face editing methods, our method can generate physically interpretable face editing results with high identity consistency and image quality.

Original languageEnglish
Title of host publicationIEEE International Conference on Multimedia and Expo Workshops
Subtitle of host publicationJourney to the Center of Machine Imagination, ICMEW 2025 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331587437
DOIs
StatePublished - 2025
Event2025 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2025 - Nantes, France
Duration: 30 Jun 20254 Jul 2025

Publication series

NameIEEE International Conference on Multimedia and Expo Workshops: Journey to the Center of Machine Imagination, ICMEW 2025 - Proceedings

Conference

Conference2025 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2025
Country/TerritoryFrance
CityNantes
Period30/06/254/07/25

Keywords

  • face editing
  • physically interpretable
  • text-driven

Fingerprint

Dive into the research topics of 'Text-driven Physically Interpretable Face Editing'. Together they form a unique fingerprint.

Cite this