Skip to main navigation Skip to search Skip to main content

基于隐空间扩散模型的差分隐私数据合成方法研究

Translated title of the contribution: Differential Privacy Data Synthesis Method Based on Latent Diffusion Model
  • Yinchi Ge
  • , Hui Zhang*
  • , Haohang Sun
  • *Corresponding author for this work
  • Beihang University

Research output: Contribution to journalArticlepeer-review

Abstract

The widespread application of data sharing and publication in the socio-economic domain drives scientific progress and societal development. However, issues related to copyright and privacy, especially concerning personal data, remain critical challenges. Differential privacy data synthesis has emerged as an effective means of protecting data privacy, where data holders can release synthetic data instead of real data, thereby enhancing data utility and availability while preserving privacy. In response to the limited usability of existing differential privacy generation models, this paper proposes a two-stage differential privacy generation model based on the latent space diffusion approach. Firstly, the differential privacy-aware information compression is performed on the original image, and it is projected from the pixel space to the latent space to obtain the desensitized latent vector representation of the original sensitive data. The latent vector is then fed into a diffusion model to gradually transform into a prior distribution and sampled through a denoising process. Experimental results based on the MNIST and Fashion MNIST datasets demonstrate that the proposed model exhibits significant improvements in terms of Frechet inception distance(FID) and downstream task accuracy compared to state-of-the-art models like DP-Sinkhorn.

Translated title of the contributionDifferential Privacy Data Synthesis Method Based on Latent Diffusion Model
Original languageChinese (Traditional)
Pages (from-to)30-38
Number of pages9
JournalComputer Science
Volume51
Issue number3
DOIs
StatePublished - 15 Mar 2024

Fingerprint

Dive into the research topics of 'Differential Privacy Data Synthesis Method Based on Latent Diffusion Model'. Together they form a unique fingerprint.

Cite this