跳到主要导航 跳到搜索 跳到主要内容

基于隐空间扩散模型的差分隐私数据合成方法研究

  • Yinchi Ge
  • , Hui Zhang*
  • , Haohang Sun
  • *此作品的通讯作者
  • Beihang University

科研成果: 期刊稿件文章同行评审

摘要

The widespread application of data sharing and publication in the socio-economic domain drives scientific progress and societal development. However, issues related to copyright and privacy, especially concerning personal data, remain critical challenges. Differential privacy data synthesis has emerged as an effective means of protecting data privacy, where data holders can release synthetic data instead of real data, thereby enhancing data utility and availability while preserving privacy. In response to the limited usability of existing differential privacy generation models, this paper proposes a two-stage differential privacy generation model based on the latent space diffusion approach. Firstly, the differential privacy-aware information compression is performed on the original image, and it is projected from the pixel space to the latent space to obtain the desensitized latent vector representation of the original sensitive data. The latent vector is then fed into a diffusion model to gradually transform into a prior distribution and sampled through a denoising process. Experimental results based on the MNIST and Fashion MNIST datasets demonstrate that the proposed model exhibits significant improvements in terms of Frechet inception distance(FID) and downstream task accuracy compared to state-of-the-art models like DP-Sinkhorn.

投稿的翻译标题Differential Privacy Data Synthesis Method Based on Latent Diffusion Model
源语言繁体中文
页(从-至)30-38
页数9
期刊Computer Science
51
3
DOI
出版状态已出版 - 15 3月 2024

关键词

  • Autoencoder
  • Data synthesis
  • Differential privacy
  • Diffusion models
  • Generative models

指纹

探究 '基于隐空间扩散模型的差分隐私数据合成方法研究' 的科研主题。它们共同构成独一无二的指纹。

引用此