跳到主要导航 跳到搜索 跳到主要内容

Learning Continuous Spatiotemporal Implicit Neural Fields for Unsupervised Video Denoising

  • Xiaowan Hu
  • , Henan Liu
  • , Ce Zheng
  • , Xinyang Li
  • , Mai Xu*
  • *此作品的通讯作者
  • Beihang University
  • Tsinghua University

科研成果: 期刊稿件文章同行评审

摘要

Video denoising is fundamental to low-level vision and real-world imaging, yet existing self-supervised methods remain fragile under severe noise and complex motion. Most approaches still rely on spatially and temporally discrete grid-based representations: blind-spot networks enforce J-invariance by masking center pixels with a limited receptive field, while recurrent models build temporal dependencies on discretized frame sequences and noise-sensitive optical flow, leading to error accumulation and motion artifacts. We address this model bottleneck by reformulating self-supervised video denoising as learning a continuous spatiotemporal implicit field. Building on coordinate-based implicit neural representations, we propose a unified video denoising model with a spatiotemporal implicit neural field (SINF). In the spatial domain, blind-spot implicit spatial field maps coordinates directly to pixel-level representations, enabling globally informed texture recovery beyond receptive-field limits. In the temporal domain, an implicit temporal embedding with periodic activations encodes motion continuously over time, while a time-aware spatial graph module refines cross-frame alignment. Together, SINF remodels discretized video signals into a continuous spatiotemporal intensity field, enabling more robust pixel-wise associations than coarse optical flow. Extensive experiments on synthetic and real noisy video benchmarks demonstrate that our SINF achieves state-of-the-art performance on synthetic and real noisy video benchmarks.

指纹

探究 'Learning Continuous Spatiotemporal Implicit Neural Fields for Unsupervised Video Denoising' 的科研主题。它们共同构成独一无二的指纹。

引用此