Dictionary Learning Based Two-stage Near-lossless Video Compression

  • Zuhai Zhang
  • , Luheng Jia*
  • , Li Song
  • , Shuyuan Zhu
  • , Yuanfang Guo
  • , Kebin Jia
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditional hybrid video coding framework using block based predictive coding and transform coding, such as the High Efficiency Video Coding (HEVC), cannot further dig out the redundancy remained in quantized transformed residual, causing extra bits consumption. Measured by rate-distortion (RD) performance, the problem of higher bits consuming can be solved reversely by video quality enhancing. In this work, we proposed a video coding scheme that solve the problem by enhancing the reconstructed video quality using supplementary information from further compressed quantization error. Aiming at better R-D performance for near-lossless video coding, we propose a novel video coding scheme using a two-stage framework that extracts quantization error as complementary information which is compressed using dictionary learning and sparse representation. The employed over-complete dictionary is learned through K-SVD with orthogonal matching pursuit (OMP) for sparse representation. Statistically reduandancy is further removed by a modified context-adaptive binary arithmetic coding (CABAC) with adaptive context models. This approach not only retains the advantages of the traditionally encoder for lossy compression but also exploits the redundancy in the quantization error to achieve high-quality near-lossless compression. Experimental results demonstrate that our method significantly outperforms traditional HEVC lossy encoder with over -20% BD-BR on average at high bitrate range for near-lossless coding, while the method is also proved to be efficient at low bitrate range achieving over -50% BD-BR on average with average PSNR over 41dB, which retains near-lossless performance.

Original languageEnglish
Title of host publicationAPSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350367331
DOIs
StatePublished - 2024
Event2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024 - Macau, China
Duration: 3 Dec 20246 Dec 2024

Publication series

NameAPSIPA ASC 2024 - Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2024

Conference

Conference2024 Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2024
Country/TerritoryChina
CityMacau
Period3/12/246/12/24

Fingerprint

Dive into the research topics of 'Dictionary Learning Based Two-stage Near-lossless Video Compression'. Together they form a unique fingerprint.

Cite this