TY - JOUR
T1 - Burst image super-resolution via multi-cross attention encoding and multi-scan state-space decoding
AU - Huang, Tengda
AU - Zhang, Yu
AU - Li, Tianren
AU - Qu, Yufu
AU - Liu, Fulin
AU - Wei, Zhenzhong
N1 - Publisher Copyright:
© 2025 Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
PY - 2025/11
Y1 - 2025/11
N2 - Multi-image super-resolution (MISR) can achieve higher image quality than single-image super-resolution (SISR) by aggregating sub-pixel information from multiple spatially shifted frames. Among MISR tasks, burst super-resolution (BurstSR) has gained significant attention due to its wide range of applications. Most existing methods use fixed and narrow attention windows, limiting feature perception and hindering alignment and aggregation. To address these limitations, we propose a novel feature extractor that incorporates two newly designed attention mechanisms: overlapping cross-window attention and cross-frame attention, enabling more precise and efficient extraction of sub-pixel information across multiple frames. Furthermore, we introduce a Multi-scan State-Space Module with the cross-frame attention mechanism to enhance feature aggregation. Extensive experiments on both synthetic and real-world benchmarks demonstrate the superiority of our approach. Additional evaluations on ISO 12233 resolution test charts further confirm its enhanced super-resolution performance.
AB - Multi-image super-resolution (MISR) can achieve higher image quality than single-image super-resolution (SISR) by aggregating sub-pixel information from multiple spatially shifted frames. Among MISR tasks, burst super-resolution (BurstSR) has gained significant attention due to its wide range of applications. Most existing methods use fixed and narrow attention windows, limiting feature perception and hindering alignment and aggregation. To address these limitations, we propose a novel feature extractor that incorporates two newly designed attention mechanisms: overlapping cross-window attention and cross-frame attention, enabling more precise and efficient extraction of sub-pixel information across multiple frames. Furthermore, we introduce a Multi-scan State-Space Module with the cross-frame attention mechanism to enhance feature aggregation. Extensive experiments on both synthetic and real-world benchmarks demonstrate the superiority of our approach. Additional evaluations on ISO 12233 resolution test charts further confirm its enhanced super-resolution performance.
KW - Burst super-resolution
KW - Multi-cross attention
KW - Multi-image super-resolution
KW - State-space module
KW - Sub-pixel information extraction
UR - https://www.scopus.com/pages/publications/105019497255
U2 - 10.1016/j.imavis.2025.105773
DO - 10.1016/j.imavis.2025.105773
M3 - 文章
AN - SCOPUS:105019497255
SN - 0262-8856
VL - 163
JO - Image and Vision Computing
JF - Image and Vision Computing
M1 - 105773
ER -