This paper presents a novel attention-based video super-resolution (VSR) method that avoids costly optical flow estimation while effectively exploiting temporal correlations between frames. We propose an aligner module that utilizes cross-attention to blend relevant patches from adjacent frames, gathering information from multiple frames simultaneously. This method improves upon traditional flow-based approaches by working at a block level and enabling the blending of several pixels, yielding better alignment for larger motions. The proposed VSR technique can upscale videos up to 4x while simultaneously removing compression artifacts, enhancing both resolution and quality. Experimental results demonstrate the effectiveness of this approach compared to classic flow-based methods, particularly in handling compressed videos where compression artifacts can severely impact optical flow estimation.
Multi-Frame Alignment for Video Super-Resolution Using Attention / Di Rienzo, Marco; Bruni, Matteo; Galteri, Leonardo; Becattini, Federico; Bertini, Marco. - ELETTRONICO. - (2025), pp. 608-620. ( ICIAP) [10.1007/978-3-032-10185-3_48].
Multi-Frame Alignment for Video Super-Resolution Using Attention
Bruni, Matteo;Galteri, Leonardo;Becattini, Federico;Bertini, Marco
2025
Abstract
This paper presents a novel attention-based video super-resolution (VSR) method that avoids costly optical flow estimation while effectively exploiting temporal correlations between frames. We propose an aligner module that utilizes cross-attention to blend relevant patches from adjacent frames, gathering information from multiple frames simultaneously. This method improves upon traditional flow-based approaches by working at a block level and enabling the blending of several pixels, yielding better alignment for larger motions. The proposed VSR technique can upscale videos up to 4x while simultaneously removing compression artifacts, enhancing both resolution and quality. Experimental results demonstrate the effectiveness of this approach compared to classic flow-based methods, particularly in handling compressed videos where compression artifacts can severely impact optical flow estimation.| File | Dimensione | Formato | |
|---|---|---|---|
|
978-3-032-10185-3_48.pdf
accesso aperto
Tipologia:
Pdf editoriale (Version of record)
Licenza:
Open Access
Dimensione
1.62 MB
Formato
Adobe PDF
|
1.62 MB | Adobe PDF |
I documenti in FLORE sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



