SimFlowSR IconSimFlowSR: Self-similarity Aggregation over Consistent Information Flow for Single Image Super-Resolution

1National Yang Ming Chiao Tung University, 2National Cheng Kung University
SimFlowSR Teaser

TL;DR: SimFlowSR integrates CEB (consistent flow) with GAB (self-similarity aggregation) for efficient high-fidelity super-resolution.

Abstract

Recent SISR methods suffer from unstable feature propagation and large model sizes. While consistent information flow stabilizes activation dynamics, current approaches struggle to recover high-frequency details crucial for perceptual quality.

We propose SimFlowSR, comprising two complementary branches: CEB stabilizes dynamics through dense-residual connections, while GAB recovers fine structures via parameter-free geometric transformations (dihedral group D₄). This achieves globally consistent yet detail-rich reconstruction with minimal overhead.

When integrated into SwinIR and MambaIR, SimFlowSR improves quality with up to 37% fewer parameters, demonstrating superior performance across standard benchmarks.

Motivation

Motivation

Despite DRCT's competitive performance through consistent information flow, it exhibits limited high-frequency capture. Training dynamics on DF2K show DRCT achieves stable entropy but suboptimal frequency ratio—revealing the gap: consistent flow ensures stability, but lacks mechanisms for high-frequency recovery.

SimFlowSR achieves both highest frequency ratio and stable entropy, validating that CEB + GAB enables stable yet detail-rich reconstruction under low complexity.

Geometric Self-Similarity

Self-similarity

Natural images exhibit geometric self-similarity—structural patterns recur under rotation, flipping, scaling. This fractal property manifests across diverse scenes (brick walls, leaves, snowflakes), enabling high-frequency recovery without heavy computation.

GAB

GAB explicitly models self-similarity via parameter-free transformations. Unlike methods searching patches without geometric modeling (a), GAB applies dihedral group D₄ transformations (identity, 90°/180°/270° rotations, flips) to construct candidates (b). Initially dissimilar patches align after transformation, revealing latent patterns. Top candidates are selected (cosine similarity), inverse-aligned (T⁻¹), and aggregated (adaptive attention)—no learnable parameters required.

Network Architecture

SimFlowSR employs a modular dual-branch architecture seamlessly integrated into backbones (SwinIR, MambaIR, RWKVIR). CEB maintains consistent flow via dense-residual connections; GAB aggregates self-similar features via D₄ transformations. This plug-and-play design enhances quality across architectures while reducing complexity by up to 37%.

Visualization

Activation Dynamics

SimFlowSR achieves the most stable feature propagation. Conventional methods (SwinIR-RSTB, HAT-RHAG) show dramatic activation fluctuations across depths, indicating unstable information flow. DRCT (RDG) improves stability through dense connections, but SimFlowSR further compresses the dynamic range with tighter activation clustering—validating that CEB + GAB maintains stable representations while preserving fine details.


Spatial Aggregation

LAM showing superior spatial aggregation with higher Diffusion Index.

ERF demonstrating substantially broader spatial coverage.

SimFlowSR captures long-range correspondences effectively. LAM visualization shows significantly higher Diffusion Index, while ERF demonstrates broader spatial coverage across all backbones—confirming GAB's multi-scale D₄ transformations consistently enhance spatial modeling capability.


BibTeX

@article{lee2025simflowsr,
  title={SimFlowSR: Self-similarity Aggregation over Consistent Information Flow for Single Image Super-Resolution},
  author={Lee, Chia-Ming and Hsu, Chih-Chung},
  journal={arXiv preprint arXiv:XXXX.XXXXX},
  year={2025}
}