ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection

Chun-Wun Cheng¹

Yanqi Cheng¹

Peiyuan Jing²

Guang Yang²

Javier Montoya³

¹Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK
²Imperial College London, UK
³Zurich University of Applied Sciences (ZHAW), Switzerland
⁴Yau Mathematical Sciences Center, Tsinghua University, China
^*Corresponding Author

affiliations

[Paper]

This is a website made for ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection.

Abstract

Medical image segmentation commonly relies on U-shaped encoder-decoder architectures such as U-Net, where skip connections preserve fine spatial detail by injecting high-resolution encoder features into the decoder. However, these skip pathways also propagate low-level textures, background clutter, and acquisition noise, allowing irrelevant information to bypass deeper semantic filtering-an issue that is particularly detrimental in low-contrast clinical imaging. Although attention gates have been introduced to address this limitation, they typically produce dense sigmoid masks that softly reweight features rather than explicitly removing irrelevant activations. We propose ProSMA-UNet (Proximal-Sparse Multi-Scale Attention U-Net), which reformulates skip gating as a decoder-conditioned sparse feature selection problem. ProSMA constructs a multi-scale compatibility field using lightweight depthwise dilated convolutions to capture relevance across local and contextual scales, then enforces explicit sparsity via an 𝓁₁ proximal operator with learnable per-channel thresholds, yielding a closed-form soft-thresholding gate that can remove noisy responses. To further suppress semantically irrelevant channels, ProSMA incorporates decoder-conditioned channel gating driven by global decoder context. Extensive experiments on challenging 2D and 3D benchmarks demonstrate state-of-the-art performance, with particularly large gains (≈20%) on difficult 3D segmentation tasks.

Experiments

Paper and Supplementary Material

Chun-Wun Cheng, Yanqi Cheng, Peiyuan Jing, Guang Yang, Javier Montoya, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero.
ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection

[Bibtex]

Acknowledgements

CWC is supported by the Swiss National Science Foundation (SNSF) under grant number 20HW-1 220785. YC is funded by an AstraZeneca studentship and a Google studentship. CBS acknowledges support from the Philip Leverhulme Prize, the Royal Society Wolfson Fellowship, the EPSRC advanced career fellowship EP/V029428/1, EPSRC grants EP/S026045/1 and EP/T003553/1, EP/N014588/1, EP/T017961/1, the Wellcome Innovator Awards 215733/Z/19/Z and 221633/Z/20/Z, the European Union Horizon 2020 research and innovation programme under the Marie Skodowska-Curie grant agreement No. 777826 NoMADS, the Cantab Capital Institute for the Mathematics of Information and the Alan Turing Institute. AIAR gratefully acknowledges the support from Yau Mathematical Sciences Center, Tsinghua University.