ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection

Chun-Wun Cheng1
Yanqi Cheng1
Peiyuan Jing2
Guang Yang2
Javier Montoya3
Carola-Bibiane Schönlieb1
Angelica I. Aviles-Rivero*4

1Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK
2Imperial College London, UK
3Zurich University of Applied Sciences (ZHAW), Switzerland
4Yau Mathematical Sciences Center, Tsinghua University, China
*Corresponding Author


affiliations

[Paper]
This is a website made for ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection.

Abstract

Medical image segmentation commonly relies on U-shaped encoder-decoder architectures such as U-Net, where skip connections preserve fine spatial detail by injecting high-resolution encoder features into the decoder. However, these skip pathways also propagate low-level textures, background clutter, and acquisition noise, allowing irrelevant information to bypass deeper semantic filtering-an issue that is particularly detrimental in low-contrast clinical imaging. Although attention gates have been introduced to address this limitation, they typically produce dense sigmoid masks that softly reweight features rather than explicitly removing irrelevant activations. We propose ProSMA-UNet (Proximal-Sparse Multi-Scale Attention U-Net), which reformulates skip gating as a decoder-conditioned sparse feature selection problem. ProSMA constructs a multi-scale compatibility field using lightweight depthwise dilated convolutions to capture relevance across local and contextual scales, then enforces explicit sparsity via an 𝓁1 proximal operator with learnable per-channel thresholds, yielding a closed-form soft-thresholding gate that can remove noisy responses. To further suppress semantically irrelevant channels, ProSMA incorporates decoder-conditioned channel gating driven by global decoder context. Extensive experiments on challenging 2D and 3D benchmarks demonstrate state-of-the-art performance, with particularly large gains (≈20%) on difficult 3D segmentation tasks.


Experiments



Paper and Supplementary Material


Chun-Wun Cheng, Yanqi Cheng, Peiyuan Jing, Guang Yang, Javier Montoya, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero.
ProSMA-UNet: Decoder Conditioning for Proximal-Sparse Skip Feature Selection

[Bibtex]


Acknowledgements

CWC is supported by the Swiss National Science Foundation (SNSF) under grant number 20HW-1 220785. YC is funded by an AstraZeneca studentship and a Google studentship. CBS acknowledges support from the Philip Leverhulme Prize, the Royal Society Wolfson Fellowship, the EPSRC advanced career fellowship EP/V029428/1, EPSRC grants EP/S026045/1 and EP/T003553/1, EP/N014588/1, EP/T017961/1, the Wellcome Innovator Awards 215733/Z/19/Z and 221633/Z/20/Z, the European Union Horizon 2020 research and innovation programme under the Marie Skodowska-Curie grant agreement No. 777826 NoMADS, the Cantab Capital Institute for the Mathematics of Information and the Alan Turing Institute. AIAR gratefully acknowledges the support from Yau Mathematical Sciences Center, Tsinghua University.