arXiv:2308.13680 [cs.CV]

ACC-UNet: A Completely Convolutional UNet model for the 2020s

Published 2023-08-25, Version 1

This decade is marked by the introduction of the Vision Transformer, a radical paradigm shift in broad computer vision. Medical imaging has followed a similar trend: UNet, one of the most influential architectures, has been redesigned with transformers. Recently, the efficacy of convolutional models in vision has been reinvestigated by seminal works such as ConvNeXt, which elevates a ResNet to Swin Transformer level. Deriving inspiration from this, we aim to improve a purely convolutional UNet model so that it can be on par with transformer-based models, e.g., Swin-Unet or UCTransNet. We examined several advantages of the transformer-based UNet models, primarily long-range dependencies and cross-level skip connections. We attempted to emulate them through convolution operations and thus propose ACC-UNet, a completely convolutional UNet model that brings the best of both worlds: the inherent inductive biases of convnets and the design decisions of transformers. ACC-UNet was evaluated on 5 different medical image segmentation benchmarks and consistently outperformed convnets, transformers, and their hybrids. Notably, ACC-UNet outperforms the state-of-the-art models Swin-Unet and UCTransNet by $2.64 \pm 2.54\%$ and $0.45 \pm 1.61\%$ in terms of dice score, respectively, while using a fraction of their parameters ($59.26\%$ and $24.24\%$). Our code is available at https://github.com/kiharalab/ACC-UNet.
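The abstract reports its gains in terms of dice score without defining it. As a reminder, the Dice coefficient of a predicted mask A and a ground-truth mask B is commonly computed as 2|A ∩ B| / (|A| + |B|). The following is a minimal sketch of that standard definition for flattened binary masks; the function name `dice_score` and the empty-mask convention are assumptions made for illustration, and this is not the evaluation code from the ACC-UNet repository.

```python
from typing import Sequence


def dice_score(pred: Sequence[int], target: Sequence[int]) -> float:
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two flattened binary masks.

    Hypothetical helper for illustration; not taken from the ACC-UNet code.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")

    # Pixels marked foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground pixels across the two masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)

    if total == 0:
        # Assumed convention: two empty masks count as perfect agreement.
        return 1.0
    return 2.0 * intersection / total


if __name__ == "__main__":
    pred = [1, 1, 0, 0, 1, 0]
    target = [1, 0, 0, 0, 1, 1]
    # 2 overlapping pixels, 3 + 3 foreground pixels in total.
    print(f"{dice_score(pred, target):.4f}")
```

In practice the score would typically be computed per image (or per class) and averaged over a test set; how exactly the reported numbers were aggregated is not stated in this abstract.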

Categories: cs.CV

Keywords: acc-unet outperforms state-of-the-art models swin-unet, medical image segmentation benchmarks, swin transformer level, purely convolutional unet model, cross-level skip connections

Related articles:

arXiv:2107.08623 [cs.CV] (Published 2021-07-19)

LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation

Guoping Xu, Xingrong Wu, Xuan Zhang, Xinwei He

arXiv:2111.10989 [cs.CV] (Published 2021-11-22, updated 2023-07-30)

Exploring Feature Representation Learning for Semi-supervised Medical Image Segmentation

Huimin Wu, Xiaomeng Li, Kwang-Ting Cheng

arXiv:2009.07501 [cs.CV] (Published 2020-09-16)

UXNet: Searching Multi-level Feature Aggregation for 3D Medical Image Segmentation

Yuanfeng Ji, Ruimao Zhang, Zhen Li, Jiamin Ren, Shaoting Zhang, Ping Luo