EfficientUNetViT: Efficient Breast Tumor Segmentation Utilizing UNet Architecture and Pretrained Vision Transformer

  • Shokofeh Anari
  • , Gabriel Gomes de Oliveira
  • , Ramin Ranjbarzadeh
  • , Angela Maria Alves
  • , Gabriel Caumo Vaz
  • , Malika Bendechache

Research output: Contribution to a Journal (Peer & Non Peer)Articlepeer-review

22 Citations (Scopus)

Abstract

This study introduces a sophisticated neural network structure for segmenting breast tumors. It achieves this by combining a pretrained Vision Transformer (ViT) model with a UNet framework. The UNet architecture, commonly employed for biomedical image segmentation, is further enhanced with depthwise separable convolutional blocks to decrease computational complexity and parameter count, resulting in better efficiency and less overfitting. The ViT, renowned for its robust feature extraction capabilities utilizing self-attention processes, efficiently captures the overall context within images, surpassing the performance of conventional convolutional networks. By using a pretrained ViT as the encoder in our UNet model, we take advantage of its extensive feature representations acquired from extensive datasets, resulting in a major enhancement in the model’s ability to generalize and train efficiently. The suggested model has exceptional performance in segmenting breast cancers from medical images, highlighting the advantages of integrating transformer-based encoders with efficient UNet topologies. This hybrid methodology emphasizes the capabilities of transformers in the field of medical image processing and establishes a new standard for accuracy and efficiency in activities related to tumor segmentation.

Original languageEnglish
Article number945
JournalBioengineering
Volume11
Issue number9
DOIs
Publication statusPublished - Sep 2024

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • breast cancer
  • depthwise separable convolutional
  • UNet
  • vision transformer

Fingerprint

Dive into the research topics of 'EfficientUNetViT: Efficient Breast Tumor Segmentation Utilizing UNet Architecture and Pretrained Vision Transformer'. Together they form a unique fingerprint.

Cite this