TY - JOUR
T1 - CFFormer
T2 - Cross CNN-Transformer channel attention and spatial feature fusion for improved segmentation of heterogeneous medical images
AU - Li, Jiaxuan
AU - Xu, Qing
AU - He, Xiangjian
AU - Liu, Ziyu
AU - Zhang, Daokun
AU - Wang, Ruili
AU - Qu, Rong
AU - Qiu, Guoping
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2026/1/1
Y1 - 2026/1/1
N2 - Medical image segmentation plays an important role in computer-aided diagnosis. Existing methods mainly utilize spatial attention to highlight the region of interest. However, due to limitations of medical imaging devices, medical images exhibit significant heterogeneity, posing challenges for segmentation. Ultrasound images, for instance, often suffer from speckle noise, low resolution, and poor contrast between target tissues and background, which may lead to inaccurate boundary delineation. To address these challenges caused by heterogeneous image quality, we propose a hybrid CNN-Transformer model, called CFFormer, which leverages effective channel feature extraction to enhance the model's ability to accurately identify tissue regions by capturing rich contextual information. The proposed architecture contains two key components: the Cross Feature Channel Attention (CFCA) module and the X-Spatial Feature Fusion (XFF) module. The model incorporates dual encoders, with the CNN encoder focusing on capturing local features and the Transformer encoder modeling global features. The CFCA module filters and facilitates interactions between the channel features from the two encoders, while the XFF module effectively reduces the significant semantic information differences in spatial features, enabling a smooth and cohesive spatial feature fusion. We evaluate our model across eight datasets covering five modalities to test its generalization capability. Experimental results demonstrate that our model outperforms current state-of-the-art methods and maintains accurate tissue region segmentation across heterogeneous medical image datasets. The code is available at https://github.com/JiaxuanFelix/CFFormer.
AB - Medical image segmentation plays an important role in computer-aided diagnosis. Existing methods mainly utilize spatial attention to highlight the region of interest. However, due to limitations of medical imaging devices, medical images exhibit significant heterogeneity, posing challenges for segmentation. Ultrasound images, for instance, often suffer from speckle noise, low resolution, and poor contrast between target tissues and background, which may lead to inaccurate boundary delineation. To address these challenges caused by heterogeneous image quality, we propose a hybrid CNN-Transformer model, called CFFormer, which leverages effective channel feature extraction to enhance the model's ability to accurately identify tissue regions by capturing rich contextual information. The proposed architecture contains two key components: the Cross Feature Channel Attention (CFCA) module and the X-Spatial Feature Fusion (XFF) module. The model incorporates dual encoders, with the CNN encoder focusing on capturing local features and the Transformer encoder modeling global features. The CFCA module filters and facilitates interactions between the channel features from the two encoders, while the XFF module effectively reduces the significant semantic information differences in spatial features, enabling a smooth and cohesive spatial feature fusion. We evaluate our model across eight datasets covering five modalities to test its generalization capability. Experimental results demonstrate that our model outperforms current state-of-the-art methods and maintains accurate tissue region segmentation across heterogeneous medical image datasets. The code is available at https://github.com/JiaxuanFelix/CFFormer.
KW - Deep learning
KW - Hybrid CNN-transformer model
KW - Image segmentation
KW - Medical image segmentation
UR - https://www.scopus.com/pages/publications/105009724225
U2 - 10.1016/j.eswa.2025.128835
DO - 10.1016/j.eswa.2025.128835
M3 - Article
AN - SCOPUS:105009724225
SN - 0957-4174
VL - 295
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 128835
ER -