TY - JOUR
T1 - MLPFormer
T2 - MLP-integrated transformer for colorectal histopathology whole slide image segmentation
AU - Wang, Yuxuan
AU - Li, Dan
AU - Li, Xuechen
AU - Guo, Yan
AU - Zuo, Yanfei
AU - Shen, Linlin
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
PY - 2025/2
Y1 - 2025/2
N2 - Colorectal intraepithelial neoplasia is a precancerous lesion of colorectal cancer, which is mainly diagnosed using pathological images. According to the characteristics of lesions, precancerous lesions can be classified into four different grades, i.e., background, normal, low and high level. However, due to the continuity of lesion development, tissue in different stages has high similarity. Recently, visual transformers have achieved impressive results in different visual tasks, due to their capacity of capturing long-range dependencies. However, due to computational cost, transformer cannot well extract detailed feature, which is important for lesion grading. To accurately segment pathological tissues with high similarity, in this work, we embed the multi-head MLP mixer module in the transformer network to extract cell and edge information. Furthermore, we propose a hierarchical MLP decoder to facilitate multi-scale feature fusion. The proposed transformer, namely MLPFormer, achieves remarkable results against competitive baselines on the Histo-CRC Biopsy dataset. The experimental results demonstrate that MLPFormer significantly outperforms the competitive baselines, i.e., a 3% dice improvement is achieved over the SegFormer series.
AB - Colorectal intraepithelial neoplasia is a precancerous lesion of colorectal cancer, which is mainly diagnosed using pathological images. According to the characteristics of lesions, precancerous lesions can be classified into four different grades, i.e., background, normal, low and high level. However, due to the continuity of lesion development, tissue in different stages has high similarity. Recently, visual transformers have achieved impressive results in different visual tasks, due to their capacity of capturing long-range dependencies. However, due to computational cost, transformer cannot well extract detailed feature, which is important for lesion grading. To accurately segment pathological tissues with high similarity, in this work, we embed the multi-head MLP mixer module in the transformer network to extract cell and edge information. Furthermore, we propose a hierarchical MLP decoder to facilitate multi-scale feature fusion. The proposed transformer, namely MLPFormer, achieves remarkable results against competitive baselines on the Histo-CRC Biopsy dataset. The experimental results demonstrate that MLPFormer significantly outperforms the competitive baselines, i.e., a 3% dice improvement is achieved over the SegFormer series.
KW - Colorectal cancer
KW - MLP
KW - Pathological
KW - Segmentation
KW - Transformer
KW - Whole slide image
UR - http://www.scopus.com/inward/record.url?scp=85212870733&partnerID=8YFLogxK
U2 - 10.1007/s00521-024-10884-x
DO - 10.1007/s00521-024-10884-x
M3 - Article
AN - SCOPUS:85212870733
SN - 0941-0643
VL - 37
SP - 4651
EP - 4661
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 6
ER -