Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration

Yuntong Tian; Yan Hu; Yuhui Ma; Huaying Hao; Lei Mou; Jianlong Yang; Yitian Zhao; Jiang Liu

doi:10.1109/EMBC44109.2020.9175613

Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration

Yuntong Tian, Yan Hu, Yuhui Ma, Huaying Hao, Lei Mou, Jianlong Yang, Yitian Zhao, Jiang Liu

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

13 Citations (Scopus)

Abstract

Registration of multimodal retinal images is of great importance in facilitating the diagnosis and treatment of many eye diseases, such as the registration between color fundus images and optical coherence tomography (OCT) images. However, it is difficult to obtain ground truth, and most existing algorithms are for rigid registration without considering the optical distortion. In this paper, we present an unsupervised learning method for deformable registration between the two images. To solve the registration problem, the structure achieves a multi-level receptive field and takes contour and local detail into account. To measure the edge difference caused by different distortions in the optics center and edge, an edge similarity (ES) loss term is proposed, so loss function is composed by local cross-correlation, edge similarity and diffusion regularizer on the spatial gradients of the deformation matrix. Thus, we propose a multi-scale input layer, U-net with dilated convolution structure, squeeze excitation (SE) block and spatial transformer layers. Quantitative experiments prove the proposed framework is best compared with several conventional and deep learningbased methods, and our ES loss and structure combined with Unet and multi-scale layers achieve competitive results for normal and abnormal images.

Original language	English
Title of host publication	42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society
Subtitle of host publication	Enabling Innovative Technologies for Global Healthcare, EMBC 2020
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1360-1363
Number of pages	4
ISBN (Electronic)	9781728119908
DOIs	https://doi.org/10.1109/EMBC44109.2020.9175613
Publication status	Published - Jul 2020
Externally published	Yes
Event	42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020 - Montreal, Canada Duration: 20 Jul 2020 → 24 Jul 2020

Publication series

Name	Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Volume	2020-July
ISSN (Print)	1557-170X

Conference

Conference	42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020
Country/Territory	Canada
City	Montreal
Period	20/07/20 → 24/07/20

Keywords

Color Fundus
Deep Learning
Deformable Registration
Multimodal Registration
Optical Coherence Tomography

ASJC Scopus subject areas

Signal Processing
Biomedical Engineering
Computer Vision and Pattern Recognition
Health Informatics

Access to Document

10.1109/EMBC44109.2020.9175613

Cite this

Tian, Y., Hu, Y., Ma, Y., Hao, H., Mou, L., Yang, J., Zhao, Y., & Liu, J. (2020). Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration. In 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020 (pp. 1360-1363). Article 9175613 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS; Vol. 2020-July). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/EMBC44109.2020.9175613

Tian, Yuntong ; Hu, Yan ; Ma, Yuhui et al. / Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration. 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020. Institute of Electrical and Electronics Engineers Inc., 2020. pp. 1360-1363 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS).

@inproceedings{8eab8d79731e4827be575ec1f7523d2d,

title = "Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration",

abstract = "Registration of multimodal retinal images is of great importance in facilitating the diagnosis and treatment of many eye diseases, such as the registration between color fundus images and optical coherence tomography (OCT) images. However, it is difficult to obtain ground truth, and most existing algorithms are for rigid registration without considering the optical distortion. In this paper, we present an unsupervised learning method for deformable registration between the two images. To solve the registration problem, the structure achieves a multi-level receptive field and takes contour and local detail into account. To measure the edge difference caused by different distortions in the optics center and edge, an edge similarity (ES) loss term is proposed, so loss function is composed by local cross-correlation, edge similarity and diffusion regularizer on the spatial gradients of the deformation matrix. Thus, we propose a multi-scale input layer, U-net with dilated convolution structure, squeeze excitation (SE) block and spatial transformer layers. Quantitative experiments prove the proposed framework is best compared with several conventional and deep learningbased methods, and our ES loss and structure combined with Unet and multi-scale layers achieve competitive results for normal and abnormal images.",

keywords = "Color Fundus, Deep Learning, Deformable Registration, Multimodal Registration, Optical Coherence Tomography",

author = "Yuntong Tian and Yan Hu and Yuhui Ma and Huaying Hao and Lei Mou and Jianlong Yang and Yitian Zhao and Jiang Liu",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020 ; Conference date: 20-07-2020 Through 24-07-2020",

year = "2020",

month = jul,

doi = "10.1109/EMBC44109.2020.9175613",

language = "English",

series = "Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1360--1363",

booktitle = "42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society",

address = "United States",

}

Tian, Y, Hu, Y, Ma, Y, Hao, H, Mou, L, Yang, J, Zhao, Y & Liu, J 2020, Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration. in 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020., 9175613, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS, vol. 2020-July, Institute of Electrical and Electronics Engineers Inc., pp. 1360-1363, 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020, Montreal, Canada, 20/07/20. https://doi.org/10.1109/EMBC44109.2020.9175613

Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration. / Tian, Yuntong; Hu, Yan; Ma, Yuhui et al.
42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020. Institute of Electrical and Electronics Engineers Inc., 2020. p. 1360-1363 9175613 (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS; Vol. 2020-July).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration

AU - Tian, Yuntong

AU - Hu, Yan

AU - Ma, Yuhui

AU - Hao, Huaying

AU - Mou, Lei

AU - Yang, Jianlong

AU - Zhao, Yitian

AU - Liu, Jiang

PY - 2020/7

Y1 - 2020/7

N2 - Registration of multimodal retinal images is of great importance in facilitating the diagnosis and treatment of many eye diseases, such as the registration between color fundus images and optical coherence tomography (OCT) images. However, it is difficult to obtain ground truth, and most existing algorithms are for rigid registration without considering the optical distortion. In this paper, we present an unsupervised learning method for deformable registration between the two images. To solve the registration problem, the structure achieves a multi-level receptive field and takes contour and local detail into account. To measure the edge difference caused by different distortions in the optics center and edge, an edge similarity (ES) loss term is proposed, so loss function is composed by local cross-correlation, edge similarity and diffusion regularizer on the spatial gradients of the deformation matrix. Thus, we propose a multi-scale input layer, U-net with dilated convolution structure, squeeze excitation (SE) block and spatial transformer layers. Quantitative experiments prove the proposed framework is best compared with several conventional and deep learningbased methods, and our ES loss and structure combined with Unet and multi-scale layers achieve competitive results for normal and abnormal images.

AB - Registration of multimodal retinal images is of great importance in facilitating the diagnosis and treatment of many eye diseases, such as the registration between color fundus images and optical coherence tomography (OCT) images. However, it is difficult to obtain ground truth, and most existing algorithms are for rigid registration without considering the optical distortion. In this paper, we present an unsupervised learning method for deformable registration between the two images. To solve the registration problem, the structure achieves a multi-level receptive field and takes contour and local detail into account. To measure the edge difference caused by different distortions in the optics center and edge, an edge similarity (ES) loss term is proposed, so loss function is composed by local cross-correlation, edge similarity and diffusion regularizer on the spatial gradients of the deformation matrix. Thus, we propose a multi-scale input layer, U-net with dilated convolution structure, squeeze excitation (SE) block and spatial transformer layers. Quantitative experiments prove the proposed framework is best compared with several conventional and deep learningbased methods, and our ES loss and structure combined with Unet and multi-scale layers achieve competitive results for normal and abnormal images.

KW - Color Fundus

KW - Deep Learning

KW - Deformable Registration

KW - Multimodal Registration

KW - Optical Coherence Tomography

UR - http://www.scopus.com/inward/record.url?scp=85091036155&partnerID=8YFLogxK

U2 - 10.1109/EMBC44109.2020.9175613

DO - 10.1109/EMBC44109.2020.9175613

M3 - Conference contribution

AN - SCOPUS:85091036155

T3 - Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS

SP - 1360

EP - 1363

BT - 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society, EMBC 2020

Y2 - 20 July 2020 through 24 July 2020

ER -

Tian Y, Hu Y, Ma Y, Hao H, Mou L, Yang J et al. Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration. In 42nd Annual International Conferences of the IEEE Engineering in Medicine and Biology Society: Enabling Innovative Technologies for Global Healthcare, EMBC 2020. Institute of Electrical and Electronics Engineers Inc. 2020. p. 1360-1363. 9175613. (Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS). doi: 10.1109/EMBC44109.2020.9175613

Multi-scale U-net with Edge Guidance for Multimodal Retinal Image Deformable Registration

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this