Retinal Structure Detection in OCTA Image via Voting-Based Multitask Learning

Jinkui Hao; Ting Shen; Xueli Zhu; Yonghuai Liu; Ardhendu Behera; Dan Zhang; Bang Chen; Jiang Liu; Jiong Zhang; Yitian Zhao

doi:10.1109/TMI.2022.3202183

Research output: Journal Publication › Article › peer-review

27 Citations (Scopus)

Abstract

Automated detection of retinal structures, such as retinal vessels (RV), the foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), is of great importance for understanding diseases of the eye and for clinical decision-making. In this paper, we propose a novel Voting-based Adaptive Feature Fusion multi-task network (VAFF-Net) for joint segmentation, detection, and classification of RV, FAZ, and RVJ in optical coherence tomography angiography (OCTA). A task-specific voting gate module is proposed to adaptively extract and fuse different features for specific tasks at two levels: features at different spatial positions from a single encoder, and features from multiple encoders. In particular, since the complexity of the microvasculature in OCTA images makes simultaneous precise localization and classification of retinal vascular junctions into bifurcations and crossings a challenging task, we specifically design a task head that combines heatmap regression and grid classification. We take advantage of three different en face angiograms from various retinal layers, rather than following existing methods that use only a single en face angiogram. We carry out extensive experiments on three OCTA datasets acquired using different imaging devices, and the results demonstrate that the proposed method performs, on the whole, better than both state-of-the-art single-purpose methods and existing multi-task learning solutions. We also demonstrate that our multi-task learning method generalizes to other imaging modalities, such as color fundus photography, and may potentially be used as a general multi-task learning tool. We also construct three datasets for multiple structure detection; part of these datasets, together with the source code and evaluation benchmark, has been released for public access.
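
The abstract pairs heatmap regression (for localizing junctions) with grid classification (for labeling them as bifurcation or crossing). The following is only a rough sketch of how training targets for two such outputs might be built from annotated junction points. It is not the VAFF-Net task head. The function name `junction_targets`, the label encoding (1 = bifurcation, 2 = crossing, 0 = empty cell), the Gaussian width `sigma`, and the grid cell size `cell` are all assumptions made for illustration.

```python
import math

def junction_targets(junctions, size, cell, sigma=2.0):
    """Build a Gaussian heatmap and a coarse grid of class labels.

    junctions: list of (row, col, label); label 1 = bifurcation, 2 = crossing
               (hypothetical encoding, not taken from the paper)
    size:      side length of the square image, in pixels
    cell:      side length of one grid cell, in pixels (assumed to divide size)
    """
    heatmap = [[0.0] * size for _ in range(size)]
    cells = size // cell
    grid = [[0] * cells for _ in range(cells)]  # 0 = no junction in this cell

    for r, c, label in junctions:
        # Heatmap regression target: one Gaussian bump per junction,
        # merged with an element-wise maximum so peaks stay at 1.0.
        for i in range(size):
            for j in range(size):
                d2 = (i - r) ** 2 + (j - c) ** 2
                value = math.exp(-d2 / (2 * sigma ** 2))
                heatmap[i][j] = max(heatmap[i][j], value)
        # Grid classification target: the cell containing the junction
        # receives that junction's class label.
        grid[r // cell][c // cell] = label
    return heatmap, grid

# Two hypothetical junctions on a 16x16 image with 4x4-pixel grid cells.
heatmap, grid = junction_targets([(5, 6, 1), (12, 3, 2)], size=16, cell=4)
```

In this sketch the heatmap carries the precise location, while the coarse grid carries only the class, so the two targets can be supervised with different losses. If two junctions fall in the same cell, the later one overwrites the earlier one; a real design would need to handle that case.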

Original language	English
Pages (from-to)	3969-3980
Number of pages	12
Journal	IEEE Transactions on Medical Imaging
Volume	41
Issue number	12
DOIs	https://doi.org/10.1109/TMI.2022.3202183
Publication status	Published - 1 Dec 2022
Externally published	Yes

Keywords

OCTA
classification
detection
multi-task learning
retina structures
segmentation

ASJC Scopus subject areas

Software
Radiological and Ultrasound Technology
Computer Science Applications
Electrical and Electronic Engineering

Access to Document

10.1109/TMI.2022.3202183

Cite this

@article{48f76aa3e78b4fa4a3e7a02310233b03,
title = "Retinal Structure Detection in OCTA Image via Voting-Based Multitask Learning",
abstract = "Automated detection of retinal structures, such as retinal vessels (RV), the foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), is of great importance for understanding diseases of the eye and for clinical decision-making. In this paper, we propose a novel Voting-based Adaptive Feature Fusion multi-task network (VAFF-Net) for joint segmentation, detection, and classification of RV, FAZ, and RVJ in optical coherence tomography angiography (OCTA). A task-specific voting gate module is proposed to adaptively extract and fuse different features for specific tasks at two levels: features at different spatial positions from a single encoder, and features from multiple encoders. In particular, since the complexity of the microvasculature in OCTA images makes simultaneous precise localization and classification of retinal vascular junctions into bifurcations and crossings a challenging task, we specifically design a task head that combines heatmap regression and grid classification. We take advantage of three different en face angiograms from various retinal layers, rather than following existing methods that use only a single en face angiogram. We carry out extensive experiments on three OCTA datasets acquired using different imaging devices, and the results demonstrate that the proposed method performs, on the whole, better than both state-of-the-art single-purpose methods and existing multi-task learning solutions. We also demonstrate that our multi-task learning method generalizes to other imaging modalities, such as color fundus photography, and may potentially be used as a general multi-task learning tool. We also construct three datasets for multiple structure detection; part of these datasets, together with the source code and evaluation benchmark, has been released for public access.",
keywords = "OCTA, classification, detection, multi-task learning, retina structures, segmentation",
author = "Jinkui Hao and Ting Shen and Xueli Zhu and Yonghuai Liu and Ardhendu Behera and Dan Zhang and Bang Chen and Jiang Liu and Jiong Zhang and Yitian Zhao",
note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",
year = "2022",
month = dec,
day = "1",
doi = "10.1109/TMI.2022.3202183",
language = "English",
volume = "41",
pages = "3969--3980",
journal = "IEEE Transactions on Medical Imaging",
issn = "0278-0062",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "12",
}

TY  - JOUR
T1  - Retinal Structure Detection in OCTA Image via Voting-Based Multitask Learning
AU  - Hao, Jinkui
AU  - Shen, Ting
AU  - Zhu, Xueli
AU  - Liu, Yonghuai
AU  - Behera, Ardhendu
AU  - Zhang, Dan
AU  - Chen, Bang
AU  - Liu, Jiang
AU  - Zhang, Jiong
AU  - Zhao, Yitian
PY  - 2022/12/1
Y1  - 2022/12/1
N2  - Automated detection of retinal structures, such as retinal vessels (RV), the foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), is of great importance for understanding diseases of the eye and for clinical decision-making. In this paper, we propose a novel Voting-based Adaptive Feature Fusion multi-task network (VAFF-Net) for joint segmentation, detection, and classification of RV, FAZ, and RVJ in optical coherence tomography angiography (OCTA). A task-specific voting gate module is proposed to adaptively extract and fuse different features for specific tasks at two levels: features at different spatial positions from a single encoder, and features from multiple encoders. In particular, since the complexity of the microvasculature in OCTA images makes simultaneous precise localization and classification of retinal vascular junctions into bifurcations and crossings a challenging task, we specifically design a task head that combines heatmap regression and grid classification. We take advantage of three different en face angiograms from various retinal layers, rather than following existing methods that use only a single en face angiogram. We carry out extensive experiments on three OCTA datasets acquired using different imaging devices, and the results demonstrate that the proposed method performs, on the whole, better than both state-of-the-art single-purpose methods and existing multi-task learning solutions. We also demonstrate that our multi-task learning method generalizes to other imaging modalities, such as color fundus photography, and may potentially be used as a general multi-task learning tool. We also construct three datasets for multiple structure detection; part of these datasets, together with the source code and evaluation benchmark, has been released for public access.
KW  - OCTA
KW  - classification
KW  - detection
KW  - multi-task learning
KW  - retina structures
KW  - segmentation
UR  - http://www.scopus.com/inward/record.url?scp=85137909881&partnerID=8YFLogxK
U2  - 10.1109/TMI.2022.3202183
DO  - 10.1109/TMI.2022.3202183
M3  - Article
C2  - 36044489
AN  - SCOPUS:85137909881
SN  - 0278-0062
VL  - 41
SP  - 3969
EP  - 3980
JO  - IEEE Transactions on Medical Imaging
JF  - IEEE Transactions on Medical Imaging
IS  - 12
ER  -
