Semi-Supervised Surgical Video Semantic Segmentation with Cross Supervision of Inter-Frame

Derui Li; Yan Hu; Junyong Shen; Luoying Hao; Jiang Liu

doi:10.1109/ISBI53787.2023.10230375

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

Abstract

Accurate surgical video semantic segmentation is vital for computer-aided surgery. Because pixel-level segmentation labels are very difficult to obtain from doctors or researchers, semi-supervised algorithms produce pseudo labels to compensate for the lack of labels. However, most of these algorithms treat videos as independent images, which cannot solve some issues caused by complex surgery scenarios, such as blurred instruments. We propose a novel Cross Supervision of Inter-frame (CSI) method that uses inter-frame information from surgery video to crosswise supervise semantic segmentation. Specifically, we design Inter-frame Information Transformation (I2T) modules to mutually transfer features with class prototypes between consecutive frames. In addition, we use ground truth to supervise inter-frame features for labeled frames, and for unlabeled frames we propose a cross pseudo loss and a pixel-wise contrastive loss as constraints. Extensive experiments on a publicly available cataract surgery dataset show that our CSI method improves segmentation accuracy by considering inter-frame information.

Original language	English
Title of host publication	2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023
Publisher	IEEE Computer Society
ISBN (Electronic)	9781665473583
DOIs	https://doi.org/10.1109/ISBI53787.2023.10230375
Publication status	Published - 2023
Externally published	Yes
Event	20th IEEE International Symposium on Biomedical Imaging, ISBI 2023 - Cartagena, Colombia
Duration	18 Apr 2023 → 21 Apr 2023

Publication series

Name	Proceedings - International Symposium on Biomedical Imaging
Volume	2023-April
ISSN (Print)	1945-7928
ISSN (Electronic)	1945-8452

Conference

Conference	20th IEEE International Symposium on Biomedical Imaging, ISBI 2023
Country/Territory	Colombia
City	Cartagena
Period	18/04/23 → 21/04/23

Keywords

Cataract surgery
Inter-frame
Segmentation
Semi-supervised

ASJC Scopus subject areas

Biomedical Engineering
Radiology, Nuclear Medicine and Imaging

Access to Document

10.1109/ISBI53787.2023.10230375

Cite this

@inproceedings{16d594c9b16f49a8bf63fadabbceee2c,

title = "Semi-Supervised Surgical Video Semantic Segmentation with Cross Supervision of Inter-Frame",

abstract = "Accurate surgical video semantic segmentation is vital for computer-aided surgery. Semi-supervised algorithms produce pseudo labels to solve the problem of the lack of labels, as it is very difficult to obtain the pixel-level segmentation labels from doctors or researchers. However, most of the algorithms consider the videos as independent images, which cannot solve some issues caused by complex surgery scenarios, such as blurred instruments. The paper proposes a novel Cross Supervision of Inter-frame (CSI) method using inter-frame information from surgery video to crosswise supervise semantic segmentation. Specifically, we design Inter-frame Information Transformation (I2T) modules to transfer features with class prototypes between continuous frames mutually. Besides, we utilize ground truth to supervise inter-frame features for labeled frames, and for unlabeled frames, we propose a cross pseudo loss and a pixel-wise contrastive loss as the constraints. Extensive experiments are performed on a publicly available cataract surgery dataset, which proves that our CSI method improves the segmentation accuracy after considering the inter-frame information.",

keywords = "Cataract surgery, Inter-frame, Segmentation, Semi-supervised",

author = "Derui Li and Yan Hu and Junyong Shen and Luoying Hao and Jiang Liu",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023 ; Conference date: 18-04-2023 Through 21-04-2023",

year = "2023",

doi = "10.1109/ISBI53787.2023.10230375",

language = "English",

series = "Proceedings - International Symposium on Biomedical Imaging",

publisher = "IEEE Computer Society",

booktitle = "2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023",

address = "United States",

}

Li, D, Hu, Y, Shen, J, Hao, L & Liu, J 2023, Semi-Supervised Surgical Video Semantic Segmentation with Cross Supervision of Inter-Frame. in 2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023. Proceedings - International Symposium on Biomedical Imaging, vol. 2023-April, IEEE Computer Society, 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023, Cartagena, Colombia, 18/04/23. https://doi.org/10.1109/ISBI53787.2023.10230375

Semi-Supervised Surgical Video Semantic Segmentation with Cross Supervision of Inter-Frame. / Li, Derui; Hu, Yan; Shen, Junyong et al.
2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023. IEEE Computer Society, 2023. (Proceedings - International Symposium on Biomedical Imaging; Vol. 2023-April).

TY - GEN

T1 - Semi-Supervised Surgical Video Semantic Segmentation with Cross Supervision of Inter-Frame

AU - Li, Derui

AU - Hu, Yan

AU - Shen, Junyong

AU - Hao, Luoying

AU - Liu, Jiang

PY - 2023

Y1 - 2023

N2 - Accurate surgical video semantic segmentation is vital for computer-aided surgery. Semi-supervised algorithms produce pseudo labels to solve the problem of the lack of labels, as it is very difficult to obtain the pixel-level segmentation labels from doctors or researchers. However, most of the algorithms consider the videos as independent images, which cannot solve some issues caused by complex surgery scenarios, such as blurred instruments. The paper proposes a novel Cross Supervision of Inter-frame (CSI) method using inter-frame information from surgery video to crosswise supervise semantic segmentation. Specifically, we design Inter-frame Information Transformation (I2T) modules to transfer features with class prototypes between continuous frames mutually. Besides, we utilize ground truth to supervise inter-frame features for labeled frames, and for unlabeled frames, we propose a cross pseudo loss and a pixel-wise contrastive loss as the constraints. Extensive experiments are performed on a publicly available cataract surgery dataset, which proves that our CSI method improves the segmentation accuracy after considering the inter-frame information.

AB - Accurate surgical video semantic segmentation is vital for computer-aided surgery. Semi-supervised algorithms produce pseudo labels to solve the problem of the lack of labels, as it is very difficult to obtain the pixel-level segmentation labels from doctors or researchers. However, most of the algorithms consider the videos as independent images, which cannot solve some issues caused by complex surgery scenarios, such as blurred instruments. The paper proposes a novel Cross Supervision of Inter-frame (CSI) method using inter-frame information from surgery video to crosswise supervise semantic segmentation. Specifically, we design Inter-frame Information Transformation (I2T) modules to transfer features with class prototypes between continuous frames mutually. Besides, we utilize ground truth to supervise inter-frame features for labeled frames, and for unlabeled frames, we propose a cross pseudo loss and a pixel-wise contrastive loss as the constraints. Extensive experiments are performed on a publicly available cataract surgery dataset, which proves that our CSI method improves the segmentation accuracy after considering the inter-frame information.

KW - Cataract surgery

KW - Inter-frame

KW - Segmentation

KW - Semi-supervised

UR - http://www.scopus.com/inward/record.url?scp=85172167317&partnerID=8YFLogxK

U2 - 10.1109/ISBI53787.2023.10230375

DO - 10.1109/ISBI53787.2023.10230375

M3 - Conference contribution

AN - SCOPUS:85172167317

T3 - Proceedings - International Symposium on Biomedical Imaging

BT - 2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023

PB - IEEE Computer Society

T2 - 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023

Y2 - 18 April 2023 through 21 April 2023

ER -
