Generative caption for diabetic retinopathy images

Luhui Wu; Cheng Wan; Yiquan Wu; Jiang Liu

doi:10.1109/SPAC.2017.8304332

Generative caption for diabetic retinopathy images

Luhui Wu, Cheng Wan, Yiquan Wu, Jiang Liu

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

22 Citations (Scopus)

Abstract

For a long time, the detection of diabetic retinopathy has always been a great challenge. People want to find a fast and effective computer-aided treatment to diagnose the disease. In recent years, the rapid development of the deep learning makes it gradually become an effective technique for the analysis of medical images. In this paper, we propose a method to deal with diabetic retinopathy images with generative caption technique of images to generate a simple sequence to explain the abnormal contents in fundus images. The generative technique of images is a generative model based on a deep recurrent architecture that combines convolution neural network (CNN) which is currently state-of-the-art for object recognition and detection with long-short-term-memory (LSTM) which is applied with great success to machine translation and sequence generation, and that can be used to generate natural sentences describing an image. The target of the model in training is to maximize the likelihood of the target description sentence given from the training images. The model built on dataset DIARETDB0, DIARETDB1 and Messidor can achieve good performance and generate fluent sequences. In addition, the experimental results show that the accuracy of diagnosis for individual abnormal discoveries is up to 88.53% and the diagnosis accuracy is more than 90%.

Original language	English
Title of host publication	2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	515-519
Number of pages	5
ISBN (Electronic)	9781538630167
DOIs	https://doi.org/10.1109/SPAC.2017.8304332
Publication status	Published - 2 Jul 2017
Externally published	Yes
Event	2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017 - Shenzhen, China Duration: 15 Dec 2017 → 17 Dec 2017

Publication series

Name	2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017
Volume	2018-January

Conference

Conference	2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017
Country/Territory	China
City	Shenzhen
Period	15/12/17 → 17/12/17

Keywords

Deep Learning
Diabetic Retinopathy
Image Caption
Retinopathy Lesions

ASJC Scopus subject areas

Safety, Risk, Reliability and Quality
Computer Vision and Pattern Recognition
Artificial Intelligence
Computer Networks and Communications
Computer Science Applications

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/SPAC.2017.8304332

Cite this

Wu, L., Wan, C., Wu, Y., & Liu, J. (2017). Generative caption for diabetic retinopathy images. In 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017 (pp. 515-519). (2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017; Vol. 2018-January). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SPAC.2017.8304332

@inproceedings{6a287e7bb9bb41e6bd0e320e7436c2db,

title = "Generative caption for diabetic retinopathy images",

abstract = "For a long time, the detection of diabetic retinopathy has always been a great challenge. People want to find a fast and effective computer-aided treatment to diagnose the disease. In recent years, the rapid development of the deep learning makes it gradually become an effective technique for the analysis of medical images. In this paper, we propose a method to deal with diabetic retinopathy images with generative caption technique of images to generate a simple sequence to explain the abnormal contents in fundus images. The generative technique of images is a generative model based on a deep recurrent architecture that combines convolution neural network (CNN) which is currently state-of-the-art for object recognition and detection with long-short-term-memory (LSTM) which is applied with great success to machine translation and sequence generation, and that can be used to generate natural sentences describing an image. The target of the model in training is to maximize the likelihood of the target description sentence given from the training images. The model built on dataset DIARETDB0, DIARETDB1 and Messidor can achieve good performance and generate fluent sequences. In addition, the experimental results show that the accuracy of diagnosis for individual abnormal discoveries is up to 88.53% and the diagnosis accuracy is more than 90%.",

keywords = "Deep Learning, Diabetic Retinopathy, Image Caption, Retinopathy Lesions",

author = "Luhui Wu and Cheng Wan and Yiquan Wu and Jiang Liu",

note = "Publisher Copyright: {\textcopyright} 2017 IEEE.; 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017 ; Conference date: 15-12-2017 Through 17-12-2017",

year = "2017",

month = jul,

day = "2",

doi = "10.1109/SPAC.2017.8304332",

language = "English",

series = "2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "515--519",

booktitle = "2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017",

address = "United States",

}

Wu, L, Wan, C, Wu, Y & Liu, J 2017, Generative caption for diabetic retinopathy images. in 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017. 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017, vol. 2018-January, Institute of Electrical and Electronics Engineers Inc., pp. 515-519, 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017, Shenzhen, China, 15/12/17. https://doi.org/10.1109/SPAC.2017.8304332

Generative caption for diabetic retinopathy images. / Wu, Luhui; Wan, Cheng; Wu, Yiquan et al.
2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017. Institute of Electrical and Electronics Engineers Inc., 2017. p. 515-519 (2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017; Vol. 2018-January).

Research output: Chapter in Book/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Generative caption for diabetic retinopathy images

AU - Wu, Luhui

AU - Wan, Cheng

AU - Wu, Yiquan

AU - Liu, Jiang

PY - 2017/7/2

Y1 - 2017/7/2

N2 - For a long time, the detection of diabetic retinopathy has always been a great challenge. People want to find a fast and effective computer-aided treatment to diagnose the disease. In recent years, the rapid development of the deep learning makes it gradually become an effective technique for the analysis of medical images. In this paper, we propose a method to deal with diabetic retinopathy images with generative caption technique of images to generate a simple sequence to explain the abnormal contents in fundus images. The generative technique of images is a generative model based on a deep recurrent architecture that combines convolution neural network (CNN) which is currently state-of-the-art for object recognition and detection with long-short-term-memory (LSTM) which is applied with great success to machine translation and sequence generation, and that can be used to generate natural sentences describing an image. The target of the model in training is to maximize the likelihood of the target description sentence given from the training images. The model built on dataset DIARETDB0, DIARETDB1 and Messidor can achieve good performance and generate fluent sequences. In addition, the experimental results show that the accuracy of diagnosis for individual abnormal discoveries is up to 88.53% and the diagnosis accuracy is more than 90%.

AB - For a long time, the detection of diabetic retinopathy has always been a great challenge. People want to find a fast and effective computer-aided treatment to diagnose the disease. In recent years, the rapid development of the deep learning makes it gradually become an effective technique for the analysis of medical images. In this paper, we propose a method to deal with diabetic retinopathy images with generative caption technique of images to generate a simple sequence to explain the abnormal contents in fundus images. The generative technique of images is a generative model based on a deep recurrent architecture that combines convolution neural network (CNN) which is currently state-of-the-art for object recognition and detection with long-short-term-memory (LSTM) which is applied with great success to machine translation and sequence generation, and that can be used to generate natural sentences describing an image. The target of the model in training is to maximize the likelihood of the target description sentence given from the training images. The model built on dataset DIARETDB0, DIARETDB1 and Messidor can achieve good performance and generate fluent sequences. In addition, the experimental results show that the accuracy of diagnosis for individual abnormal discoveries is up to 88.53% and the diagnosis accuracy is more than 90%.

KW - Deep Learning

KW - Diabetic Retinopathy

KW - Image Caption

KW - Retinopathy Lesions

UR - http://www.scopus.com/inward/record.url?scp=85050596508&partnerID=8YFLogxK

U2 - 10.1109/SPAC.2017.8304332

DO - 10.1109/SPAC.2017.8304332

M3 - Conference contribution

AN - SCOPUS:85050596508

T3 - 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017

SP - 515

EP - 519

BT - 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2017 International Conference on Security, Pattern Analysis, and Cybernetics, SPAC 2017

Y2 - 15 December 2017 through 17 December 2017

ER -

Generative caption for diabetic retinopathy images

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this