Cross-document attention-based gated fusion network for automated medical licensing exam

Jiandong Liu; Jianfeng Ren; Zheng Lu; Wentao He; Menglin Cui; Zibo Zhang; Ruibin Bai

doi:10.1016/j.eswa.2022.117588

Cross-document attention-based gated fusion network for automated medical licensing exam

Jiandong Liu, Jianfeng Ren, Zheng Lu, Wentao He, Menglin Cui, Zibo Zhang, Ruibin Bai

School of Computer Science

Research output: Journal Publication › Article › peer-review

9 Citations (Scopus)

Abstract

One of the applications of machine-learning in the medical industry is to automatically learn knowledge from medical textbooks and transfer medical knowledge into diagnosis abilities. Because of complex nature of medical issues, the learning process usually requires multiple knowledge documents to form a comprehensive reasoning chain for diagnosis, which increases the difficulty of the automatic learning process. Existing models for multiple document comprehension either concatenate multiple documents together for inference or reason on every document independently. In this paper, we propose a Co-Attention-based Multi-document Inference (CAMI) framework for better reasoning over multiple documents. The proposed framework makes use of not only the attentional information among questions, answers and support documents but also the complementary attentional information across different documents. In addition, a gated fusion network is designed to fuse the cross-document information. The proposed model outperforms the state-of-the-art methods on Chinese National Medical Licensing Examination (CNMLE) dataset, ClinicQA, which contains 27,432 plain text documents and 13,827 CNMLE questions. We intend to make it publicly available as the first clinical OpenQA dataset.

Original language	English
Article number	117588
Journal	Expert Systems with Applications
Volume	205
DOIs	https://doi.org/10.1016/j.eswa.2022.117588
Publication status	Published - 1 Nov 2022

Keywords

Clinical diagnosis
Machine reading comprehension
Multiple document reasoning

ASJC Scopus subject areas

General Engineering
Computer Science Applications
Artificial Intelligence

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1016/j.eswa.2022.117588

Cite this

@article{cfb2028a46894017b241af84e601dd44,

title = "Cross-document attention-based gated fusion network for automated medical licensing exam",

abstract = "One of the applications of machine-learning in the medical industry is to automatically learn knowledge from medical textbooks and transfer medical knowledge into diagnosis abilities. Because of complex nature of medical issues, the learning process usually requires multiple knowledge documents to form a comprehensive reasoning chain for diagnosis, which increases the difficulty of the automatic learning process. Existing models for multiple document comprehension either concatenate multiple documents together for inference or reason on every document independently. In this paper, we propose a Co-Attention-based Multi-document Inference (CAMI) framework for better reasoning over multiple documents. The proposed framework makes use of not only the attentional information among questions, answers and support documents but also the complementary attentional information across different documents. In addition, a gated fusion network is designed to fuse the cross-document information. The proposed model outperforms the state-of-the-art methods on Chinese National Medical Licensing Examination (CNMLE) dataset, ClinicQA, which contains 27,432 plain text documents and 13,827 CNMLE questions. We intend to make it publicly available as the first clinical OpenQA dataset.",

keywords = "Clinical diagnosis, Machine reading comprehension, Multiple document reasoning",

author = "Jiandong Liu and Jianfeng Ren and Zheng Lu and Wentao He and Menglin Cui and Zibo Zhang and Ruibin Bai",

note = "Publisher Copyright: {\textcopyright} 2022 Elsevier Ltd",

year = "2022",

month = nov,

day = "1",

doi = "10.1016/j.eswa.2022.117588",

language = "English",

volume = "205",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Cross-document attention-based gated fusion network for automated medical licensing exam

AU - Liu, Jiandong

AU - Ren, Jianfeng

AU - Lu, Zheng

AU - He, Wentao

AU - Cui, Menglin

AU - Zhang, Zibo

AU - Bai, Ruibin

PY - 2022/11/1

Y1 - 2022/11/1

N2 - One of the applications of machine-learning in the medical industry is to automatically learn knowledge from medical textbooks and transfer medical knowledge into diagnosis abilities. Because of complex nature of medical issues, the learning process usually requires multiple knowledge documents to form a comprehensive reasoning chain for diagnosis, which increases the difficulty of the automatic learning process. Existing models for multiple document comprehension either concatenate multiple documents together for inference or reason on every document independently. In this paper, we propose a Co-Attention-based Multi-document Inference (CAMI) framework for better reasoning over multiple documents. The proposed framework makes use of not only the attentional information among questions, answers and support documents but also the complementary attentional information across different documents. In addition, a gated fusion network is designed to fuse the cross-document information. The proposed model outperforms the state-of-the-art methods on Chinese National Medical Licensing Examination (CNMLE) dataset, ClinicQA, which contains 27,432 plain text documents and 13,827 CNMLE questions. We intend to make it publicly available as the first clinical OpenQA dataset.

AB - One of the applications of machine-learning in the medical industry is to automatically learn knowledge from medical textbooks and transfer medical knowledge into diagnosis abilities. Because of complex nature of medical issues, the learning process usually requires multiple knowledge documents to form a comprehensive reasoning chain for diagnosis, which increases the difficulty of the automatic learning process. Existing models for multiple document comprehension either concatenate multiple documents together for inference or reason on every document independently. In this paper, we propose a Co-Attention-based Multi-document Inference (CAMI) framework for better reasoning over multiple documents. The proposed framework makes use of not only the attentional information among questions, answers and support documents but also the complementary attentional information across different documents. In addition, a gated fusion network is designed to fuse the cross-document information. The proposed model outperforms the state-of-the-art methods on Chinese National Medical Licensing Examination (CNMLE) dataset, ClinicQA, which contains 27,432 plain text documents and 13,827 CNMLE questions. We intend to make it publicly available as the first clinical OpenQA dataset.

KW - Clinical diagnosis

KW - Machine reading comprehension

KW - Multiple document reasoning

UR - http://www.scopus.com/inward/record.url?scp=85131222205&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2022.117588

DO - 10.1016/j.eswa.2022.117588

M3 - Article

AN - SCOPUS:85131222205

SN - 0957-4174

VL - 205

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 117588

ER -

Cross-document attention-based gated fusion network for automated medical licensing exam

Abstract

Keywords

ASJC Scopus subject areas

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this