TY - GEN
T1 - Retrieving and ranking short medical questions with two stages neural matching model
AU - Li, Xiang
AU - Fu, Xinyu
AU - Lu, Zheng
AU - Bai, Ruibin
AU - Aickelin, Uwe
AU - Ge, Peiming
AU - Liu, Gong
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/6
Y1 - 2019/6
N2 - Internet hospital is a rising business thanks to recent advances in mobile web technology and high demand of health care services. Online medical services become increasingly popular and active. According to US data in 2018, 80 percent of internet users have asked health-related questions online. Numerous data is generated in unprecedented speed and scale. Those representative questions and answers in medical fields are valuable raw data sources for medical data mining. Automated machine interpretation on those sheer amount of data gives an opportunity to assist doctors to answer frequently asked medical-related questions from the perspective of information retrieval and machine learning approaches. In this work, we propose a novel two-stage framework for the semantic matching of query-level medical questions, which takes advantages of sentence similarity-based search engine techniques and Siamese inspired recent recurrent neural network. The two-stage hierarchical design optimises the performance of automatic information retrieval of user queries. Compared against the classical TFIDF search technique as a single-stage, our novel soft search technique performs significantly better. Incorporating an advanced deep learning model as the second stage can improve the results further, which we believe is the new state-of-the-art in the current problem setting with the unique medical corpus from one of the largest online healthcare provider in market.
AB - Internet hospital is a rising business thanks to recent advances in mobile web technology and high demand of health care services. Online medical services become increasingly popular and active. According to US data in 2018, 80 percent of internet users have asked health-related questions online. Numerous data is generated in unprecedented speed and scale. Those representative questions and answers in medical fields are valuable raw data sources for medical data mining. Automated machine interpretation on those sheer amount of data gives an opportunity to assist doctors to answer frequently asked medical-related questions from the perspective of information retrieval and machine learning approaches. In this work, we propose a novel two-stage framework for the semantic matching of query-level medical questions, which takes advantages of sentence similarity-based search engine techniques and Siamese inspired recent recurrent neural network. The two-stage hierarchical design optimises the performance of automatic information retrieval of user queries. Compared against the classical TFIDF search technique as a single-stage, our novel soft search technique performs significantly better. Incorporating an advanced deep learning model as the second stage can improve the results further, which we believe is the new state-of-the-art in the current problem setting with the unique medical corpus from one of the largest online healthcare provider in market.
KW - information retrieval
KW - machine learning
KW - medical questions answering
UR - http://www.scopus.com/inward/record.url?scp=85071332644&partnerID=8YFLogxK
U2 - 10.1109/CEC.2019.8790326
DO - 10.1109/CEC.2019.8790326
M3 - Conference contribution
AN - SCOPUS:85071332644
T3 - 2019 IEEE Congress on Evolutionary Computation, CEC 2019 - Proceedings
SP - 873
EP - 879
BT - 2019 IEEE Congress on Evolutionary Computation, CEC 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE Congress on Evolutionary Computation, CEC 2019
Y2 - 10 June 2019 through 13 June 2019
ER -