TY - GEN
T1 - Improving Healthcare Outcomes by Identifying Populations with Higher Risk of Lung Cancer from Primary Care Data
AU - Shen, Yuan
AU - Mahmud, Mufti
AU - Rai, Teena
AU - He, Jun
AU - Brown, David J.
AU - Rahman, Muhammad Arifur
AU - Kaur, Jaspreet
AU - Baldwin, David R.
AU - O’Dowd, Emma
AU - Hubbard, Richard B.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - Machine learning models have increasingly played an important role in medicine and healthcare. They can be readily adapted for clinical prognostic tasks. A prominent task in lung cancer healthcare is to select people with higher lung cancer risk from some population. The task can be undertaken using clinical predictive models along with real-world Electronic Healthcare Records. In this paper, we provide a worked example for such task using Logistic Regression as the model and using CPRD Dataset as the EHRs which cover 4.5% UK population [9]. Further, the use of clinical predictive models in cancer care has gone beyond cancer screening programme. That is, such models can also be employed to perform a variety of cancer healthcare management tasks. In this paper, we provide six “lung cancer”-related use cases to illustrate task diversity. It is also demonstrated that each of 6 use cases has chosen their appropriate set of prognostic predictors to optimally perform their task. Last, their task performance is also critically evaluated. Domains such as medicine and healthcare require trustworthiness and accountability. To meet this challenge, Explainable Artificial Intelligence (XAI) techniques have been timely developed. In this paper, we introduced impurity-, permutation-, LIME-, and SHAP-based importance measures. These XAI techniques were applied to 6 use cases for variable importance analysis. Last, we used domain-specific knowledge to critically interpret their XAI results. We also briefly reviewed a model-specific XAI application. It relies on knowledge-based constraints.
AB - Machine learning models have increasingly played an important role in medicine and healthcare. They can be readily adapted for clinical prognostic tasks. A prominent task in lung cancer healthcare is to select people with higher lung cancer risk from some population. The task can be undertaken using clinical predictive models along with real-world Electronic Healthcare Records. In this paper, we provide a worked example for such task using Logistic Regression as the model and using CPRD Dataset as the EHRs which cover 4.5% UK population [9]. Further, the use of clinical predictive models in cancer care has gone beyond cancer screening programme. That is, such models can also be employed to perform a variety of cancer healthcare management tasks. In this paper, we provide six “lung cancer”-related use cases to illustrate task diversity. It is also demonstrated that each of 6 use cases has chosen their appropriate set of prognostic predictors to optimally perform their task. Last, their task performance is also critically evaluated. Domains such as medicine and healthcare require trustworthiness and accountability. To meet this challenge, Explainable Artificial Intelligence (XAI) techniques have been timely developed. In this paper, we introduced impurity-, permutation-, LIME-, and SHAP-based importance measures. These XAI techniques were applied to 6 use cases for variable importance analysis. Last, we used domain-specific knowledge to critically interpret their XAI results. We also briefly reviewed a model-specific XAI application. It relies on knowledge-based constraints.
KW - Detection of early-stage lung cancer
KW - Interpretable representations
KW - Missed lung cancer cases
KW - Stability of XAI
UR - https://www.scopus.com/pages/publications/105009983862
U2 - 10.1007/978-981-96-6588-4_11
DO - 10.1007/978-981-96-6588-4_11
M3 - Conference contribution
AN - SCOPUS:105009983862
SN - 9789819665877
T3 - Lecture Notes in Computer Science
SP - 152
EP - 166
BT - Neural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
A2 - Mahmud, Mufti
A2 - Doborjeh, Maryam
A2 - Wong, Kevin
A2 - Leung, Andrew Chi Sing
A2 - Doborjeh, Zohreh
A2 - Tanveer, M.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 31st International Conference on Neural Information Processing, ICONIP 2024
Y2 - 2 December 2024 through 6 December 2024
ER -