Improving Healthcare Outcomes by Identifying Populations with Higher Risk of Lung Cancer from Primary Care Data

Yuan Shen, Mufti Mahmud, Teena Rai, Jun He, David J. Brown, Muhammad Arifur Rahman, Jaspreet Kaur, David R. Baldwin, Emma O’Dowd, Richard B. Hubbard

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Machine learning models have increasingly played an important role in medicine and healthcare. They can be readily adapted for clinical prognostic tasks. A prominent task in lung cancer healthcare is to select people with higher lung cancer risk from some population. The task can be undertaken using clinical predictive models along with real-world Electronic Healthcare Records. In this paper, we provide a worked example for such task using Logistic Regression as the model and using CPRD Dataset as the EHRs which cover 4.5% UK population [9]. Further, the use of clinical predictive models in cancer care has gone beyond cancer screening programme. That is, such models can also be employed to perform a variety of cancer healthcare management tasks. In this paper, we provide six “lung cancer”-related use cases to illustrate task diversity. It is also demonstrated that each of 6 use cases has chosen their appropriate set of prognostic predictors to optimally perform their task. Last, their task performance is also critically evaluated. Domains such as medicine and healthcare require trustworthiness and accountability. To meet this challenge, Explainable Artificial Intelligence (XAI) techniques have been timely developed. In this paper, we introduced impurity-, permutation-, LIME-, and SHAP-based importance measures. These XAI techniques were applied to 6 use cases for variable importance analysis. Last, we used domain-specific knowledge to critically interpret their XAI results. We also briefly reviewed a model-specific XAI application. It relies on knowledge-based constraints.

Original languageEnglish
Title of host publicationNeural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
EditorsMufti Mahmud, Maryam Doborjeh, Kevin Wong, Andrew Chi Sing Leung, Zohreh Doborjeh, M. Tanveer
PublisherSpringer Science and Business Media Deutschland GmbH
Pages152-166
Number of pages15
ISBN (Print)9789819665877
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
Duration: 2 Dec 20246 Dec 2024

Publication series

NameLecture Notes in Computer Science
Volume15290 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
Country/TerritoryNew Zealand
CityAuckland
Period2/12/246/12/24

Keywords

  • Detection of early-stage lung cancer
  • Interpretable representations
  • Missed lung cancer cases
  • Stability of XAI

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Improving Healthcare Outcomes by Identifying Populations with Higher Risk of Lung Cancer from Primary Care Data'. Together they form a unique fingerprint.

Cite this