Advancing an LDA-GMM-CorEx topic model with prior domain knowledge in information systems research

Yuting Jiang, Mengyao Fu, Jie Fang, Matti Rossi, Yuting Wang, Chee Wee Tan

Research output: Journal PublicationArticlepeer-review

Abstract

Embedding topic models with domain knowledge is deemed to be effective in bolstering the models’ interpretability. Nevertheless, contemporary topic modeling techniques introduced in past studies lack consideration for circumstances in which prior domain knowledge either does not exist or becomes obsolete quickly. Combining the latent Dirichlet allocation (LDA) with the Gaussian mixture model (GMM) and the anchor correlation explanation (CorEx) topic model, we advanced a novel LDA-GMM-CorEx topic modeling approach to enhance the domain knowledge model's adaptability and improve the interpretability of topic modeling. We further verified the effectiveness of our proposed topic modeling approach on two separate datasets from different domains, thereby attesting to its general applicability.

Original languageEnglish
Article number104097
JournalInformation and Management
Volume62
Issue number2
DOIs
Publication statusPublished - Mar 2025

Keywords

  • CorEx
  • GMM
  • LDA
  • Topic-model embedded domain knowledge

ASJC Scopus subject areas

  • Management Information Systems
  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Advancing an LDA-GMM-CorEx topic model with prior domain knowledge in information systems research'. Together they form a unique fingerprint.

Cite this