Cluster Boost: Stacked Machine Learning Model for Customer Behaviour Analysis in E-commerce

Zheng Jie Wong, Jit Yan Lim, Kian Ming Lim, Chin Poo Lee, Yong Xuan Tan, Ee Mae Ang

Research output: Journal PublicationConference articlepeer-review

Abstract

This study focuses on the development of a stacked model, named Cluster Boost, which integrates K-means clustering and Gradient Boosting to analyse customer behaviour in e-commerce. Cluster Boost harnesses the strengths of both unsupervised and supervised learning methods, combining them to create a more robust and accurate predictive model. The proposed Cluster Boost model achieves an optimal testing accuracy of 97.92%, significantly outperforming individual models such as Random Forest and Decision Tree on a public ecommerce customer churn analysis dataset. It also demonstrates an impressive precision of 95.37%, a recall of 91.96%, and an F 1 -score of 93.64%. These results underscore the model's potential for accurately predicting and understanding customer behaviour in e-commerce, highlighting the advantages of stacking K-means clustering with Gradient Boosting to capture intricate data relationships and enhance predictive accuracy.

Original languageEnglish
Pages (from-to)119-124
Number of pages6
JournalProceedings of the IEEE Conference on Systems, Process and Control, ICSPC
Issue number2024
DOIs
Publication statusPublished - 2024
Event12th IEEE Conference on Systems, Process and Control, ICSPC 2024 - Malacca, Malaysia
Duration: 7 Dec 2024 → …

Keywords

  • Customer Behaviour Analysis
  • Customer Churn
  • Gradient Boosting
  • K-Means Clustering
  • Machine Learning
  • Stacking Model

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality
  • Control and Optimization
  • Modelling and Simulation
  • Education

Cite this