Abstract
This study focuses on the development of a stacked model, named Cluster Boost, which integrates K-means clustering and Gradient Boosting to analyse customer behaviour in e-commerce. Cluster Boost harnesses the strengths of both unsupervised and supervised learning methods, combining them to create a more robust and accurate predictive model. The proposed Cluster Boost model achieves an optimal testing accuracy of 97.92%, significantly outperforming individual models such as Random Forest and Decision Tree on a public ecommerce customer churn analysis dataset. It also demonstrates an impressive precision of 95.37%, a recall of 91.96%, and an F 1 -score of 93.64%. These results underscore the model's potential for accurately predicting and understanding customer behaviour in e-commerce, highlighting the advantages of stacking K-means clustering with Gradient Boosting to capture intricate data relationships and enhance predictive accuracy.
Original language | English |
---|---|
Pages (from-to) | 119-124 |
Number of pages | 6 |
Journal | Proceedings of the IEEE Conference on Systems, Process and Control, ICSPC |
Issue number | 2024 |
DOIs | |
Publication status | Published - 2024 |
Event | 12th IEEE Conference on Systems, Process and Control, ICSPC 2024 - Malacca, Malaysia Duration: 7 Dec 2024 → … |
Keywords
- Customer Behaviour Analysis
- Customer Churn
- Gradient Boosting
- K-Means Clustering
- Machine Learning
- Stacking Model
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Information Systems
- Information Systems and Management
- Safety, Risk, Reliability and Quality
- Control and Optimization
- Modelling and Simulation
- Education