Please use this identifier to cite or link to this item:
https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/16758
Title: | Feature Engineering for Predicting Consumer Purchase Behavior: a Comprehensive Analysis |
Authors: | Karmakar, Mousumi Al Kafi, Md Abdullah Afridi, Arafat Sahin Sabbir, Wahid Raza, Dewan Mamun |
Keywords: | Classification Consumer Purchase Behavior Feature Engineering Machine Learning Prediction |
Issue Date: | 2024 |
Publisher: | Communications in Computer and Information Science Springer Science and Business Media Deutschland GmbH |
Citation: | Vol. 2093 CCIS; pp. 135-157 |
Abstract: | Feature engineering is essential for consumer behaviour prediction machine learning models. Analyzing customer behaviour reveals the complexity of feature development. A thorough literature review found that feature engineering has improved consumer purchase behaviour model prediction accuracy in several studies. We use six machine learning algorithms: Random Forest, Decision Trees, K-Nearest Neighbors, Naive Bayes, and Logistic Regression. This study examined Decision Tree, Gradient Boosting Classifier, K-Nearest Neighbors, Random Forest, Logistic Regression, and Gaussian Naive Bayes. The models were trained and evaluated using consumer purchase activity data on demographics, product preferences, online behaviour, and temporal factors. Every model achieved 80% accuracy, with Gradient Boosting Classifier, Random Forest, Decision Tree, and Logistic Regression performing best. Due to careful feature selection and preprocessing, the six machine learning models have similar accuracy and F1 scores. Proper feature engineering techniques affect consumer purchase behaviour, which this study investigates. This paper proposes feature engineering, which is novel. A correlation matrix is more efficient and effective than traditional feature selection methods for selecting relevant features. The accuracy depends on the machine learning method and characteristics used. However, the four models that performed well in our study—Decision Tree accuracy 86%, Gradient Boosting Classifier accuracy 86%, Random Forest accuracy 86%, and Logistic Regression accuracy 86%—are reliable and trustworthy for predicting customer purchasing habits. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024. |
URI: | https://doi.org/10.1007/978-3-031-64067-4_10 https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/16758 |
ISBN: | 9783031640667 |
ISSN: | 1865-0929 |
Appears in Collections: | Conference Papers |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.