Comparison of the RFM Model's Actual Value and Score Value for Clustering
Abstract
Clustering algorithms and Recency-Frequency-Money (RFM) models are widely implemented in various sectors of e-commerce, banking, telecommunications and other industries to obtain customer segmentation. The RFM model will assess a line of data which includes the recency and frequency of data appearance, as well as the monetary value of a transaction made by a customer. Choosing the right RFM model also influences the analysis of cluster results, the output of cluster results is more compact for the same clusters (inter-cluster) and separate for other clusters (intra-cluster). Through an experimental approach, this research aims to find the best data set transformation model between actual RFM values and RFM scores. The method used is to compare the actual RFM value model and the RFM score and use the silhouette score value as an indicator to obtain the best clustering results using the K-Means algorithm. The subject of this research is a stall-based e-Commerce application, where data was taken in the Wiradesa area, Central Java. The resulting data set consisted of 273,454 rows with 18 attributes from January 2022 to December 2022 by collecting historical data from shopping outlets to wholesalers. The analysis of the data set was carried out by transforming the data set using the RFM method into actual values and score values; then the dataset was used to obtain the best cluster data. The results of this research show that transaction data based on time (time series) can be transformed into data in the RFM model where the actual value is better than the RFM score model with a silhouette score = 0.624646 and the number of clusters (K) =3. The results of the clustering process also form a series of data with a cluster label, thus forming supervised learning data.
Downloads
References
M. Y. Smaili and H. Hachimi, “Hybridization of improved binary bat algorithm for optimizing targeted offers problem in direct marketing campaigns,” Adv. Sci. Technol. Eng. Syst., vol. 5, no. 6, pp. 239–246, 2020, doi: 10.25046/aj050628.
A. J. Christy, A. Umamakeswari, L. Priyatharsini, and A. Neyaa, “RFM ranking – An effective approach to customer segmentation,” J. King Saud Univ. - Comput. Inf. Sci., vol. 33, no. 10, pp. 1251–1257, 2021, doi: 10.1016/j.jksuci.2018.09.004.
R. Srivastava, A. Parvaneh, and H. Abbasimehr, “Identification of Customer Clusters using RFM Model: A Case of Diverse Purchaser Classification,” Int. J. Bus. Anal. Intell., vol. 4, no. 2, p. 6, 2016.
S. Dibb, “Market segmentation: Strategies for success,” Mark. Intell. Plan., vol. 16, no. 7, pp. 394–406, 1998, doi: 10.1108/02634509810244390.
F. Safari, N. Safari, and G. A. Montazer, “Customer lifetime value determination based on RFM model,” Mark. Intell. Plan., vol. 34, no. 4, pp. 446–461, 2016, doi: 10.1108/MIP-03-2015-0060.
Y. Huang, M. Zhang, and Y. He, “Research on improved RFM customer segmentation model based on K-Means algorithm,” Proc. - 2020 5th Int. Conf. Comput. Intell. Appl. ICCIA 2020, pp. 24–27, 2020, doi: 10.1109/ICCIA49625.2020.00012.
J. Wei, S. Lin, and H. Wu, “A review of the application of RFM model,” African J. Bus. Manag., vol. 4, no. 19, pp. 4199–4206, 2010.
P. D. Bangsa and I. Hermawan, “Jurnal Teknologi Terpadu,” J. Teknol. Terpadu, vol. 7, no. 1, pp. 15–22, 2021.[9] J. Wu et al., “An Empirical Study on Customer Segmentation by Purchase Behaviors Using a RFM Model and K -Means Algorithm,” Math. Probl. Eng., vol. 2020, no. April 2019, 2020, doi: 10.1155/2020/8884227.
D. Chen, S. L. Sain, and K. Guo, “Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining,” J. Database Mark. Cust. Strateg. Manag., vol. 19, no. 3, pp. 197–208, 2012, doi: 10.1057/dbm.2012.17.
B. Sohrabi and A. Khanlari, “Customer lifetime value determination based on RFM model,” Mark. Intell. Plan., vol. 14, 2007, doi: 10.1108/MIP-03-2015-0060.
C. Y. Tsai and C. C. Chiu, “A purchase-based market segmentation methodology,” Expert Syst. Appl., vol. 27, no. 2, pp. 265–276, 2004, doi: 10.1016/j.eswa.2004.02.005.
S. H. Shihab, S. Afroge, and S. Z. Mishu, “RFM Based Market Segmentation Approach Using Advanced K-means and Agglomerative Clustering: A Comparative Study,” 2019 Int. Conf. Electr. Comput. Commun. Eng., pp. 1–4, 2019.
D. Devarapalli, S. Veera, V. Satya, S. Geddam, A. S. Sravya, and A. P. Devi, “Analysis of RFM Customer Segmentation Using Clustering Algorithms,” Int. J. Mech. Eng. Vol., vol. 7, no. February, 2022.
M. Aliyev, E. Ahmadov, H. Gadirli, A. Mammadova, and E. Alasgarov, “Segmenting Bank Customers via RFM Model and Unsupervised Machine Learning,” 2020.
B. Arivazhagan and G. Vijaiprabhu, “An Enhanced Hierarchical Model for Customer Segmentation in Customer Relationship Management with Demographic , Recency , Frequency and Monetary Values,” Int. J. Mech. Eng., vol. 7, no. 2, pp. 1878–1886, 2022.
D. Elzanfaly and S. Salama, “Investigation in Customer Value Segmentation Quality under Different Preprocessing Types of RFM Attributes,” vol. 4, no. 4, pp. 5–10, 2016.
A. Gülcü and S. Çalişkan, “Clustering electricity market participants via FRM models,” Intell. Decis. Technol., vol. 14, no. 4, pp. 481–492, 2020, doi: 10.3233/IDT-200092.
C. Yuan and H. Yang, “Research on K-Value Selection Method of K-Means Clustering Algorithm,” J, vol. 2, no. 2, pp. 226–235, 2019, doi: 10.3390/j2020016.
I. Karacan, I. Erdogan, and U. Cebeci, “A Comprehensive Integration of RFM Analysis, Cluster Analysis, and Classification for B2B Customer Relationship Management,” Proc. Int. Conf. Ind. Eng. Oper. Manag., pp. 497–508, 2021.
D. A. Nasution, H. H. Khotimah, and N. Chamidah, “Perbandingan Normalisasi Data untuk Klasifikasi Wine Menggunakan Algoritma K-NN,” Comput. Eng. Sci. Syst. J., vol. 4, no. 1, p. 78, 2019, doi: 10.24114/cess.v4i1.11458.
B. Rizki, N. G. Ginasta, M. A. Tamrin, and A. Rahman, “Customer Loyality Segmentation on Point of Sale System Using Recency-Frequency-Monetary (RFM) and K-Means,” J. Online Inform., vol. 5, no. 2, p. 130, 2020, doi: 10.15575/join.v5i2.511.
A. Nowak-Brzezinska and C. Horyn, “ScienceDirect Outliers Outliers in in rules - the the comparision comparision of of LOF , LOF , COF COF and and K-MEANS K-MEANS,” vol. 00, 2020, doi: 10.1016/j.procs.2020.09.152.
Copyright (c) 2023 Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright in each article belongs to the author
- The author acknowledges that the RESTI Journal (System Engineering and Information Technology) is the first publisher to publish with a license Creative Commons Attribution 4.0 International License.
- Authors can enter writing separately, arrange the non-exclusive distribution of manuscripts that have been published in this journal into other versions (eg sent to the author's institutional repository, publication in a book, etc.), by acknowledging that the manuscript has been published for the first time in the RESTI (Rekayasa Sistem dan Teknologi Informasi) journal ;