Seleksi Fitur Berbasis Pearson Correlation Untuk Optimasi Opinion Mining Review Pelanggan
Abstract
The comments contained on e-commerce users generally contain opinions about positive or negative experiences at several online shops. Sentences that can be written indirectly both a little or a lot, will affect other potential customers. So as a result of these comments cause a product sold at an online store has a rating of two things namely "recommended" or "non-recommended". However, detection of positive and negative opinions manually will require more time because of the large amount of data. For this reason opinion mining using technology in data mining can be used to automate positive and negative detection of comments. However, one of the main problems in opinion mining is limited data but has a large number of attributes. In this study, we propose the application of Pearson correlation (PC) based feature selection for opinion mining optimization. The results of the experiment show that the application of PC increases the performance of opinion mining systems in 3 types of classification, namely Logistic Regression, Naïve Bayes and Support Vector Machine, resulting in more optimal accuracy, namely 98.80%, 87.87% and 98.12%.
Downloads
References
Widiyanto, I., & Prasilowati, S. L. (2015). Perilaku Pembelian Melalui Internet. Jurnal Manajemen Dan Kewirausahaan (Journal of Management and Entrepreneurship), 17(2), 109–112. https://doi.org/10.9744/jmk.17.2.109-122.
Agustina, L., & Fayardi, A. O. (2019). Online Review : Indikator Penilaian Kredibilitas Online dalam Platform E-commerce. (4), 141–154.
Kusumasondjaja, S., Shanka, T., & Marchegiani, C. (2012). Journal of Vacation Marketing. https://doi.org/10.1177/1356766712449365
C, A. R., Lukito, Y., Informatika, P. T., Informasi, F. T., Kristen, U., & Wacana, D. (2017). Deteksi Komentar Spam Bahasa Indonesia Pada Instagram Menggunakan Naive Bayes. IX(1).
Asghar, M. Z., Kundi, F. M., Khan, A., & Ahmad, S. (2014). Lexicon-Based Sentiment Analysis in the Social Web. J. Basic. Appl. Sci. Res.
Zubrinic, K., SJEKAVICA, T., MILICEVIC, M., & OBRADOVIC, I. (2018). A Comparison of Machine Learning Algorithms in Opinion Polarity Classification of Customer Reviews. International Journal of Computers, 3, 159–163.
Wen, H., & Zhao, J. (2017). Aspect term extraction of E-commerce comments based on model ensemble. 2016 13th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP 2017, 2018-February,24–27. https://doi.org/10.1109/ICCWAMTIP.2017.8301421
Purwanto, D. D., & Santoso, J. (2015). Multinomial Naïve Bayes Classifier Untuk Menentukan Review. (March), 117–122. Retrieved from https://www.researchgate.net/publication/319256329%0AMULTINOMIAL
Rozy, F., Rangkuti, S., Fauzi, M. A., Sari, Y. A., Dewi, E., & Sari, L. (2018). Analisis Sentimen Opini Film Menggunakan Metode Naïve Bayes dengan Ensemble Feature dan Seleksi Fitur Pearson Correlation Coefficient. Jurnal Pengembangan Teknologi Informasi Dan Ilmu Komputer (J-PTIIK) Universitas Brawijaya, 2(12), 6354–6361.
Sharma, A., & Dey, S. (2012). Performance Investigation of Feature Selection Methods and Sentiment Lexicons for Sentiment Analysis. International Journal of Computer Applications, (June), 15–20. Retrieved from http://scholar.google.com/scholar?hl=en&btnG=Search&q=intitle:Performance+Investigation+of+Feature+Selection+Methods+and+Sentiment+Lexicons+for+Sentiment+Analysis#0
Sugiyono. (2013). Metode Penelitian Pendidikan Pendekatan Kuantitatif, Kualitatif, dan R&D. Bandung: Alfabeta.
Shardlow, M. (2016). An Analysis of Feature Selection Techniques. The University of Manchester, (1), 1–7. Retrieved from http://syllabus.cs.manchester.ac.uk/pgt/2018/COMP61011/goodProjects/Shardlow.pdf%0Ahttps://studentnet.cs.manchester.ac.uk/pgt/COMP61011/goodProjects/Shardlow.pdf%0Ahttp://ro.utia.czhttp//poseidon.csd.auth.gr%0Ahttp://clopinet.com/isabelle/Projects/NIPS200
Copyright (c) 2019 Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright in each article belongs to the author
- The author acknowledges that the RESTI Journal (System Engineering and Information Technology) is the first publisher to publish with a license Creative Commons Attribution 4.0 International License.
- Authors can enter writing separately, arrange the non-exclusive distribution of manuscripts that have been published in this journal into other versions (eg sent to the author's institutional repository, publication in a book, etc.), by acknowledging that the manuscript has been published for the first time in the RESTI (Rekayasa Sistem dan Teknologi Informasi) journal ;