Use of Feature Selection in the Support Vector Machine Algorithm for Sentiment Analysis of the General Election Commission
Abstract
Sentiment analysis is now widely used to gauge public sentiment toward an object. The object can be of many kinds, for example a product and its reception by consumers, or an agency or institution and its performance. This study analyzes sentiment toward a state institution, the General Election Commission (KPU), focusing on the conduct of the simultaneous elections and on their results, both of which have been widely discussed by netizens on social media. The data consist of retweets and comments from Twitter users. The algorithm used is the Support Vector Machine (SVM), optimized with Weight by Correlation Feature Selection (FS). Under cross-validation, SVM without FS reaches an accuracy of 66.49% and an AUC of 0.716, whereas SVM with FS reaches an accuracy of 81.18% and an AUC of 0.943. Weight by Correlation Feature Selection therefore yields a substantial improvement: 14.69 percentage points in accuracy and 0.227 in AUC.
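The comparison described in the abstract can be illustrated with a minimal sketch. Weight by Correlation is the name of a RapidMiner operator; the code below only approximates the idea in scikit-learn by scoring each TF-IDF feature with its absolute Pearson correlation to the label and keeping the top-scoring ones. The toy sentences, the helper name `abs_correlation`, the choice of `k=10`, the linear kernel, and the 3-fold split are all assumptions made for illustration and are not taken from the study.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC


def abs_correlation(X, y):
    """Absolute Pearson correlation of each feature column with the label.

    Hypothetical helper: an approximation of correlation-based weighting,
    not the RapidMiner operator itself.
    """
    X = X.toarray() if hasattr(X, "toarray") else np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0) * (yc ** 2).sum())
    # Constant columns have zero variance; give them weight 0 instead of NaN.
    corr = np.divide(Xc.T @ yc, denom, out=np.zeros(X.shape[1]), where=denom > 0)
    return np.abs(corr)


# Invented toy corpus (1 = positive, 0 = negative); not the study's Twitter data.
texts = [
    "the election ran smoothly and fairly",
    "great job by the commission this year",
    "vote counting was transparent and fast",
    "proud of how the election was organized",
    "officers were helpful and professional",
    "results were announced clearly and on time",
    "the election was chaotic and unfair",
    "terrible job by the commission this year",
    "vote counting was slow and suspicious",
    "disappointed with how the election was organized",
    "officers were rude and unprepared",
    "results were delayed and confusing",
]
labels = np.array([1] * 6 + [0] * 6)


def evaluate(pipeline):
    """Return mean cross-validated accuracy and AUC for a pipeline."""
    cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
    res = cross_validate(pipeline, texts, labels, cv=cv,
                         scoring=["accuracy", "roc_auc"])
    return res["test_accuracy"].mean(), res["test_roc_auc"].mean()


# SVM without feature selection.
baseline = make_pipeline(TfidfVectorizer(), LinearSVC())
# SVM with correlation-based feature selection, fitted inside each fold.
with_fs = make_pipeline(TfidfVectorizer(),
                        SelectKBest(abs_correlation, k=10),
                        LinearSVC())

acc_base, auc_base = evaluate(baseline)
acc_fs, auc_fs = evaluate(with_fs)
print(f"SVM without FS: accuracy={acc_base:.3f}, AUC={auc_base:.3f}")
print(f"SVM with FS:    accuracy={acc_fs:.3f}, AUC={auc_fs:.3f}")
```

Placing the selector inside the pipeline means the correlation weights are recomputed on each training fold, so the held-out fold does not influence which features are kept. On a corpus this small the scores carry no meaning; the sketch only shows the shape of the comparison.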
Copyright (c) 2019 Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright in each article belongs to the author.
- The author acknowledges that Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) is the first publisher of the article, under a Creative Commons Attribution 4.0 International License.
- Authors may enter into separate arrangements for the non-exclusive distribution of the published manuscript in other versions (e.g., depositing it in the author's institutional repository or publishing it in a book), provided they acknowledge that the manuscript was first published in Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi).