Penggunaan Feature Selection di Algoritma Support Vector Machine untuk Sentimen Analisis Komisi Pemilihan Umum

  • Imam Santoso STMIK NUSA MANDIRI
  • Windu Gata STMIK Nusa Mandiri
  • Atik Budi Paryanti STIKOM Cipta Karya Informatika
Keywords: KPU, Support Vector Machine, Feature Selection, Weight by Correlation

Abstract

At this time sentiment analysis is very widely used by people to see the extent of people's sentiments towards an object.  Objects that can be used in sentiment analysis can be various kinds, for example about the product regarding receipt by consumers, agencies or institutions regarding the performance of the agency. Whereas for this study taking sentiment analysis of the State Institution namely the General Election Commission (KPU) about the sentiments of the implementation of the ELECTION simultaneously and also the results of the implementation of the ELECTION which have become the subject of discussion by netizens on social media. So this research takes retweet data and retention comments from Twitter social media users. The algorithm used in this study is Support Vector Machine (SVM), with optimization of the use of Weight by Correlation Feature Selection (FS). The results of cross validation SVM without FS are 66.49% for accuracy and 0.716 for AUC. Whereas SVM with FS is 81.18% for accuracy and 0.943 for AUC. Very significant improvement with the use of Weight by Correlation Feature Selection (FS).

Downloads

Download data is not yet available.

References

Chaovalit, Pimwadee and Lina Zhou, 2005. Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches, IEEE, pp. 1-9, .

Haddi, E., Liu, X., & Shi, Y., 2013. The Role of Text Pre-processing in Sentiment Analysis. First International Conference on Information Technology and Quantitative Management, 17, 26–32. https://doi.org/10.1016/j.procs.2013.05.05

Moraes, R., Valiati, J. F., & Gavião Neto, W. P., 2013. Document-level sentiment classification: An empirical comparison between SVM and ANN. Expert Systems with Applications, 40(2), 621–633. https://doi.org/10.1016/j.eswa.2012.07.05.

Andilala, 2016, Movie Review Sentimen Analis dengan Metode Naïve Bayes Base On Feature Selection, Jurnal Pseudocode, Volume III Nomor 1, Februari 2016, ISSN 2355 – 5920.

Fikriya, Zulfa Afiq; Irawan, Mohammad Isa; Soetrisno, 2017. “Implementasi Extreme Learning Machine untuk Pengenalan Objek Citra Digital”, Jurnal Sains dan Seni ITS, Vol. 6, No. 1. 2337-3520,

Rahmansyah A., Dewi O., Andini P., Hastuti PN, Triana and Eka Suryana, Muhammad. 2016, Membandingkan Pengaruh Feature Selection Terhadap Algoritma Naïve Bayes dan Support Vector Machine. Seminar Nasional Aplikasi Teknologi Informasi (SNATi) , 2018 p. A1 - A7.

Ana Azevedo, Manuel Filipe Santos, 2008., KDD, SEMMA and CRISP-DM: A Parallel Overview, IADIS European Conference Data Mining, 2008, pp 182 - 185

Guyon, I., Weston, J., and Barnhill, S. (2002), Machine Learning, Gene Selection for Cancer Classification using Support Vector Machines, Netherland , Kluwer Academic Publishers.

Rapidminer Documentation, 2019, Weight by Correllation, [ Online ] tersedia di : https://docs.rapidminer.com/latest/studio/operators/modeling/feature_weights/weight_by_correlation.html [ Accessed : 24 Juli 2019 ]

Achyani, Yuni Eka. 2018, Penerapan Metode Particle Swarm Optimization Pada Optimasi Prediksi Pemasaran Langsung, Jurnal Informatika, Vol.5 No.1 April 2018, pp. 1~11

Published
2019-12-02
Section
Artikel Teknologi Informasi