Monitoring of Public Attention to the COVID-19 Pandemic through Text Classification with Deep Learning

Novrindah Alvi Hasanah; Nanik Suciati; Diana Purwitasari

doi:10.29207/resti.v5i1.2927

Novrindah Alvi Hasanah, Department of Informatics, Institut Teknologi Sepuluh Nopember, Surabaya
Nanik Suciati, ITS Surabaya
Diana Purwitasari, ITS Surabaya

Keywords: monitoring public concern, Twitter, COVID-19, word embedding, deep learning

Abstract

Monitoring public concern about particular events helps authorities respond to changes in individual and social behavior. The results can serve as a benchmark for the relevant parties when setting policies and strategies to deal with changes in public behavior caused by the COVID-19 pandemic. Twitter data is suitable for this monitoring because the platform has a large user base and can therefore reflect the views of the general public. However, Twitter data covers varied topics, so a classification step is required to isolate the tweets related to COVID-19. Classification is performed with two word embedding variants (Word2Vec and fastText) and three deep learning variants (CNN, RNN, and LSTM) to find the combination with the best accuracy. The percentage of tweets classified as COVID-19-related by the most accurate model is then calculated to gauge how much public attention the pandemic receives. Experiments were carried out in three scenarios that differ in the amount of training data. The best results are obtained by the combination of fastText and LSTM, with a highest accuracy of 97.86% and a lowest of 93.63%. Monitoring over the period from June to October shows that public attention to COVID-19 was highest in June.
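The attention measure described in the abstract, the percentage of tweets classified as COVID-19-related in each month, can be illustrated with a minimal Python sketch. The function name `monthly_attention`, the `(month, is_covid)` input format, and the sample labels are assumptions made for illustration only; they are not the authors' code or data, and the classifier that would produce the labels is not shown.

```python
from collections import Counter


def monthly_attention(predictions):
    """Return {month: percentage of tweets predicted as COVID-19-related}.

    `predictions` is an iterable of (month, is_covid) pairs, where
    `is_covid` is the label predicted by some classifier (hypothetical
    input format, not taken from the paper).
    """
    totals = Counter()  # all tweets seen per month
    covid = Counter()   # tweets predicted as COVID-19-related per month
    for month, is_covid in predictions:
        totals[month] += 1
        if is_covid:
            covid[month] += 1
    return {month: 100.0 * covid[month] / totals[month] for month in totals}


# Made-up labels for illustration only; these are not results from the paper.
sample = [
    ("June", True), ("June", True), ("June", False),
    ("July", True), ("July", False), ("July", False), ("July", False),
]

shares = monthly_attention(sample)
peak = max(shares, key=shares.get)  # month with the largest COVID-19 share
```

Comparing the resulting shares across months is one simple way to identify the month in which attention peaked.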
