Penerapan ECS Stemmer untuk Modifikasi Nazief & Adriani Berbahasa Jawa
Abstract
Stemming Javanese affix words using Nazief & Adriani modifications still has problems that cannot be solved such as overstemming, understemming, and unchange. Then there needs to be improvements to improve the performance of Nazief & Adriani modifications. This study aims to improve the performance of Nazief & Adriani modifications using the Enahnced Confix Stripping (ECS) modification method. The results of this study indicate that Enhanced Confix Stripping can improve performance that previously had an accuracy of only 78.2% to 97.9% with an error rate of 2.1%. And fixing errors that originally numbered 98 to 9 errors. But Enhanced Confix Stripping still has problems with the words "ngetan, kumanggah, kumarut, kumasis, kumareg, kumadul, kumaras, katawakake, and pangenan". The next research is expected to be able to solve this problem.
Downloads
References
I. B. Arif Antono, Ida Zulaeha, “Pemertahanan Fonologis dan Leksikal Bahasa Jawa di Kabupaten Wonogiri,” J. Sastra Indones., vol. 8, no. 1, pp. 47–56, 2019.
P. F. Ariyani, A. Rahmala, and N. Juliasari, “Implementasi Metode Stemming Tala dan Fungsi Jaccard pada Aplikasi Katalog Perpustakaan,” Semin. Nas. Inov. dan Apl. Teknol. di Ind. 2019, vol. 5, pp. 128–133, 2019.
M. N. Kassim, S. Hisham, M. Jali, and M. A. Maarof, “Towards Stemming Error Reduction for Malay Texts,” pp. 13–23, 2019.
D. Sa, W. B. Zulfikar, C. Slamet, M. A. Ramdhani, and Y. A. Gerhana, “An Improved of Stemming Algorithm for Mining Indonesian Text with Slang on Social Media,” no. Citsm, 2018.
D. F. H. P. Amalia Sahira Rahma, Vit Zuraida, “Penggunaan Dictionary-Based dan Corpus-Based Thesaurus untuk Pembobotan Term pada Pengelompokan Dokumen Berita Berbahasa Indonesia,” vol. 2, no. 1, 2017.
D. Farrar and J. H. Hayes, “A Comparison of Stemming Techniques in Tracing,” pp. 1–8, 2019.
A. S. Rizki, A. Tjahyanto, and R. Trialih, “Comparison of Stemming Algorithms on Indonesian Text Processing,” vol. 17, no. 1, pp. 95–103, 2019.
N. Husni Mubarok, “Analisis Morfologi pada Bahasa Mandar dalam Ruang Lingkup Keluarga di Desa Tanjung Lalak Kecamatan Pulau Laut Kepulauan Kabupaten Kotabaru,” vol. 6, no. 2, pp. 63–79, 2018.
T. Winarti and S. Arief, “Determining Term on Text Document Clustering using Algorithm of Enhanced Confix Stripping Stemming,” vol. 157, no. 9, pp. 8–13, 2017.
Y. D. Pramudita, S. S. Putro, N. Makhmud, B. Olahraga, E. Confix, and S. Stemmer, “Klasifikasi Berita Olahraga Menggunakan Metode Naive Bayes dengan Enhanced Confix Stripping Stemmer,” vol. 5, no. 3, 2018.
M. N. Khidfi and J. Y. Sari, “Rancang Bangun Aplikasi Pendeteksian Kesamaan pada Dokumen Teks Menggunakan Algoritma Enhaced Confix Stripping dan Algoritma Winnowing,” no. September, 2018.
K. N. Sistem, A. Ridok, R. Latifah, F. Unibraw, and N. Neighbor, “Klasifikasi Teks Bahasa Indonesia pada Corpus Tak Seimbang Menggunakan NWKNN,” pp. 9–10, 2015.
T. Yusnitasari et al., “Uji Coba Stemming ECS ( Enhance Confix Stripping ) Ayat- Ayat Al Qur ’ an Dan Hadist Terjemahan Bahasa Indonesia,” pp. 24–26, 2018.
T. Yusnitasari, L. Wulandari, D. Ikasari, I. Humaini, K. K. Informasi, and I. Retrieval, “Perancangan Smart Digital Al Quran dan Hadist Bukhori Muslim untuk Platform Mobile Application,” no. September, 2018.
Y. N. Fadziah and E. F. R, “Penerapan Algoritma Enchanced Confix Stripping dalam Pengukuran Keterbacaan Teks Menggunakan Gunning Fog Index,” vol. 1, no. 1, pp. 15–24, 2018.
P. D. & T. D. Andita, “Implementasi Modifikasi Enhanced Confix Stripping Stemmer untuk Bahasa Indonesia dengan Metode Corpus Based Stemming,” pp. 1–15, 2012.
Copyright (c) 2019 Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright in each article belongs to the author
- The author acknowledges that the RESTI Journal (System Engineering and Information Technology) is the first publisher to publish with a license Creative Commons Attribution 4.0 International License.
- Authors can enter writing separately, arrange the non-exclusive distribution of manuscripts that have been published in this journal into other versions (eg sent to the author's institutional repository, publication in a book, etc.), by acknowledging that the manuscript has been published for the first time in the RESTI (Rekayasa Sistem dan Teknologi Informasi) journal ;