ANoM STEMMER: Nazief & Andriani Modification for Madurese Stemming
Abstract
Madurese is one of the regional languages in Indonesia. This is a cultural property that needs to be preserved. With various uniqueness and word formation rules, the Madurese language can be used in information retrieval, namely stemming. The Madurese language has a close relationship with the Javanese language; in several studies, the stemming method is often used, such as the modification of the Nazief and Adriani method, which has good performance for the Javanese language, but there has never been any research on the Madurese language and it has not been proven successful. Previous studies also have not used morphophonemic rules that influence word formation in Madurese. Therefore, this research was developed by modifying Nazief and Adriani's algorithm for Madurese based on Madurese language morphology by removing affixes, namely ter-ater (prefix), panoteng (suffix), and morphophonemic rules. Corpus uses 1000 words from the Madurese language dictionary that have received affixes. The accuracy of the algorithm is 89% with 890 words that match; the prefix has an accuracy of 93.81%; the suffix has an accuracy of 83.78%; and the confix has an accuracy of 80.07%. As for the overall performance, it produces an accuracy of 89.0% with an error rate of 11%. Understemming is found in 104 words, and overstemming in 6 words. The time it takes to compile is 31.31 seconds.
Downloads
References
I. Irwiadi and M. Norman Antono, “Proses Morfologis pada Bahasa Madura: Studi pada Mahasiswa Madura di Universitas Trunojoyo,” 2019.
F. H. Rachman, N. Ifada, S. Wahyuni, G. D. Ramadani, and A. Pawitra, “ModifiedECS (mECS) Algorithm for Madurese-Indonesian Rule-Based Machine Translation,” in 2022 International Conference of Science and Information Technology in Smart Administration (ICSINTESA), IEEE, Nov. 2022, pp. 51–56. doi: 10.1109/ICSINTESA56431.2022.10041470.
A. Andriani, “MORFOFONEMIK BAHASA INDONESIA PADA MASYARAKAT TUTUR BUGIS DIALEK SIDENRENG RAPPANG,” 2021. Accessed: Nov. 12, 2023. [Online]. Available: http://eprints.unm.ac.id/20489/1/ARTIKEL.pdf
S. Ibrihich, A. Oussous, O. Ibrihich, and M. Esghir, “A Review on recent research in information retrieval,” in Procedia Computer Science, Elsevier B.V., 2022, pp. 777–782. doi: 10.1016/j.procs.2022.03.106.
Y. K. Paskahningrum, E. Utami, and A. Yaqin, “A Systematic Literature Review of Stemming in Non-Formal Indonesian Language,” Int J Innov Sci Res Technol, vol. 8, no. 1, 2023, doi: 10.5281/zenodo.7547482.
M. A. Nq, L. P. Manik, and D. Widiyatmoko, “Stemming Javanese: Another Adaptation of the Nazief-Adriani Algorithm,” in 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), IEEE, Dec. 2020, pp. 627–631. doi: 10.1109/ISRITI51436.2020.9315420.
P. Ruriana, “HUBUNGAN KEKERABATAN BAHASA JAWA DAN MADURA,” Kandai, vol. 14, no. 1, p. 15, Jul. 2018, doi: 10.26499/jk.v14i1.512.
W. Hidayat, E. Utami, and A. D. Hartanto, “Effect of Stemming Nazief Adriani on the Ratcliff/Obershelp algorithm in identifying level of similarity between slang and formal words,” in 2020 3rd International Conference on Information and Communications Technology, ICOIACT 2020, Institute of Electrical and Electronics Engineers Inc., Nov. 2020, pp. 22–27. doi: 10.1109/ICOIACT50329.2020.9331973.
A. P. Wibawa, F. A. Dwiyanto, I. A. E. Zaeni, R. K. Nurrohman, and A. Afandi, “Stemming javanese affix words using nazief and adriani modifications,” Jurnal Informatika, vol. 14, no. 1, p. 36, Jan. 2020, doi: 10.26555/jifo.v14i1.a17106.
N. Hidayatullah, A. Prasetya Wibawa, and H. A. Rosyid, “Terakreditasi SINTA Peringkat 2 Penerapan ECS Stemmer untuk Modifikasi Nazief & Adriani Berbahasa Jawa,” masa berlaku mulai, vol. 1, no. 3, pp. 343–348, 2019.
N. W. Wardani and P. G. S. C. Nugraha, “Stemming Teks Bahasa Bali dengan Algoritma Enhanced Confix Stripping,” International Journal of Natural Science and Engineering, vol. 4, no. 3, pp. 103–113, Dec. 2020, doi: 10.23887/ijnse.v4i3.30309.
J. Jumadi, D. S. Maylawati, L. D. Pratiwi, and M. A. Ramdhani, “Comparison of Nazief-Adriani and Paice-Husk algorithm for Indonesian text stemming process,” IOP Conf Ser Mater Sci Eng, vol. 1098, no. 3, p. 032044, Mar. 2021, doi: 10.1088/1757-899x/1098/3/032044.
I. Putu et al., “ALGORITMA BASTAL: ADAPTASI ALGORITMA NAZIEF & ADRIANI UNTUK STEMMING TEKS BAHASA BALI,” 2019.
D. Wahyudi, T. Susyanto, D. Nugroho, P. Studi Teknik Informatika, S. Sinar Nusantara Surakarta, and P. Studi Sistem Informasi, “IMPLEMENTASI DAN ANALISIS ALGORITMA STEMMING NAZIEF & ADRIANI DAN PORTER PADA DOKUMEN BERBAHASA INDONESIA,” jurnal ilmiah SINUS, 2018.
K. N. Lakonawa, S. A. S. Mola, and A. Fanggidae, “NAZIEF-ADRIANI STEMMER DENGAN IMBUHAN TAK BAKU PADA NORMALISASI BAHASA PERCAKAPAN DI MEDIA SOSIAL,” Jurnal Komputer dan Informatika, vol. 9, no. 1, pp. 65–73, Mar. 2021, doi: 10.35508/jicon.v9i1.3749.
S. Firman, W. Desena, A. Wibowo, M. I. Komputer, and U. B. Luhur, “Penerapan Algoritma Stemming Nazief & Adriani Pada Proses Klasterisasi Berita Berdasarkan Tematik Pada Laman (Web) Direktorat Jenderal HAM Menggunakan Rapidminer,” 2022.
N. Justina, M. Verdaningroem, and A. Saifudin, “PENERAPAN KAMUS DASAR PADA ALGORITMA PORTER UNTUK MENGURANGI KESALAHAN STEMMING BAHASA INDONESIA,” 2018, doi: 10.24853/jurtek.10.2.103-112.
S. Betha and R. Hersianie, “ANALISA MODIFIKASI ALGORITMA STEMMING UNTUK KASUS OVERSTEMMING,” vol. 3, no. 2, 2020.
M. Hafid, E. Dosen, J. Tarbiyah, and S. Pamekasan, “PROBLEMATIKA PERIODISASI EJAAN BAHASA MADURA DALAM PERSPEKTIF PRAKTISI MADURA.”
M. A. Nq, L. P. Manik, and D. Widiyatmoko, “Stemming Javanese: Another Adaptation of the Nazief-Adriani Algorithm,” in 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2020, Institute of Electrical and Electronics Engineers Inc., Dec. 2020, pp. 627–631. doi: 10.1109/ISRITI51436.2020.9315420.
A. Yaqin, M. Rahardi, and F. F. Abdulloh, “Accuracy Enhancement of Prediction Method using SMOTE for Early Prediction Student’s Graduation in XYZ University,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 6, pp. 418–424, 2022, doi: 10.14569/IJACSA.2022.0130652.
D. Soyusiawaty, A. H. S. Jones, and N. L. Lestariw, “The Stemming Application on Affixed Javanese Words by using Nazief and Adriani Algorithm,” in IOP Conference Series: Materials Science and Engineering, Institute of Physics Publishing, Mar. 2020. doi: 10.1088/1757-899X/771/1/012026.
Copyright (c) 2023 Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright in each article belongs to the author
- The author acknowledges that the RESTI Journal (System Engineering and Information Technology) is the first publisher to publish with a license Creative Commons Attribution 4.0 International License.
- Authors can enter writing separately, arrange the non-exclusive distribution of manuscripts that have been published in this journal into other versions (eg sent to the author's institutional repository, publication in a book, etc.), by acknowledging that the manuscript has been published for the first time in the RESTI (Rekayasa Sistem dan Teknologi Informasi) journal ;