Comparative Evaluation of IndoBERT, IndoBERTweet, and mBERT for Multilabel Student Feedback Classification

  • Fatma Indriani Universitas Lambung Mangkurat
  • Radityo Adi Nugroho Universitas Lambung Mangkurat
  • Mohammad Reza Faisal Universitas Lambung Mangkurat
  • Dwi Kartini Universitas Lambung Mangkurat
Keywords: BERT models, education data, finetuning, multilabel classification, sequence length, student feedback

Abstract

Student feedback plays a crucial role in enhancing the quality of educational programs, yet analyzing this feedback, especially in informal contexts, remains challenging. In Indonesia, where student comments often include colloquial language and vary widely in content, effective multilabel classification is essential to accurately identify the aspects of courses being critiqued. Despite the development of several BERT-based models, the effectiveness of these models for classifying informal Indonesian text remains underexplored. Here we evaluate the performance of three BERT variants—IndoBERT, IndoBERTweet, and mBERT—on the task of multilabel classification of student feedback. Our experiments investigate the impact of different sequence lengths and truncation strategies on model performance. We find that IndoBERTweet, with a macro F1-score of 0.8462, outperforms IndoBERT (0.8243) and mBERT (0.8230) when using a sequence length of 64 tokens and truncation at the end. These findings suggest that IndoBERTweet is well-suited for handling the informal, abbreviated text common in Indonesian student feedback, providing a robust tool for educational institutions aiming for actionable insights from student comments.

Downloads

Download data is not yet available.

References

[1] A. S. Sunar and M. S. Khalid, “Natural Language Processing of Student’s Feedback to Instructors: A Systematic Review,” IEEE Trans. Learning Technol., vol. 17, pp. 741–753, 2024, doi: 10.1109/TLT.2023.3330531.
Published
2024-12-27
How to Cite
Indriani, F., Nugroho, R. A., Faisal, M. R., & Kartini, D. (2024). Comparative Evaluation of IndoBERT, IndoBERTweet, and mBERT for Multilabel Student Feedback Classification. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 8(6), 748 - 757. https://doi.org/10.29207/resti.v8i6.6100
Section
Information Technology Articles