PENERAPAN TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY DAN WORD EMBEDDING UNTUK PENILAIAN ESAI OTOMATIS
DOI:
https://doi.org/10.33884/jif.v14i01.10829Keywords:
Penilaian Esai Otomatis, Text Mining, TF-IDF, Word Embedding, Cosine SimilarityAbstract
Manual essay grading at the vocational school level is a time-consuming and subjective process. This research implemented and evaluated an automatic essay scoring model using a combination of the Term Frequency-Inverse Document Frequency (TF-IDF) algorithm for word weighting and Word Embedding for semantic meaning analysis. The model was tested using a dataset of 360 essay answers from 36 students at SMK Budi Bakti Ciwidey with train-test split validation. The tuning process on the training data showed that a weighting that prioritized semantic analysis (90% Word Embedding) provided the best performance. In the final testing on 90 test data, the model achieved an excellent Mean Absolute Error (MAE) of 6.80, but with a weak Pearson correlation of 0.12 against the teacher's scores. This research concludes that the proposed model is successful in generating scores that are very close to the teacher's scores (low MAE), but still has limitations in terms of scoring consistency (weak correlation), which is influenced by the quality of the key answers and an imbalanced dataset.
References
A. Nasihi, T. Asihati Ratna Hapsari, and K. Kota Jakarta Selatan, “Indonesian Journal of Teaching and Learning,” vol. 1, no. 1, pp. 77–88, 2022, doi: 10.56855/intel.v1i1.112.
C. Hasanudin, “EVALUASI PERKULIAHAN DARING KETERAMPILAN MENULIS SELAMA MASA PANDEMI COVID-19 DENGAN MODEL EVALUASI CIPP,” JPE (Jurnal Pendidikan Edutama, vol. 8, no. 2, 2021, [Online]. Available: http://ejurnal.ikippgribojonegoro.ac.id/index.php/JPE
Miftha Huljannah, “Pentingnya Proses Evaluasi Dalam Pembelajaran Di Sekolah Dasar,” EDUCATOR (DIRECTORY OF ELEMENTARY EDUCATION JOURNAL), vol. 2, no. 2, pp. 164–180, Dec. 2021, doi: 10.58176/edu.v2i2.157.
Z. Wahiah, S. Marganingrum Prabowo, and H. A. Safitri, “Eksplorasi Efektivitas Tes Pilihan Ganda Berbasis Komputer Sebagai Evaluasi Pembelajaran,” EDUCATIVO: JURNAL PENDIDIKAN, vol. 2, no. 2, p. Page, 2023, doi: 10.56248/educativo.v2i2.
A. Ahadi, A. Singh, M. Bower, and M. Garrett, “Text Mining in Education—A Bibliometrics-Based Systematic Review,” Educ Sci (Basel), vol. 12, no. 3, 2022, doi: 10.3390/educsci12030210.
V. Prasetyo, M. Widiasri, and M. Angkiriwang, “Amalia Dkk.Pdf,” vol. Volume 11(1), Mar. 2022, doi: 10.34148/teknika.v11i1.449.
D. A. Suryaningrum, R. Syaifudin, and H. R. P. Putra, “INTEGRASI WORD EMBEDDINGS DAN INVERSE BOOK FREQUENCY DALAM PEMBOBOTAN TERM UNTUK PENINGKATAN PENCARIAN DOKUMEN,” JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika), vol. 9, no. 4, pp. 2529–2537, Dec. 2024, doi: 10.29100/jipi.v9i4.7557.
A. Saputra, “Strategi Evaluasi Pembelajaran Pendidikan Agama Islam Pada SMP.”
Y. B. Utomo, I. Kurniasari, and I. Yanuartanti, “PENERAPAN KNOWLEDGE DISCOVERY IN DATABASE UNTUK ANALISA TINGKAT KECELAKAAN LALU LINTAS,” Jurnal Teknik Informatika Kaputama (JTIK), vol. 7, no. 1, 2023.
S. Zanki, N. Pusparini, N. Kharisma, and Samuel, “Journal tujuan 1,” Sep. 2025.
H. D. Abubakar and M. Umar, “Sentiment Classification: Review of Text Vectorization Methods: Bag of Words, Tf-Idf, Word2vec and Doc2vec,” SLU Journal of Science and Technology, vol. 4, no. 1 & 2, pp. 27–33, Aug. 2022, doi: 10.56471/slujst.v4i.266.
A. T. Laksana, S. Sylviani, and A. Triska, “STUDI PENERAPAN KONSEP VEKTOR DALAM PERMASALAHAN PENYISIPAN KATA-KATA MELALUI PROSES NORMALISASI VECTOR DAN TRANSFORMASI ORTHOGONAL,” vol. 5, no. 2, 2024, doi: 10.46306/lb.v5i2.
F. Teknik, “PENERAPAN TEKS MINING DAN COSINE SIMILARITY UNTUK MENENTUKAN KESAMAAN DOKUMEN SKRIPSI APPLICATION OF TEXT MINING AND COSINE SIMILARITY TO DETERMINE THE SIMILARITY OF THESIS DOCUMENTS,” 2024.
A. Pakpahan, F. Ferdiansyah, R. Gustian, M. Faiz, and A. Sukma, “Andy Victor,” vol. 7 No. 1 June 2025, Jun. 2025, doi: doi.org/10.35970/jinita.v7i1.2724.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 JURNAL ILMIAH INFORMATIKA

This work is licensed under a Creative Commons Attribution 4.0 International License.


















