IMPLEMENTASI DATA MINING UNTUK PREDIKSI DROP-OUT DENGAN MENGGUNAKAN RANDOM FOREST METHOD

UTARI, Meylani and Warsito, Budi and Kusumaningrum, Retno (2020) IMPLEMENTASI DATA MINING UNTUK PREDIKSI DROP-OUT DENGAN MENGGUNAKAN RANDOM FOREST METHOD. Masters thesis, School of Postgraduate.

Preview

PDF
1540Kb

Preview

PDF
181Kb

Preview

PDF
352Kb

PDF
Restricted to Repository staff only
1213Kb

PDF
Restricted to Repository staff only
2164Kb

Preview

PDF
176Kb

Preview

PDF
298Kb

Abstract

Akreditasi adalah salah satu tolak ukur kualitas pada sebuah Perguruan Tinggi. Beberapa unsur pengukuran tersebut adalah mahasiswa dan lulusan. Pencegahan mahasiswa yang drop-out menjadi permasalahan yang dianggap sangat penting bagi Perguruan Tinggi itu sendiri. Tingginya tingkat putus sekolah akan memberikan dampak yang buruk pada Perguruan Tinggi, seperti reputasi dan akreditasi yang kurang baik. Penelitian ini menyajikan hasil analisis studi kasus data pendidikan dengan menggunakan teknik klasifikasi pada data mining yang berfokus pada deteksi drop-out mahasiswa sarjana dan diploma pada Fakultas ABC di Perguruan Tinggi XYZ. Data mentah berasal dari data akademik siswa yang mendaftar di universitas dari tahun 2008 hingga tahun 2012. Data mentah lalu dilakukan proses preprocessing untuk mengatasi permasalahan ketidakseimbangan data. Teknik synthetic minority oversampling (SMOTE) untuk menangani ketidakseimbangan dataset dan algoritma random forest untuk memprediksi drop-out dengan data latih awal sebanyak 2492 dataset. Sebagai hasil penelitian, algoritma random forest disertai dengan SMOTE dapat memberikan hasil akurasi terbaik sebesar 93,43%. Sedangkan hasil utama dari penelitian ini dapat digunakan untuk mengurangi tingkat drop-out dengan cara melakukan prediksi dini terhadap mahasiswa berpotensi tinggi untuk drop-out dan mengidentifikasi faktor-faktor potensial yang terkait dengan penyebab mahasiswa drop-out. Kata Kunci : drop out; random forest; synthetic minority over sampling; SMOTE; data pendidikan; data mining; classification; prediction; imbalance dataset Accreditation is one of the quality measurements for a University. Some elements of these measurements are students and graduate students. Prevention of students to drop out is a problem that is considered very important for the university itself. High levels of drop out students will have a bad impact on the university, such as bad reputation or low-grade accreditation. This research presenting the results of a case study analysis in educational data, by analyzing the data using the data mining technique. The author using the classification method, that focuses on drop-out prediction of undergraduate and diploma students at the ABC Faculty at XYZ University. To predict drop-out classification, academic data are needed. The raw data are student's academic data that enroll in university from 2008 to 2012. The raw data preprocessing then carried out to handle imbalanced data. This research uses synthetic minority oversampling technique (SMOTE) to handle imbalance dataset and random forest algorithm to predict drop-out within 2492 data. As a research result, the random forest algorithm accompanied by SMOTE can provide the best accuracy results by 93.43%. The main results of this research can be used to reduce drop-out levels by predicting potential drop out students and identifying potential factors related to drop out students. Keywords: drop out; random forest; synthetic minority over-sampling; SMOTE; educational data; data mining; classification; prediction; imbalance dataset

Item Type:	Thesis (Masters)
Uncontrolled Keywords:	drop out; random forest; synthetic minority over sampling; SMOTE; data pendidikan; data mining; classification; prediction; imbalance dataset
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:	School of Postgraduate > Master Program in Information System
ID Code:	82271
Deposited By:	INVALID USER
Deposited On:	14 Jan 2021 16:09
Last Modified:	14 Jan 2021 16:09

Repository Staff Only: item control page