Comparative Analysis of Machine Learning Models on Student Performance Data: Insights from Test Scores and Survey Data

Journal Title: European Journal of Teaching and Education - Year 2025, Vol 7, Issue 1

Abstract

With the increasing use of digital learning platforms, large volumes of student data have become available for analysis. This paper investigates how machine learning, learning analytics, and educational data mining can be utilized to gain insights into student performance. Various predictive modeling techniques, including Random Forest (RF), K-Nearest Neighbor (KNN), and Decision Trees (DT), are evaluated for their ability to forecast student test scores. Clustering algorithms like K-means are employed to identify patterns within the data. The study integrates these predictive models with survey data collected from undergraduate students at Heriot-Watt University Dubai, aiming to identify factors that influence academic outcomes. The research uses comparative analysis across different machine learning models which is applied to both the survey data and Kaggle test score data. The analysis reveals that linear regression is the most effective model for the Kaggle test score dataset, while K-means clustering provides the best insights from the survey data. The survey model is determined to be more comprehensive due to its inclusion of more predictors. Key metrics, such as accuracy scores, precision, recall, F1 score, and mean squared error, were calculated for both datasets to provide a quantitative overview, enabling a comparative evaluation of model performance and predictor effectiveness for both the datasets. The findings contribute to understanding how data-driven approaches can support educational decisions and interventions while addressing ethical considerations and inclusivity in educational settings.

Authors and Affiliations

Sanjana Sundararaman,Maheen Hasib,

Keywords

Related Articles

The Setting of School-Enterprise Major Curriculum Based on Students’ Satisfaction

The current curriculum cannot meet the needs of students for learning and employment. In this study, the researchers shall improve students’ satisfaction with school-enterprise cooperation specialty by optimizing the cur...

A Comparative Study of Learning Related Emotions among Male and Female Health Field University Students in Saudi Arabia

The present study investigates the level of learning-related emotions as well as to looks at the differences between female and male undergraduate health tack students for learning-related emotions levels. A self-reporte...

Motivation Factors for Elementary School Students

It is not hard to recognise motivated student. He is interested, curious, active, enthusiastic and does not give up when encounters difficulties, but thinks for further education. Motivation during schooling is highly re...

A South African Perspective on Inclusive Education Conceptualisation and Impact on Practices

This study explores the impact of primary school principals and Foundation Phase teachers' conceptualisations of inclusive education on their teaching practices in South Africa. The importance of these conceptualisations...

Post-Covid-19 Shifts: Analysing Changes in Bangladeshi Undergraduate Students' Attitudes Toward Face-to-Face Learning

This study explores shifts in undergraduate students' attitudes toward face-to-face classes after returning to in-person education following the COVID-19 pandemic. The pandemic necessitated a shift to online learning, re...

Download PDF file
  • EP ID EP760042
  • DOI https://doi.org/10.33422/ejte.v7i1.1459
  • Views 30
  • Downloads 0

How To Cite

Sanjana Sundararaman, Maheen Hasib, (2025). Comparative Analysis of Machine Learning Models on Student Performance Data: Insights from Test Scores and Survey Data. European Journal of Teaching and Education, 7(1), -. https://europub.co.uk./articles/-A-760042