Service: P.IA.ES - Modeling and prediction of failure and school dropout in Higher Education

Responsible organisation: AMA (Agência para a Modernização Administrativa) (Central-Government)

This work aims to contribute to the reduction of academic failure at higher education, by using machine learning techniques to identify students at risk of failure at an early stage of their academic path, so that strategies to support them can be put into place. A dataset from a higher education institution is used to build classification models to predict academic performance of students. The dataset includes information known at the time of student’s enrollment – academic path, demographics and social-economic factors. The problem is formulated as a three category classification task, in which there’s a strong imbalance towards one of the classes. Algorithms to promote class balancing with synthetic oversampling are tested, and classification models are trained and evaluated, both with standard machine learning algorithms and state of the art boosting algorithms. Our results show that boosting algorithms respond better to the specific classification task than standard methods. However, even these state of the art algorithms fall short in correctly identifying the majority of cases in one of the minority classes. Future directions of this study include the addition of information regarding student’s first year performance, such as academic grades from the first academic semesters.

Additional information

Source Open Innovation Regione Lombardia
Web site https://www.researchgate.net/publication/351066139_Early_Prediction_of_Student%27s_Performance_in_Higher_Education_A_Case_Study
Start/end date 2019 - 2021.0
Still active?

Related cases