Beyond Early Warning Indicators: High School Dropout and Machine Learning

B-Tier
Journal: Oxford Bulletin of Economics and Statistics
Year: 2019
Volume: 81
Issue: 2
Pages: 456-485

Score contribution per author:

2.011 = (α=2.01 / 1 authors) × 1.0x B-tier

α: calibrated so average coauthorship-adjusted count equals average raw count

Abstract

This paper combines machine learning with economic theory in order to analyse high school dropout. It provides an algorithm to predict which students are going to drop out of high school by relying only on information from 9th grade. This analysis emphasizes that using a parsimonious early warning system – as implemented in many schools – leads to poor results. It shows that schools can obtain more precise predictions by exploiting the available high‐dimensional data jointly with machine learning tools such as Support Vector Machine, Boosted Regression and Post‐LASSO. Goodness‐of‐fit criteria are selected based on the context and the underlying theoretical framework: model parameters are calibrated by taking into account the policy goal – minimizing the expected dropout rate ‐ and the school budget constraint. Finally, this study verifies the existence of heterogeneity through unsupervised machine learning by dividing students at risk of dropping out into different clusters.

Technical Details

RePEc Handle
repec:bla:obuest:v:81:y:2019:i:2:p:456-485
Journal Field
General
Author Count
1
Added to Database
2026-01-29