How Well Can We Predict Bad Programs?
A gradient-boosted classifier trained on non-tautological features—things a decision-maker would know before seeing outcomes—achieves strong predictive performance across 5-fold cross-validation on 65,935 programs.
Gradient boosting classifier, 5-fold cross-validation, non-tautological features only (no outcome leakage). N = 65,935 programs; 22,692 negative ROI.
What Drives the Prediction?
The model relies heavily on just three features: the institution's overall track record, the field's national success rate, and credential level. Together these account for 93% of the model's predictive power.
What the Model Tells Us
What This Means for a University CFO
Check your institution's overall track record first. If more than 40% of your programs have negative ROI, every program should be under review—not just the obvious ones. The institution-level signal is so strong that individual program characteristics are secondary.