Predicting and analyzing secondary education placement-test scores: A data mining approach

ŞEN B. , Ucar E., Delen D.

EXPERT SYSTEMS WITH APPLICATIONS, vol.39, no.10, pp.9468-9476, 2012 (Journal Indexed in SCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 39 Issue: 10
  • Publication Date: 2012
  • Doi Number: 10.1016/j.eswa.2012.02.112
  • Page Numbers: pp.9468-9476


Understanding the factors that lead to success (or failure) of students at placement tests is an interesting and challenging problem. Since the centralized placement tests and future academic achievements are considered to be related concepts, analysis of the success factors behind placement tests may help understand and potentially improve academic achievement. In this study using a large and feature rich dataset from Secondary Education Transition System in Turkey we developed models to predict secondary education placement test results, and using sensitivity analysis on those prediction models we identified the most important predictors. The results showed that CS decision tree algorithm is the best predictor with 95% accuracy on hold-out sample, followed by support vector machines (with an accuracy of 91%) and artificial neural networks (with an accuracy of 89%). Logistic regression models came out to be the least accurate of the four with and overall accuracy of 82%. The sensitivity analysis revealed that previous test experience, whether a student has a scholarship, student's number of siblings, previous years' grade point average are among the most important predictors of the placement test scores. (C) 2012 Elsevier Ltd. All rights reserved.