Predicting student success with and without library instruction using supervised machine learning methodsKaren Harker, Carol Hargis, Jennifer RowePerformance Measurement and Metrics, Vol. 25, No. 2, pp.77-90
The main purpose of this analysis was to demonstrate the value of predictive modeling of student success and identify the key groups of students for which library instruction could provide the most impact.
Data regarding the attendance of library instruction associated with a first-year writing course were combined with student demographic and academic data over a four year period representing over 10,000 students. We applied supervised machine learning methods to determine the most accurate model for predicting student outcomes, including course outcome, persistence and graduation. We also assessed the impact of library instruction on these outcomes.
The gradient-boosted decision tree model provided the most accurate predictions. The impact of library instruction was modest but still was second only to the previous grade point average (GPA). The value of this metric, however, was greatest for students who were struggling, especially those who were first-generation students, regardless of ethnicity. More notably, the impact of library instruction was substantially greater for specific student demographics, including students with lower cumulative GPAs.
Features of the models were limited to high-level academic metrics, some of which may not be very useful in predicting outcomes. Measures more closely related to learning styles, the course or course of study could provide for greater accuracy.
Prediction modeling could allow for a more selective approach to outreach and offers information that the librarian can use to customize instruction sessions and reference interactions.
Targeting students who may be at risk of not succeeding in a course has ethical implications either way. If used to bias the subjective assessments, these predictions could produce self-fulfilling prophecies. Conversely, to ignore indicators of possible difficulties the student may have with the material is a disservice to the education of that student.
There are few studies that have incorporated library instruction into models of predicting student outcomes. Library resources and services can play a major role in the success of students, particularly those who have had less exposure to the resources and skills needed to use these resources.