Abstract
Course instructors need to be able to identify students in need of assistance as early in the course as possible. Recent work has suggested that machine learning approaches applied to snapshots of small programming exercises may be an effective solution to this problem. However, these results have been obtained using data from a single institution, and prior work using features extracted from student code has been highly sensitive to differences in context. This work provides two contributions: first, a partial reproduction of previously published results, but in a different context, and second, an exploration of the efficacy of neural networks in solving this problem. Our findings confirm the importance of two features (the number of steps required to solve a problem and the correctness of key problems), indicate that machine learning techniques are relatively stable across contexts (both across terms in a single course and across courses), and suggest that neural network based approaches are as effective as the best Bayesian and decision tree methods. Furthermore, neural networks can be tuned to be reliably pessimistic, so they may serve a complementary role in solving the problem of identifying students who need assistance.
Original language | English |
---|---|
Title of host publication | SIGCSE 2017 |
Subtitle of host publication | Proceedings of the 2017 ACM SIGCSE Technical Symposium on Computer Science Education |
Place of Publication | New York, NY |
Publisher | Association for Computing Machinery |
Pages | 111-116 |
Number of pages | 6 |
ISBN (Electronic) | 9781450346986 |
DOIs | |
Publication status | Published - 2017 |
Externally published | Yes |
Event | ACM SIGCSE Technical Symposium on Computer Science Education (48th : 2017) - Seattle, United States Duration: 8 Mar 2017 → 11 Mar 2017 |
Conference
Conference | ACM SIGCSE Technical Symposium on Computer Science Education (48th : 2017) |
---|---|
Abbreviated title | SIGCSE'17 |
Country/Territory | United States |
City | Seattle |
Period | 8/03/17 → 11/03/17 |
Keywords
- introductory programming
- CS1
- at-risk students
- source code snapshot analysis
- educational data mining
- learning analytics
- replication
- reproduction