Projects per year
Abstract
The emergence of large language models (LLMs) has transformed research and practice across a wide range of domains. Within the computing education research (CER) domain, LLMs have garnered significant attention, particularly in the context of learning programming. Much of the work on LLMs in CER, however, has focused on applying and evaluating proprietary models. In this article, we evaluate the efficiency of open-source LLMs in generating high-quality feedback for programming assignments and judging the quality of programming feedback, contrasting the results with proprietary models. Our evaluations on a dataset of students’ submissions to introductory Python programming exercises suggest that state-of-the-art open-source LLMs are nearly on par with proprietary models in both generating and assessing programming feedback. Additionally, we demonstrate the efficiency of smaller LLMs in these tasks and highlight the wide range of LLMs accessible, even for free, to educators and practitioners.
Original language | English |
---|---|
Title of host publication | SIGCSE TS 2025 - Proceedings of the 56th ACM Technical Symposium on Computer Science Education |
Publisher | ACM |
Pages | 624-630 |
Number of pages | 7 |
Volume | 1 |
ISBN (Electronic) | 979-8-4007-0531-1 |
DOIs | |
Publication status | Published - 18 Feb 2025 |
MoE publication type | A4 Conference publication |
Event | ACM Technical Symposium on Computer Science Education - Pittsburgh, United States Duration: 26 Feb 2025 → 1 Mar 2025 Conference number: 56 |
Conference
Conference | ACM Technical Symposium on Computer Science Education |
---|---|
Abbreviated title | SIGCSE |
Country/Territory | United States |
City | Pittsburgh |
Period | 26/02/2025 → 01/03/2025 |
Keywords
- automatic evaluation
- automatic feedback
- generative AI
- large language models
- LLM-as-a-judge
- open source
- programming feedback
Fingerprint
Dive into the research topics of 'Evaluating Language Models for Generating and Judging Programming Feedback'. Together they form a unique fingerprint.Projects
- 1 Active
-
Leinonen Juho /AT tot.: Large Language Models for Computing Education
Leinonen, J. (Principal investigator)
01/09/2023 → 31/08/2027
Project: RCF Academy Research Fellow (new)