Training Language Models for Programming Feedback Using Automated Repair Tools

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

5 Citations (Scopus)
61 Downloads (Pure)

Abstract

In introductory programming courses, automated repair tools (ARTs) are used to provide feedback to students struggling with debugging. Most successful ARTs take advantage of context-specific educational data to construct repairs to students’ buggy codes. Recent work in student program repair using large language models (LLMs) has also started to utilize such data. An underexplored area in this field is the use of ARTs in combination with LLMs. In this paper, we propose to transfer the repairing capabilities of existing ARTs to open large language models by finetuning LLMs on ART corrections to buggy codes. We experiment with this approach using three large datasets of Python programs written by novices. Our results suggest that a finetuned LLM provides more reliable and higher-quality repairs than the repair tool used for finetuning the model. This opens venues for further deploying and using educational LLM-based repair techniques.
Original languageEnglish
Title of host publicationArtificial Intelligence in Education
Subtitle of host publication24th International Conference, AIED 2023, Tokyo, Japan, July 3–7, 2023, Proceedings
EditorsNing Wang, Genaro Rebolledo-Mendez, Noboru Matsuda, Olga C. Santos, Vania Dimitrova
PublisherSpringer
Pages830–835
ISBN (Electronic)978-3-031-36272-9
ISBN (Print)978-3-031-36271-2
DOIs
Publication statusPublished - 2023
MoE publication typeA4 Conference publication
EventInternational Conference on Artificial Intelligence in Education - Tokyo, Japan
Duration: 3 Jul 20237 Jul 2023
Conference number: 24

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume13916
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Artificial Intelligence in Education
Abbreviated titleAIED
Country/TerritoryJapan
CityTokyo
Period03/07/202307/07/2023

Fingerprint

Dive into the research topics of 'Training Language Models for Programming Feedback Using Automated Repair Tools'. Together they form a unique fingerprint.
  • Science-IT

    Hakala, M. (Manager)

    School of Science

    Facility/equipment: Facility

Cite this