On the Opportunities of Large Language Models for Programming Process Data

John Edwards*, Arto Hellas, Juho Leinonen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

2 Downloads (Pure)

Abstract

Computing educators and researchers have long used programming process data to understand how students construct programs and address challenges. Despite its potential, fully automated feedback systems remain underexplored. The emergence of Large Language Models (LLMs) offers new opportunities for analyzing programming data and providing formative feedback. This study explores using LLMs to summarize programming processes and deliver formative feedback. A case study analyzed keystroke-level data from an introductory programming course, processed into code snapshots. Three state-of-the-art LLMs - Claude 3 Opus, GPT-4 Turbo, and LLaMa2 70B Chat - were evaluated for their feedback capabilities. Results show LLMs effectively provide tailored feedback, emphasizing incremental development, algorithmic planning, and code readability. Our findings highlight the potential of combining keystroke data with LLMs to automate formative feedback, showing that the computing education research and practice community is again one step closer to automating formative programming process feedback.

Original languageEnglish
Title of host publicationACE 2025 - Proceedings of the 27th Australasian Computing Education Conference, Held in conjunction with
PublisherACM
Pages105-113
Number of pages9
ISBN (Electronic)9798400714252
DOIs
Publication statusPublished - 7 Apr 2025
MoE publication typeA4 Conference publication
EventAustralasian Computing Education Conference - Brisbane, Australia
Duration: 12 Feb 202513 Feb 2025
Conference number: 27

Conference

ConferenceAustralasian Computing Education Conference
Abbreviated titleACE
Country/TerritoryAustralia
CityBrisbane
Period12/02/202513/02/2025

Keywords

  • generative AI
  • keystroke data
  • large language models
  • programming process data
  • programming process feedback
  • programming process summarization

Fingerprint

Dive into the research topics of 'On the Opportunities of Large Language Models for Programming Process Data'. Together they form a unique fingerprint.

Cite this