Zur Hauptnavigation wechseln Zur Suche wechseln Zum Hauptinhalt wechseln

Automatic feedback on physics tasks using open-source generative artificial intelligence

  • André Meyer*
  • , Tom Bleckmann
  • , Gunnar Friege
  • *Korrespondierende*r Autor*in für diese Arbeit

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Abstract

This study explores the feasibility of using open-source large language models (LLMs) to generate automatic feedback on physics problem-solving tasks in educational settings. A quantised version of the open-source LLM OpenChat 3.6 was employed to generate German-language feedback for high school students on standard school hardware. The study procedure involved five stages: data preparation, model selection, prompt design, response evaluation, and quality analysis of feedback. OpenChat 3.6 achieved an accuracy of 0.84 in classifying student answers. In comparison, GPT4-o achieved an accuracy of 0.85. The open-source LLM provided accurate and suitable feedback in 69% of cases, with substantial interrater agreement (κ = 0.89) on feedback quality. However, performance varied across task types, highlighting areas for improvement in prompt specificity, especially in handling physics terminology. These findings suggest that, with optimisation, open-source LLMs can offer a locally controlled and effective solution for formative assessment in physics education, enabling real-time, targeted feedback to support student learning.
OriginalspracheEnglisch
Seiten (von - bis) 1–26
Seitenumfang26
FachzeitschriftInternational Journal of Science Education
DOIs
PublikationsstatusElektronisch veröffentlicht (E-Pub) - 12 Mai 2025

ASJC Scopus Sachgebiete

  • Ausbildung bzw. Denomination

Dieses zitieren