Publikationstyp: Konferenz: Paper
Art der Begutachtung: Peer review (Abstract)
Titel: Using synthetic sentences for developing a corpus-based feedback service for student writing
Autor/-in: Runte, Maren
Mahlow, Cerstin
Ulasik, Malgorzata Anna
Cho, Sooyeon
et. al: No
Angaben zur Konferenz: SIG Writing, Paris Nanterre University, France, 26-28 June 2024
Erscheinungsdatum: 26-Jun-2024
Sprache: Englisch
Schlagwörter: Corpus linguistics; Student writing; Annotation; Move; Step
Fachgebiet (DDC): 410.285: Computerlinguistik
808: Rhetorik und Schreiben
Zusammenfassung: To support students when writing their bachelor thesis, early and immediate feedback on structure and linguistic realization of required elements has been proven useful. Feedback should thus influence the writing process and help students produce appropriate products. Automatic feedback systems usually rely on models derived from scientific articles written by established researchers (e.g., Weder 2015) which also serve as examples shown to students. However, student writing is not on the same level as expert writing as students are still in the process to learn and master relevant competences. Automated services, intended to support students learning to write scientific introductions, should thus be based on authentic student writing to offer appropriate feedback. These services should offer feedback on drafts, let writers revise their texts and give feedback with respect to improvements, thus guiding the writing process.Within the project «Digital Literacy in University Contexts», we develop such a service, based on over 5000 student theses written in German from several study fields. So far, we manually annotated 1034 introductions using an adapted scheme (Runte et al. 2022) based on Weder (2015). The extracted 9032 step-annotated sentences served both as a resource for corpus-linguistic investigations and training material for machine learning algorithms. The corpus-linguistic findings were used as additional features both for training models and to define feedback categories to be shown to students. As a result of our corpus-linguistic exploration, we could classify two kinds of keywords: topic-specific and procedural keywords and keyword phrases. Additionally, we identified specific linguistic features associated with certain steps. These findings were used twofold: (1) to train recommender systems for semi-automatic annotation allowing us to annotate a larger amount of introductions needed to train the automatic step detection system, and (2) as input for the creation of 1268 synthetic sentences with the API of OpenAI’s GPT-3 to arrive at a decent amount of balanced training data for our models. Additionally, we use the phrases and features to provide students with real-world examples from fellow students.
URI: https://digitalcollection.zhaw.ch/handle/11475/31099
Volltext Version: Publizierte Version
Lizenz (gemäss Verlagsvertrag): Keine Angabe
Departement: Angewandte Linguistik
Organisationseinheit: Institute of Language Competence (ILC)
Publiziert im Rahmen des ZHAW-Projekts: Digital Literacy im Hochschulkontext (DigLit)
Enthalten in den Sammlungen:Publikationen Angewandte Linguistik

Dateien zu dieser Ressource:
Es gibt keine Dateien zu dieser Ressource.
Zur Langanzeige
Runte, M., Mahlow, C., Ulasik, M. A., & Cho, S. (2024, June 26). Using synthetic sentences for developing a corpus-based feedback service for student writing. SIG Writing, Paris Nanterre University, France, 26-28 June 2024.
Runte, M. et al. (2024) ‘Using synthetic sentences for developing a corpus-based feedback service for student writing’, in SIG Writing, Paris Nanterre University, France, 26-28 June 2024.
M. Runte, C. Mahlow, M. A. Ulasik, and S. Cho, “Using synthetic sentences for developing a corpus-based feedback service for student writing,” in SIG Writing, Paris Nanterre University, France, 26-28 June 2024, Jun. 2024.
RUNTE, Maren, Cerstin MAHLOW, Malgorzata Anna ULASIK und Sooyeon CHO, 2024. Using synthetic sentences for developing a corpus-based feedback service for student writing. In: SIG Writing, Paris Nanterre University, France, 26-28 June 2024. Conference paper. 26 Juni 2024
Runte, Maren, Cerstin Mahlow, Malgorzata Anna Ulasik, and Sooyeon Cho. 2024. “Using Synthetic Sentences for Developing a Corpus-Based Feedback Service for Student Writing.” Conference paper. In SIG Writing, Paris Nanterre University, France, 26-28 June 2024.
Runte, Maren, et al. “Using Synthetic Sentences for Developing a Corpus-Based Feedback Service for Student Writing.” SIG Writing, Paris Nanterre University, France, 26-28 June 2024, 2024.


Alle Ressourcen in diesem Repository sind urheberrechtlich geschützt, soweit nicht anderweitig angezeigt.