Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Beyond Benchmarks: Spotting Key Topical Sentences While Improving Automated Essay Scoring Performance with Topic-Aware BERT
Stockholms universitet, Samhällsvetenskapliga fakulteten, Institutionen för data- och systemvetenskap.ORCID-id: 0000-0002-0945-707x
Stockholms universitet, Samhällsvetenskapliga fakulteten, Institutionen för data- och systemvetenskap.ORCID-id: 0000-0001-9731-1048
Stockholms universitet, Samhällsvetenskapliga fakulteten, Institutionen för data- och systemvetenskap.ORCID-id: 0000-0002-9942-8730
Stockholms universitet, Samhällsvetenskapliga fakulteten, Institutionen för data- och systemvetenskap.
Visa övriga samt affilieringar
2023 (Engelska)Ingår i: Electronics, E-ISSN 2079-9292, Vol. 12, nr 1, artikel-id 150Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Automated Essay Scoring (AES) automatically allocates scores to essays at scale and may help teachers reduce the heavy burden during grading activities. Recently, researchers have deployed neural-based AES approaches to improve upon the state-of-the-art AES performance. These neural-based AES methods mainly take student essays as the sole input and focus on learning the relationship between student essays and essay scores through deep neural networks. However, their only product, the predicted holistic score, is far from providing adequate pedagogical information, such as automated writing evaluation (AWE). In this work, we propose Topic-aware BERT, a new method of learning relations among scores, student essays, as well as topical information in essay instructions. Beyond improving the AES benchmark performance, Topic-aware BERT can automatically retrieve key topical sentences in student essays by probing self-attention maps in intermediate layers. We evaluate the performance of Topic-aware BERT of different variants to (i) perform AES and (ii) retrieve key topical sentences using the open dataset Automated Student Assessment Prize and a manually annotated dataset. Our experiments show that Topic-aware BERT achieves a strong AES performance compared with the previous best neural-based AES methods and demonstrates effectiveness in identifying key topical sentences in argumentative essays.

Ort, förlag, år, upplaga, sidor
2023. Vol. 12, nr 1, artikel-id 150
Nyckelord [en]
Artificial Intelligence, Natural Language Processing, Automated Essay Scoring, Automated Writing Evaluation, BERT
Nationell ämneskategori
Språkbehandling och datorlingvistik
Forskningsämne
data- och systemvetenskap
Identifikatorer
URN: urn:nbn:se:su:diva-213557DOI: 10.3390/electronics12010150ISI: 000910414500001Scopus ID: 2-s2.0-85145830143OAI: oai:DiVA.org:su-213557DiVA, id: diva2:1724814
Anmärkning

This article belongs to the Special Issue Artificial Intelligence Solutions and Applications for Distributed Systems in Smart Spaces

Tillgänglig från: 2023-01-09 Skapad: 2023-01-09 Senast uppdaterad: 2025-02-07Bibliografiskt granskad
Ingår i avhandling
1. Exploring the Educational Utility of Pretrained Language Models
Öppna denna publikation i ny flik eller fönster >>Exploring the Educational Utility of Pretrained Language Models
2024 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

The emergence of pretrained language models has profoundly reshaped natural language processing, serving as foundation models for a wide range of tasks. Over the past decade, pretrained language models have evolved significantly, leading to the development of different types of models and approaches for utilising them. This progression spans from static to contextual models and from smaller models to more powerful, generative large language models. The increasing capabilities of these models have, in turn, led to growing interest in exploring new use cases and applications across various domains, including education, where digitalisation has created opportunities for AI applications that leverage pretrained language models, particularly due to the abundance of text data in educational contexts.

This thesis explores the educational utility of pretrained language models, specifically by investigating how different paradigms of these models can be applied to address tasks in education. These paradigms include various methodologies for leveraging the knowledge embedded in pretrained language models, such as embeddings, fine-tuning, prompt-based learning, and in-context learning. For collaborative learning group formation, a clustering approach based on pretrained embeddings is proposed, enabling the creation of either homogeneous or heterogeneous groups depending on the specific learning situation. For automated essay scoring, a pretrained language model is fine-tuned using both the essay instructions and the essay text as input; the proposed method also highlights key topical sentences that contribute to the predicted essay score. For educational question generation, a method based on prompt-based learning is introduced and shown to be more data-efficient than existing methods. Finally, for educational question answering, certain limitations of the in-context learning (or prompting) paradigm, such as a tendency of large language models to hallucinate or miscalculate, are addressed. Specifically, workflows and prompting strategies based on retrieval-augmented generation and tool-augmented generation are proposed, allowing large language models to ground answers in specific learning materials and to leverage external tools, such as calculators and knowledge bases, within chain-of-thought reasoning processes. These strategies are shown to produce more reliable and transparent answers to complex questions.

Through five empirical studies, methodological innovations within each paradigm of pretrained language models are proposed and evaluated for specific educational use cases. In addition to contributing methodologically to natural language processing, the results demonstrate the potential utility of pretrained language models in educational AI applications, thereby advancing the field of technology enhanced learning. The proposed methods not only improve predictive performance on specific tasks but also aim to enhance the transparency of pretrained language models, which is essential for building reliable and trustworthy educational AI applications.

Ort, förlag, år, upplaga, sidor
Stockholm: Department of Computer and Systems Sciences, Stockholm University, 2024
Serie
Report Series / Department of Computer & Systems Sciences, ISSN 1101-8526 ; 24-017
Nyckelord
Natural Language Processing, Technology Enhanced Learning, Pretrained Language Models, Large Language Models, Generative AI, Collaborative Learning, Automated Essay Scoring, Educational Question Generation, Educational Question Answering
Nationell ämneskategori
Datavetenskap (datalogi) Språkbehandling och datorlingvistik
Forskningsämne
data- och systemvetenskap
Identifikatorer
urn:nbn:se:su:diva-235084 (URN)978-91-8107-014-9 (ISBN)978-91-8107-015-6 (ISBN)
Disputation
2024-12-16, L30, Nodhuset, Borgarfjordsgatan 12, Kista., 09:00 (Engelska)
Opponent
Handledare
Tillgänglig från: 2024-11-21 Skapad: 2024-10-30 Senast uppdaterad: 2025-02-01Bibliografiskt granskad

Open Access i DiVA

fulltext(502 kB)291 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 502 kBChecksumma SHA-512
e7b0085599491738148715764e55a774b66dd723b449bf20a892a91d2c39491a610c56a90e9cf84578506816617da84f48428f2a95993fb6b57da318d0acd0f6
Typ fulltextMimetyp application/pdf

Övriga länkar

Förlagets fulltextScopus

Person

Wu, YongchaoHenriksson, AronNouri, JalalDuneld, MartinLi, Xiu

Sök vidare i DiVA

Av författaren/redaktören
Wu, YongchaoHenriksson, AronNouri, JalalDuneld, MartinLi, Xiu
Av organisationen
Institutionen för data- och systemvetenskap
I samma tidskrift
Electronics
Språkbehandling och datorlingvistik

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 294 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 440 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf