An overview of Grammatical Error Correction for the twelve MultiGEC-2025 languages

dc.contributor.authorMasciolini, Arianna
dc.contributor.authorCaines, Andrew
dc.contributor.authorDe Clercq, Orphée
dc.contributor.authorKruijsbergen, Joni
dc.contributor.authorKurfalı, Murathan
dc.contributor.authorMuñoz Sánchez, Ricardo
dc.contributor.authorVolodina, Elena
dc.contributor.authorÖstling, Robert
dc.contributor.authorAllkivi, Kais
dc.contributor.authorArhar Holdt, Špela
dc.contributor.authorAuzin̦a, Ilze
dc.contributor.authorDarģis, Roberts
dc.contributor.authorDrakonaki, Elena
dc.contributor.authorFrey, Jennifer-Carmen
dc.contributor.authorGlišic, Isidora
dc.contributor.authorKikilintza, Pinelopi
dc.contributor.authorNicolas, Lionel
dc.contributor.authorRomanyshyn, Mariana
dc.contributor.authorRosen, Alexandr
dc.contributor.authorRozovskaya, Alla
dc.contributor.authorSuluste, Kristjan
dc.contributor.authorSyvokon, Oleksiy
dc.contributor.authorTantos, Alexandros
dc.contributor.authorTouriki, Despoina-Ourania
dc.contributor.authorTsiotskas, Konstantinos
dc.contributor.authorTsourilla, Eleni
dc.contributor.authorVarsamopoulos, Vassilis
dc.contributor.authorWisniewski, Katrin
dc.contributor.authorŽagar, Aleš
dc.contributor.authorZesch, Torsten
dc.contributor.organizationSpråkbanken Text, SFS, University of Gothenburg, Swedensv
dc.contributor.organizationUniversity of Cambridge, UKsv
dc.contributor.organizationGhent University, Belgiumsv
dc.contributor.organizationRISE Research Institutes of Sweden, Swedensv
dc.contributor.organizationStockholm University, Swedensv
dc.contributor.organizationTallinn University, Estoniasv
dc.contributor.organizationUniversity of Ljubljana, Sloveniasv
dc.contributor.organizationIMCS at the University of Latvia, Latviasv
dc.contributor.organizationAristotle University of Thessaloniki, Greecesv
dc.contributor.organizationEurac Research Bolzano, Italysv
dc.contributor.organizationUniversity of Iceland, Icelandsv
dc.contributor.organizationGrammarlysv
dc.contributor.organizationCharles University, Czech Republicsv
dc.contributor.organizationCity University of New York (CUNY), USAsv
dc.contributor.organizationInstitute of the Estonian Language, Estoniasv
dc.contributor.organizationMicrosoftsv
dc.contributor.organizationLeipzig University, Germanysv
dc.contributor.organizationFernUniversität in Hagen, Germanysv
dc.date.accessioned2025-01-31T07:57:03Z
dc.date.available2025-01-31T07:57:03Z
dc.date.issued2025-01-31
dc.description.abstractThis overview is complementary to the comprehensive dataset description article for MultiGEC – a dataset for Multilingual Grammatical Error Correction including data for twelve European languages: Czech, English, Estonian, German, Greek, Icelandic, Italian, Latvian, Russian, Slovene, Swedish and Ukrainian. It is well-known that in the field of Natural Language Processing (NLP) most publications tend to focus on the English language. While this is due to historical reasons (ease of publication, greater outreach, increased number of citations, etc.), it does leave other languages at a disadvantage across multiple tasks. The MultiGEC dataset was created as an attempt to counteract this effect. This report provides a historical overview of the evolution of GEC for each of the twelve languages in this dataset and provides a context for the work on the dataset and the related MultiGEC-2025 shared task.sv
dc.format.extent9 pagessv
dc.identifier.issn1401-5919
dc.identifier.urihttps://hdl.handle.net/2077/84800
dc.language.isoengsv
dc.relation.ispartofseriesGU-ISS-2025-01sv
dc.subjectGrammatical Error Correctionsv
dc.subjectLanguage Technologysv
dc.subjectNatural Language Processingsv
dc.subjectshared tasksv
dc.subjectMultiGEC-2025sv
dc.subjectComputational SLAsv
dc.titleAn overview of Grammatical Error Correction for the twelve MultiGEC-2025 languagessv
dc.typeTextsv
dc.type.svepreportsv

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2025_MultiGEC_GEC_overview.pdf
Size:
147.83 KB
Format:
Adobe Portable Document Format
Description:
Report

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.68 KB
Format:
Item-specific license agreed upon to submission
Description: