Swedish MuClaGED: A new dataset for Grammatical Error Detection in Swedish
DOI:
https://doi.org/10.3384/ecp190004
Keywords:
grammatical error detection, L2 Swedish, shared task, SweLL
Abstract
This paper introduces Swedish MuClaGED, a new dataset built specifically for the task of Multi-Class Grammatical Error Detection (GED). The dataset was produced as part of the multilingual Computational SLA shared task initiative. We describe the generation process and the design choices behind Swedish MuClaGED. We also report initial baseline results for sentence-level grammatical error detection and classification on the dataset, obtained with (Bi)LSTM ((Bidirectional) Long Short-Term Memory) models.
Published
2022-12-02
Section
Contents
License
Copyright (c) 2022 Judit Casademont Moner, Elena Volodina
This work is licensed under a Creative Commons Attribution 4.0 International License.