Swedish MuClaGED: A new dataset for Grammatical Error Detection in Swedish

Judit Casademont Moner; Elena Volodina

doi:10.3384/ecp190004

Authors

Judit Casademont Moner University of Gothenburg
Elena Volodina University of Gothenburg

DOI:

https://doi.org/10.3384/ecp190004

Keywords:

grammatical error detection, L2 Swedish, shared task, SweLL

Abstract

This paper introduces the Swedish Mu-ClaGED dataset, a new dataset specifically built for the task of Multi-Class Grammatical Error Detection (GED). The dataset has been produced as a part of the multilingual Computational SLA shared task initiative. In this paper we elaborate on the generation process and the design choices made to obtain Swedish MuClaGED. We also show initial baseline results for the performance on the dataset in a task of Grammatical Error Detection and Classification on the sentence level, which have been obtained through (Bi)LSTM ((Bidirectional) Long-Short Term Memory) methods.

Swedish MuClaGED: A new dataset for Grammatical Error Detection in Swedish

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License