Selected papers from the CLARIN Annual Conference 2022

					View Selected papers from the CLARIN Annual Conference 2022

This volume presents the highlights of the eleventh CLARIN Annual Conference in 2022. The conference
was held from 10 to 12 October 2022 as a hybrid event, with Prague, Czechia, as the venue.

CLARIN, the Common Language Resources and Technology Infrastructure, is a virtual platform that is accessible to everyone interested in language. CLARIN offers access to language resources, technology, and knowledge, and enables cross-country collaboration among academia, industry, policy-makers, cultural institutions, and the general public. Researchers, students, and citizens are offered access to digital language resources and technology services to deploy, connect, analyse and sustain such resources. In line with the Open Science agenda, CLARIN enables scholars from the Social Sciences and Humanities (SSH) and beyond to engage in and contribute to cutting-edge, data-driven research based on language data in a range of formats and modalities.

Series: Linköping Electronic Conference Proceedings 198
Editors: Tomaž Erjavec and Maria Eskevich
ISBN: 978-91-8075-254-1
ISSN: 1650-3686 (print), 1650-3740 (online)

Published: 2023-06-09

Contents

  • Analysing Changes in Official Use of the Design Concept Using SweCLARIN Resources

    Lars Ahrenberg, Daniel Holmer, Stefan Holmlid, Arne Jönsson
    1-11
    DOI: https://doi.org/10.3384/ecp198001
  • The CLaDA-BG Dictionary Creation System: Specifics and Perspectives

    Zhivko Angelov, Kiril Simov, Petya Osenova, Zara Kancheva
    12-22
    DOI: https://doi.org/10.3384/ecp198002
  • Linguistic Autobiographies. Towards the Creation of a Multilingual Resource Family

    Silvia Calamai, Rosalba Nodari, Claudia Soria, Alessandro Carlucci
    23-32
    DOI: https://doi.org/10.3384/ecp198003
  • The Pipeline for Publishing Resources in the Language Bank of Finland

    Ute Dieckmann, Mietta Lennes, Jussi Piitulainen, Jyrki Niemi, Erik Axelson, Tommi Jauhiainen, Krister Lindén
    33-43
    DOI: https://doi.org/10.3384/ecp198004
  • TEI and Git in ParlaMint: Collaborative Development of Language Resources

    Tomaž Erjavec, Matyáš Kopp, Katja Meden
    44-56
    DOI: https://doi.org/10.3384/ecp198005
  • EU Data Governance Act: Outlining a Potential Role for CLARIN

    Paweł Kamocki, Krister Linden, Andrius Puksas, Aleksei Kelli
    57-65
    DOI: https://doi.org/10.3384/ecp198006
  • Semantic Classification of Prepositions in BulTreeBank WordNet

    Zara Kancheva
    66-76
    DOI: https://doi.org/10.3384/ecp198007
  • Neural Metaphor Detection for Slovene

    Matej Klemen, Marko Robnik-Šikonja
    77-89
    DOI: https://doi.org/10.3384/ecp198008
  • Evaluation of the Archivio Vi.Vo Architecture: A Case Study on the Reuse of Legacy Data for Linguistic Purposes

    Roberta Bianca Luzietti
    90-98
    DOI: https://doi.org/10.3384/ecp198009
  • It-Sr-NER: CLARIN Compatible NER and Geoparsing Web Services for Italian and Serbian Parallel Text

    Olja Perišić, Ranka Stanković, Milica Ikonić Nešić, Mihailo Škorić
    99-110
    DOI: https://doi.org/10.3384/ecp198010
  • Lemmatizing and POS-tagging Akkadian with BabyLemmatizer and Dictionary-Based Post-Correction

    Aleksi Sahala, Tero Alstola, Jonathan Valk, Krister Lindén
    111-119
    DOI: https://doi.org/10.3384/ecp198011
  • Developing Resources for Measuring Text Readability in Sesotho

    Johannes Sibeko
    120-132
    DOI: https://doi.org/10.3384/ecp198012
  • WebLicht-Batch -- A Web-Based Interface for Batch Processing Large Input with the WebLicht Workflow Engine

    Claus Zinn, Ben Campbell
    133-141
    DOI: https://doi.org/10.3384/ecp198013