The Pipeline for Publishing Resources in the Language Bank of Finland

Authors

  • Ute Dieckmann
  • Mietta Lennes
  • Jussi Piitulainen
  • Jyrki Niemi
  • Erik Axelson
  • Tommi Jauhiainen
  • Krister Lindén

DOI:

https://doi.org/10.3384/ecp198004

Keywords:

Language resource, Corpus processing, Data management, Deposition agreement, Workflow, Licensing, Downloadable resources, Korp

Abstract

We present the process of publishing resources in Kielipankki, the Language Bank of Finland. Our pipeline includes all the steps that are needed to publish a resource: from finding and receiving the original data until making the data available via different platforms, e.g., the Korp concordance tool or the download service. Our goal is to standardize the publishing process by creating an ordered checklist of tasks with the corresponding documentation and by developing conversion scripts and processing tools that can be shared and applied on different resources.

Downloads

Published

2023-06-09