FAIR Tool Discovery

an automated software metadata harvesting pipeline for CLARIAH

Authors

  • Maarten van Gompel KNAW Humanities Cluster, Amsterdam, the Netherlands
  • Menzo Windhouwer KNAW Humanities Cluster, Amsterdam, the Netherlands

DOI:

https://doi.org/10.3384/ecp216.12

Keywords:

software metadata, linked open data, metadata harvesting

Abstract

We present the Tool Discovery pipeline, a core component of the CLARIAH infrastructure in the Netherlands. This pipeline harvests software metadata from the source, detects existing heterogeneous metadata formats already in use by software developers, and converts them to a single uniform representation based on schema.org and codemeta. The resulting data is then made available for further ingestion into other user-facing catalogue/portal systems.

Downloads

Published

2025-08-25