The issue of metadata is crucially intertwined with interoperability, accessibility, and re-use of research data. On the other hand, such issue is particularly cumbersome in the case of speech and oral archives, which are often scattered among different public and private institutions. From this respect, the aim of the European Common Language Resources and Infrastructure for Social Sciences and Humanities (CLARIN) is to provide easy access to digital written, spoken, video, multimodal language data and at the same time to offer advanced tools to discover, annotate, and analyse them. The paper presents CLARIN and its Component MetaData Infrastructure (CMDI) and it further discusses the relevant profiles for the description of speech and oral archives. Since Italy has recently joined the CLARIN infrastructure consortium, it additionally analyses the accessibility of Italian corpora via the Virtual Language Observatory (VLO) and via the Parlare Italiano web portal.
Calamai, S., Frontini, F. (2018). Speech audio archives and CLARIN metadata. In Amedeo De Dominicis (a cura di), Speech audio archives: preservation, restoration, annotation aimed at supporting the linguistic analysis (pp. 11-28). Roma : Accademia Nazionale dei Lincei.
Speech audio archives and CLARIN metadata
Calamai, Silvia;
2018-01-01
Abstract
The issue of metadata is crucially intertwined with interoperability, accessibility, and re-use of research data. On the other hand, such issue is particularly cumbersome in the case of speech and oral archives, which are often scattered among different public and private institutions. From this respect, the aim of the European Common Language Resources and Infrastructure for Social Sciences and Humanities (CLARIN) is to provide easy access to digital written, spoken, video, multimodal language data and at the same time to offer advanced tools to discover, annotate, and analyse them. The paper presents CLARIN and its Component MetaData Infrastructure (CMDI) and it further discusses the relevant profiles for the description of speech and oral archives. Since Italy has recently joined the CLARIN infrastructure consortium, it additionally analyses the accessibility of Italian corpora via the Virtual Language Observatory (VLO) and via the Parlare Italiano web portal.File | Dimensione | Formato | |
---|---|---|---|
03 - Frontini - Calamai.pdf
non disponibili
Descrizione: articolo
Tipologia:
PDF editoriale
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
401.54 kB
Formato
Adobe PDF
|
401.54 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11365/1060508