The issue of metadata is crucially intertwined with interoperability, accessibility, and re-use of research data. On the other hand, such issue is particularly cumbersome in the case of speech and oral archives, which are often scattered among different public and private institutions. From this respect, the aim of the European Common Language Resources and Infrastructure for Social Sciences and Humanities (CLARIN) is to provide easy access to digital written, spoken, video, multimodal language data and at the same time to offer advanced tools to discover, annotate, and analyse them. The paper presents CLARIN and its Component MetaData Infrastructure (CMDI) and it further discusses the relevant profiles for the description of speech and oral archives. Since Italy has recently joined the CLARIN infrastructure consortium, it additionally analyses the accessibility of Italian corpora via the Virtual Language Observatory (VLO) and via the Parlare Italiano web portal.

Calamai, S., Frontini, F. (2018). Speech audio archives and CLARIN metadata. In Amedeo De Dominicis (a cura di), Speech audio archives: preservation, restoration, annotation aimed at supporting the linguistic analysis (pp. 11-28). Roma : Accademia Nazionale dei Lincei.

Speech audio archives and CLARIN metadata

Calamai, Silvia;
2018-01-01

Abstract

The issue of metadata is crucially intertwined with interoperability, accessibility, and re-use of research data. On the other hand, such issue is particularly cumbersome in the case of speech and oral archives, which are often scattered among different public and private institutions. From this respect, the aim of the European Common Language Resources and Infrastructure for Social Sciences and Humanities (CLARIN) is to provide easy access to digital written, spoken, video, multimodal language data and at the same time to offer advanced tools to discover, annotate, and analyse them. The paper presents CLARIN and its Component MetaData Infrastructure (CMDI) and it further discusses the relevant profiles for the description of speech and oral archives. Since Italy has recently joined the CLARIN infrastructure consortium, it additionally analyses the accessibility of Italian corpora via the Virtual Language Observatory (VLO) and via the Parlare Italiano web portal.
2018
978-88-218-1165-4
Calamai, S., Frontini, F. (2018). Speech audio archives and CLARIN metadata. In Amedeo De Dominicis (a cura di), Speech audio archives: preservation, restoration, annotation aimed at supporting the linguistic analysis (pp. 11-28). Roma : Accademia Nazionale dei Lincei.
File in questo prodotto:
File Dimensione Formato  
03 - Frontini - Calamai.pdf

non disponibili

Descrizione: articolo
Tipologia: PDF editoriale
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 401.54 kB
Formato Adobe PDF
401.54 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1060508