In this paper we present a first version of a transcription portal for audio files based on automatic speech recognition (ASR) in various languages. The portal is implemented in the CLARIN resources research network and intended for use by non-technical scholars. We explain the background and interdisciplinary nature of interview data, the perks and quirks of using ASR for transcribing the audio in a research context, the dos and don’ts for optimal use of the portal, and future developments foreseen. The portal is promoted in a range of workshops, but there are a number of challenges that have to be met. These challenges concern privacy issues, ASR quality, and cost, amongst others.

Draxler, C., van den Heuvel, H., van Hessen, A., Calamai, S., Corti, L., Scagliola, S. (2020). A CLARIN Transcription Portal for Interview Data. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 20) (pp.3353-3359).

A CLARIN Transcription Portal for Interview Data

S. Calamai;
2020-01-01

Abstract

In this paper we present a first version of a transcription portal for audio files based on automatic speech recognition (ASR) in various languages. The portal is implemented in the CLARIN resources research network and intended for use by non-technical scholars. We explain the background and interdisciplinary nature of interview data, the perks and quirks of using ASR for transcribing the audio in a research context, the dos and don’ts for optimal use of the portal, and future developments foreseen. The portal is promoted in a range of workshops, but there are a number of challenges that have to be met. These challenges concern privacy issues, ASR quality, and cost, amongst others.
2020
Draxler, C., van den Heuvel, H., van Hessen, A., Calamai, S., Corti, L., Scagliola, S. (2020). A CLARIN Transcription Portal for Interview Data. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 20) (pp.3353-3359).
File in questo prodotto:
File Dimensione Formato  
2020.lrec-1.411.pdf

accesso aperto

Descrizione: articolo
Tipologia: PDF editoriale
Licenza: Creative commons
Dimensione 518.12 kB
Formato Adobe PDF
518.12 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1128795