This work introduces a multiple connectionist architecture based on a mixture of Recurrent Neural Networks to approach the problem of speaker adaptation in the acoustic feature domain (i.e. speaker normalization). Normalization is applied to the case of a speaker-independent (SI) speech recognition system based on continuous density hidden Markov models. The technique for combining multiple recurrent models is discussed. Recognition experiments with a continuous speech large dictionary task shows that the proposed architecture is capable to tangibly improve recognition performance, allowing for a 21.9% reduction of the word error rate. © 1997 ESANN. All Rights Reserved.
Trentin, E., Giuliani, D. (1997). Speaker normalization with a mixture of recurrent networks. In Proceedings of ESANN97, European Symposium on Artificial Neural Networks (pp.1-6). D-facto, Padova.
Speaker normalization with a mixture of recurrent networks
TRENTIN, EDMONDO;
1997-01-01
Abstract
This work introduces a multiple connectionist architecture based on a mixture of Recurrent Neural Networks to approach the problem of speaker adaptation in the acoustic feature domain (i.e. speaker normalization). Normalization is applied to the case of a speaker-independent (SI) speech recognition system based on continuous density hidden Markov models. The technique for combining multiple recurrent models is discussed. Recognition experiments with a continuous speech large dictionary task shows that the proposed architecture is capable to tangibly improve recognition performance, allowing for a 21.9% reduction of the word error rate. © 1997 ESANN. All Rights Reserved.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11365/5151
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo
