The paper describes two applications of radial basis function networks to automatic speech recognition. We used local basis networks of elliptical kernels of different functional form, with recursive allocation of units and on-line optimization of parameters (GRAN model). In the first application, the neural network is used as a front end of a continuous speech speaker-dependent recognition system to normalize the input data from new speakers. With a limited amount of new acoustic data, the recognition error of phone units from the Italian speech corpus APASCI is decreased with an adaptability ratio of 25%. The same model has also been applied in a speaker identification task on a database collected at IRST consisting of isolated digits. An identification error rate of 17% has been obtained on the whole database (50 speakers).

Furlanello, C., Giuliani, D., Trentin, E., Falavigna, D. (1995). Applications of generalized radial basis functions in speaker normalization and identification. In Proceedings of ISCAS '95, IEEE International Symposium on Circuit and Systems (pp.1704-1707). IEEE.

Applications of generalized radial basis functions in speaker normalization and identification

Trentin E.;
1995-01-01

Abstract

The paper describes two applications of radial basis function networks to automatic speech recognition. We used local basis networks of elliptical kernels of different functional form, with recursive allocation of units and on-line optimization of parameters (GRAN model). In the first application, the neural network is used as a front end of a continuous speech speaker-dependent recognition system to normalize the input data from new speakers. With a limited amount of new acoustic data, the recognition error of phone units from the Italian speech corpus APASCI is decreased with an adaptability ratio of 25%. The same model has also been applied in a speaker identification task on a database collected at IRST consisting of isolated digits. An identification error rate of 17% has been obtained on the whole database (50 speakers).
1995
0-7803-2570-2
Furlanello, C., Giuliani, D., Trentin, E., Falavigna, D. (1995). Applications of generalized radial basis functions in speaker normalization and identification. In Proceedings of ISCAS '95, IEEE International Symposium on Circuit and Systems (pp.1704-1707). IEEE.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/5162
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo