The prediction of the linguistic origin of surnames is a basic functionality required in the design of high-quality multilanguage speech synthesizers. The assignment of a given string representing a surname to a specific language is typically based on a set of rules which can hardly be written in an explicit form. The approach we propose faces this problem combining a rule-based system with a module based on evidential reasoning and a module based on neural networks. The resulting hybrid system combines the different sources of information, merging both knowledge from experts on linguistics and knowledge automatically acquired using learning from examples. The system has been validated on a large database containing surnames belonging to four different languages, showing its effectiveness for real-world applications.

P., B., Gori, M., Maggini, M., Scarselli, F., J., S. (2003). A hybrid model for the prediction of the linguistic origin of surnames. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 15(3), 760-763 [10.1109/TKDE.2003.1198404].

A hybrid model for the prediction of the linguistic origin of surnames

GORI, MARCO;MAGGINI, MARCO;SCARSELLI, FRANCO;
2003-01-01

Abstract

The prediction of the linguistic origin of surnames is a basic functionality required in the design of high-quality multilanguage speech synthesizers. The assignment of a given string representing a surname to a specific language is typically based on a set of rules which can hardly be written in an explicit form. The approach we propose faces this problem combining a rule-based system with a module based on evidential reasoning and a module based on neural networks. The resulting hybrid system combines the different sources of information, merging both knowledge from experts on linguistics and knowledge automatically acquired using learning from examples. The system has been validated on a large database containing surnames belonging to four different languages, showing its effectiveness for real-world applications.
2003
P., B., Gori, M., Maggini, M., Scarselli, F., J., S. (2003). A hybrid model for the prediction of the linguistic origin of surnames. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 15(3), 760-763 [10.1109/TKDE.2003.1198404].
File in questo prodotto:
File Dimensione Formato  
TKDE03.pdf

non disponibili

Tipologia: Post-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 278.96 kB
Formato Adobe PDF
278.96 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/21356
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo