This paper faces the problem of extracting knowledge from raw text. We present a deep architecture in the framework of Learning from Constraints [5] that is trained to identify mentions to entities and relations belonging to a given ontology. Each input word is encoded into two latent representations with different coverage of the local context, that are exploited to predict the type of entity and of relation to which the word belongs. Our model combines an entropy-based regularizer and a set of First-Order Logic formulas that bridge the predictions on entity and relation types accordingly to the ontology structure. As a result, the system generates symbolic descriptions of the raw text that are interpretable and well-suited to attach human-level knowledge. We evaluate the model on a dataset composed of sentences about simple facts, that we make publicly available. The proposed system can efficiently learn to discover mentions with very few human supervisions and that the relation to knowledge in the form of logic constraints improves the quality of the system predictions.

Zugarini, A., Morvan, J., Melacci, S., Knerr, S., Gori, M. (2018). Combining deep learning and symbolic processing for extracting knowledge from raw text. In Artificial Neural Networks in Pattern Recognition. ANNPR 2018 (pp.90-101). Berlin : Springer Verlag [10.1007/978-3-319-99978-4_7].

Combining deep learning and symbolic processing for extracting knowledge from raw text

Zugarini, Andrea;Melacci, Stefano
;
Gori, Marco
2018-01-01

Abstract

This paper faces the problem of extracting knowledge from raw text. We present a deep architecture in the framework of Learning from Constraints [5] that is trained to identify mentions to entities and relations belonging to a given ontology. Each input word is encoded into two latent representations with different coverage of the local context, that are exploited to predict the type of entity and of relation to which the word belongs. Our model combines an entropy-based regularizer and a set of First-Order Logic formulas that bridge the predictions on entity and relation types accordingly to the ontology structure. As a result, the system generates symbolic descriptions of the raw text that are interpretable and well-suited to attach human-level knowledge. We evaluate the model on a dataset composed of sentences about simple facts, that we make publicly available. The proposed system can efficiently learn to discover mentions with very few human supervisions and that the relation to knowledge in the form of logic constraints improves the quality of the system predictions.
2018
9783319999777
Zugarini, A., Morvan, J., Melacci, S., Knerr, S., Gori, M. (2018). Combining deep learning and symbolic processing for extracting knowledge from raw text. In Artificial Neural Networks in Pattern Recognition. ANNPR 2018 (pp.90-101). Berlin : Springer Verlag [10.1007/978-3-319-99978-4_7].
File in questo prodotto:
File Dimensione Formato  
melacci_ANNPR2018.pdf

non disponibili

Tipologia: PDF editoriale
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 1.17 MB
Formato Adobe PDF
1.17 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1065983