The opaqueness of deep neural networks hinders their employment in safety-critical applications. This issue is driving the research community to focus on eXplainable Artificial Intelligence (XAI) techniques. XAI algorithms can be categorized into two types: those that explain the predictions of black-box models and those that create interpretable models from the start. While interpretable models foster user trust, their performance is normally inferior to traditional black-box models like neural networks. To fill this gap, this chapter presents an extensive framework introducing special neural networks known as Logic Explained Networks (LENs). The most notable advantage of this approach is that LENs achieve performances that are comparable to the state-of-the-art neural networks while providing concise First-Order Logic explanations. Moreover, LENs are designed to learn various types of explanations, such as local or global, by associating related task functions or directly explaining a single class based on the others and/or the input data. They can offer explanations for their behaviour or that of other black-box models. Finally, LENs can be applied in many learning scenarios: they can be trained using supervised or unsupervised learning and applied across different domains like tabular data, computer vision, natural language processing, and relational tasks.

Ciravegna, G., Giannini, F., Barbiero, P., Gori, M., Lio, P., Maggini, M., et al. (2023). Learning Logic Explanations by Neural Networks. In P. Hitzler, Md K. Sarker, A. Eberhart (a cura di), Compendium of Neurosymbolic Artificial Intelligence (pp. 547-558). IOS Press [10.3233/FAIA230157].

Learning Logic Explanations by Neural Networks

Giannini, Francesco;Gori, Marco;Maggini, Marco;Melacci, Stefano
2023-01-01

Abstract

The opaqueness of deep neural networks hinders their employment in safety-critical applications. This issue is driving the research community to focus on eXplainable Artificial Intelligence (XAI) techniques. XAI algorithms can be categorized into two types: those that explain the predictions of black-box models and those that create interpretable models from the start. While interpretable models foster user trust, their performance is normally inferior to traditional black-box models like neural networks. To fill this gap, this chapter presents an extensive framework introducing special neural networks known as Logic Explained Networks (LENs). The most notable advantage of this approach is that LENs achieve performances that are comparable to the state-of-the-art neural networks while providing concise First-Order Logic explanations. Moreover, LENs are designed to learn various types of explanations, such as local or global, by associating related task functions or directly explaining a single class based on the others and/or the input data. They can offer explanations for their behaviour or that of other black-box models. Finally, LENs can be applied in many learning scenarios: they can be trained using supervised or unsupervised learning and applied across different domains like tabular data, computer vision, natural language processing, and relational tasks.
2023
9781643684062
9781643684079
Ciravegna, G., Giannini, F., Barbiero, P., Gori, M., Lio, P., Maggini, M., et al. (2023). Learning Logic Explanations by Neural Networks. In P. Hitzler, Md K. Sarker, A. Eberhart (a cura di), Compendium of Neurosymbolic Artificial Intelligence (pp. 547-558). IOS Press [10.3233/FAIA230157].
File in questo prodotto:
File Dimensione Formato  
nesy_chapter_2023_accepted.pdf

non disponibili

Tipologia: Post-print
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 269.85 kB
Formato Adobe PDF
269.85 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
melacci_COMPENDIUMNESY2023.pdf

non disponibili

Tipologia: PDF editoriale
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 443.34 kB
Formato Adobe PDF
443.34 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1245614