The opaqueness of deep neural networks hinders their employment in safety-critical applications. This issue is driving the research community to focus on eXplainable Artificial Intelligence (XAI) techniques. XAI algorithms can be categorized into two types: those that explain the predictions of black-box models and those that create interpretable models from the start. While interpretable models foster user trust, their performance is normally inferior to traditional black-box models like neural networks. To fill this gap, this chapter presents an extensive framework introducing special neural networks known as Logic Explained Networks (LENs). The most notable advantage of this approach is that LENs achieve performances that are comparable to the state-of-the-art neural networks while providing concise First-Order Logic explanations. Moreover, LENs are designed to learn various types of explanations, such as local or global, by associating related task functions or directly explaining a single class based on the others and/or the input data. They can offer explanations for their behaviour or that of other black-box models. Finally, LENs can be applied in many learning scenarios: they can be trained using supervised or unsupervised learning and applied across different domains like tabular data, computer vision, natural language processing, and relational tasks.
Ciravegna, G., Giannini, F., Barbiero, P., Gori, M., Lio, P., Maggini, M., et al. (2023). Learning Logic Explanations by Neural Networks. In P. Hitzler, Md K. Sarker, A. Eberhart (a cura di), Compendium of Neurosymbolic Artificial Intelligence (pp. 547-558). IOS Press [10.3233/FAIA230157].
Learning Logic Explanations by Neural Networks
Giannini, Francesco;Gori, Marco;Maggini, Marco;Melacci, Stefano
2023-01-01
Abstract
The opaqueness of deep neural networks hinders their employment in safety-critical applications. This issue is driving the research community to focus on eXplainable Artificial Intelligence (XAI) techniques. XAI algorithms can be categorized into two types: those that explain the predictions of black-box models and those that create interpretable models from the start. While interpretable models foster user trust, their performance is normally inferior to traditional black-box models like neural networks. To fill this gap, this chapter presents an extensive framework introducing special neural networks known as Logic Explained Networks (LENs). The most notable advantage of this approach is that LENs achieve performances that are comparable to the state-of-the-art neural networks while providing concise First-Order Logic explanations. Moreover, LENs are designed to learn various types of explanations, such as local or global, by associating related task functions or directly explaining a single class based on the others and/or the input data. They can offer explanations for their behaviour or that of other black-box models. Finally, LENs can be applied in many learning scenarios: they can be trained using supervised or unsupervised learning and applied across different domains like tabular data, computer vision, natural language processing, and relational tasks.File | Dimensione | Formato | |
---|---|---|---|
nesy_chapter_2023_accepted.pdf
non disponibili
Tipologia:
Post-print
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
269.85 kB
Formato
Adobe PDF
|
269.85 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
melacci_COMPENDIUMNESY2023.pdf
non disponibili
Tipologia:
PDF editoriale
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
443.34 kB
Formato
Adobe PDF
|
443.34 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11365/1245614