Continual learning for unsupervised concept bottleneck discovery


In the context of continual learning, little attention has been devoted to the problem of developing a layer of “concepts”, also known as a “concept bottleneck”, to support the discrimination of higher-level task information, especially when concepts are not supervised. Concept bottleneck discovery in an unsupervised setting is thus largely unexplored, and this paper aims to take a step forward in that direction. We consider a neural network that faces a stream of binary tasks, with no further information on the relationships among them, i.e., no supervision at the level of concepts. The learning of the concept bottleneck layer is driven by a triplet-based criterion, which is instantiated in conjunction with a specifically designed form of experience replay (concept replay). This novel criterion exploits fuzzy Hamming distances to treat vectors of concept probabilities as fuzzy bitstrings, encouraging different concept activations across different tasks, while also adding a regularization effect that pushes probabilities towards crisp values. Despite the lack of concept supervision, we found that continually learning the streamed tasks in a progressive manner yields inner concepts that are significantly better correlated with the higher-level tasks than those obtained with joint offline learning. This result is demonstrated through extensive experiments involving different architectures and newly created (and shared) datasets that are also well suited to support further investigation of continual learning in concept-based models.
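To make the idea of a fuzzy Hamming distance inside a triplet criterion more concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not give the exact formula, so the per-position fuzzy XOR `p*(1-q) + (1-p)*q`, the hinge form of the triplet loss, the `margin` parameter, and the function names `fuzzy_hamming` and `triplet_loss` are all assumptions chosen for illustration.

```python
from typing import Sequence


def fuzzy_hamming(p: Sequence[float], q: Sequence[float]) -> float:
    """Fuzzy Hamming distance between two vectors of concept probabilities.

    ASSUMPTION: each position contributes the fuzzy XOR p*(1-q) + (1-p)*q,
    i.e. the expected bit disagreement if the entries were independent
    Bernoulli variables. The paper may use a different formulation.
    """
    if len(p) != len(q):
        raise ValueError("vectors must have the same length")
    return sum(a * (1.0 - b) + (1.0 - a) * b for a, b in zip(p, q))


def triplet_loss(
    anchor: Sequence[float],
    positive: Sequence[float],
    negative: Sequence[float],
    margin: float = 1.0,  # hypothetical default, not taken from the paper
) -> float:
    """Hinge-style triplet loss built on the fuzzy distance above.

    The anchor is pulled towards the positive (e.g. same task) and pushed
    away from the negative (e.g. different task) by at least `margin`.
    """
    d_pos = fuzzy_hamming(anchor, positive)
    d_neg = fuzzy_hamming(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)
```

Under this assumed distance, a vector is at distance zero from itself only when all of its entries are exactly 0 or 1, so minimizing the anchor-positive term also penalizes uncertain probabilities. That is one plausible way to obtain the push towards crisp values that the abstract mentions.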

Lorello, L.S., Lippi, M., Melacci, S. (2024). Continual learning for unsupervised concept bottleneck discovery. In Proceedings of Machine Learning Research (pp. 597-619). San Diego, CA: ML Research Press.

Lorello, Luca Salvatore; Lippi, Marco; Melacci, Stefano

2024-01-01

	Year
	
				2024
			
	Appears in types:
	
				4.1 Contribution in conference proceedings

Files in this item:

File	Size	Format
lorello25a.pdf (open access; type: publisher's PDF; license: PUBLIC - Public with Copyright)	1.03 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11365/1315904