Foveated Neural Computation

IRIS

The classic computational scheme of convolutional layers leverages filter banks that are shared over all the spatial coordinates of the input, independently of external information on what is specifically under observation and without any distinction between what is close to the observed area and what is peripheral. In this paper we propose to go beyond such a scheme, introducing the notion of Foveated Convolutional Layer (FCL), which formalizes the idea of location-dependent convolutions with foveated processing, i.e., fine-grained processing in a given focused area and coarser processing in the peripheral regions. We show how the idea of foveated computations can be exploited not only as a filtering mechanism, but also as a means to speed up inference with respect to classic convolutional layers, allowing the user to select the appropriate trade-off between level of detail and computational burden. FCLs can be stacked into neural architectures, and we evaluate them in several tasks, showing how they efficiently handle the information in the peripheral regions, eventually avoiding the development of misleading biases. When integrated with a model of human attention, FCL-based networks naturally implement a foveated visual system that guides the attention toward the locations of interest, as we experimentally analyze on a stream of visual stimuli.
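
The general idea of fine-grained processing near a focus point and coarser, cheaper processing in the periphery can be illustrated with a minimal NumPy sketch. This is not the FCL formulation from the paper, which this record does not detail: the function names (`conv2d_same`, `foveated_filter`), the circular focus region, and the choice of block averaging as the "coarse" path are all assumptions made for illustration only.

```python
import numpy as np


def conv2d_same(image, kernel):
    """Naive 2D cross-correlation with zero padding (odd-sized kernels only)."""
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    h, w = image.shape
    padded = np.pad(image, ((ph, ph), (pw, pw)))
    out = np.zeros((h, w), dtype=float)
    for dy in range(kh):
        for dx in range(kw):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def foveated_filter(image, focus, radius, kernel, coarse_factor=4):
    """Hypothetical foveated filtering of a 2D grayscale image.

    Pixels within `radius` of `focus` (row, col) get the full-resolution
    response; all other pixels get a response computed on a downsampled copy
    of the image, so the peripheral convolution touches fewer pixels.
    """
    h, w = image.shape
    s = coarse_factor
    assert h % s == 0 and w % s == 0, "sketch assumes sizes divisible by s"

    # Boolean mask of the focused (foveal) area.
    ys, xs = np.mgrid[0:h, 0:w]
    in_fovea = np.hypot(ys - focus[0], xs - focus[1]) <= radius

    # Fine path: full-resolution filtering. For simplicity it is computed
    # everywhere here; an efficient version would restrict it to the fovea.
    fine = conv2d_same(image, kernel)

    # Coarse path: average s x s blocks, filter the small image, then
    # upsample by nearest-neighbour repetition.
    small = image.reshape(h // s, s, w // s, s).mean(axis=(1, 3))
    coarse_small = conv2d_same(small, kernel)
    coarse = np.repeat(np.repeat(coarse_small, s, axis=0), s, axis=1)

    return np.where(in_fovea, fine, coarse)
```

In this sketch, `radius` and `coarse_factor` play the role of the user-selected trade-off between level of detail and computational burden: a larger `coarse_factor` shrinks the image that the peripheral path has to filter, at the cost of spatial detail away from the focus.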

Tiezzi, M., Marullo, S., Betti, A., Meloni, E., Faggi, L., Gori, M., et al. (2023). Foveated Neural Computation. In Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022 (pp.19-35). Cham : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-26409-2_2].

Tiezzi M.; Marullo S.; Betti A.; Meloni E.; Faggi L.; Gori M.; Melacci S.

2023-01-01

	Year: 2023
	ISBN: 978-3-031-26408-5; 978-3-031-26409-2
	Appears in categories: 4.1 Contribution in conference proceedings

Files in this item:

File	Size	Format
sub_620.pdf (Open Access from 17 March 2024; Type: Post-print; License: PUBLIC - Public with Copyright)	1.55 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11365/1231234