The classic computational scheme of convolutional layers leverages filter banks that are shared over all the spatial coordinates of the input, independently on external information on what is specifically under observation and without any distinctions between what is closer to the observed area and what is peripheral. In this paper we propose to go beyond such a scheme, introducing the notion of Foveated Convolutional Layer (FCL), that formalizes the idea of location-dependent convolutions with foveated processing, i.e., fine-grained processing in a given-focused area and coarser processing in the peripheral regions. We show how the idea of foveated computations can be exploited not only as a filtering mechanism, but also as a mean to speed-up inference with respect to classic convolutional layers, allowing the user to select the appropriate trade-off between level of detail and computational burden. FCLs can be stacked into neural architectures and we evaluate them in several tasks, showing how they efficiently handle the information in the peripheral regions, eventually avoiding the development of misleading biases. When integrated with a model of human attention, FCL-based networks naturally implement a foveated visual system that guides the attention toward the locations of interest, as we experimentally analyze on a stream of visual stimuli.

Tiezzi, M., Marullo, S., Betti, A., Meloni, E., Faggi, L., Gori, M., et al. (2023). Foveated Neural Computation. In Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022 (pp.19-35). Cham : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-26409-2_2].

Foveated Neural Computation

Tiezzi M.
;
Gori M.;Melacci S.
2023-01-01

Abstract

The classic computational scheme of convolutional layers leverages filter banks that are shared over all the spatial coordinates of the input, independently on external information on what is specifically under observation and without any distinctions between what is closer to the observed area and what is peripheral. In this paper we propose to go beyond such a scheme, introducing the notion of Foveated Convolutional Layer (FCL), that formalizes the idea of location-dependent convolutions with foveated processing, i.e., fine-grained processing in a given-focused area and coarser processing in the peripheral regions. We show how the idea of foveated computations can be exploited not only as a filtering mechanism, but also as a mean to speed-up inference with respect to classic convolutional layers, allowing the user to select the appropriate trade-off between level of detail and computational burden. FCLs can be stacked into neural architectures and we evaluate them in several tasks, showing how they efficiently handle the information in the peripheral regions, eventually avoiding the development of misleading biases. When integrated with a model of human attention, FCL-based networks naturally implement a foveated visual system that guides the attention toward the locations of interest, as we experimentally analyze on a stream of visual stimuli.
2023
978-3-031-26408-5
978-3-031-26409-2
Tiezzi, M., Marullo, S., Betti, A., Meloni, E., Faggi, L., Gori, M., et al. (2023). Foveated Neural Computation. In Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022 (pp.19-35). Cham : Springer Science and Business Media Deutschland GmbH [10.1007/978-3-031-26409-2_2].
File in questo prodotto:
File Dimensione Formato  
sub_620.pdf

Open Access dal 17/03/2024

Tipologia: Post-print
Licenza: PUBBLICO - Pubblico con Copyright
Dimensione 1.55 MB
Formato Adobe PDF
1.55 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1231234