Castelli, I., Trentin, E. (2012). Supervised and Unsupervised Co-Training of Adaptive Activation Functions in Neural Nets. In Partially Supervised Learning: First IAPR TC3 Workshop, PSL 2011, Ulm, Germany (pp. 52-61). Springer [10.1007/978-3-642-28258-4_6].
Supervised and Unsupervised Co-Training of Adaptive Activation Functions in Neural Nets
Trentin E.
2012-01-01
Abstract
In spite of the nice theoretical properties of mixtures of logistic activation functions, standard feedforward neural networks with limited resources and gradient-descent optimization of the connection weights may fail in practice on several difficult learning tasks. Such tasks would be better faced by relying on a more appropriate, problem-specific basis of activation functions. The paper introduces a connectionist model that features adaptive activation functions. Each hidden unit in the network is associated with a specific pair (f(·), p(·)), where f(·) (the activation function itself) is modeled via a specialized neural network, and p(·) is a probabilistic measure of the likelihood that the unit is relevant to the computation of the output over the current input. While f(·) is optimized in a supervised manner (through a novel scheme for backpropagating the target outputs, which does not suffer from the "vanishing gradient" phenomenon that affects standard backpropagation), p(·) is realized via a statistical parametric model learned through unsupervised estimation. The overall machine is implicitly a co-trained coupled model, where the topology chosen for learning each f(·) may vary on a unit-by-unit basis, resulting in a highly non-standard neural architecture. © 2012 Springer-Verlag.
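For concreteness, the following is a minimal PyTorch sketch of the architecture the abstract describes: each hidden unit i couples an adaptive activation f_i (modeled here, by assumption, as a tiny one-input sub-network) with a relevance measure p_i (here a diagonal Gaussian density over the input, fitted by unsupervised maximum likelihood). All class and parameter names are illustrative assumptions, and the paper's actual supervised training scheme (backpropagation of target outputs for each f_i) is not reproduced here.

```python
# Sketch, not the authors' implementation: unit i outputs p_i(x) * f_i(w_i . x).
import math
import torch
import torch.nn as nn

class AdaptiveActivation(nn.Module):
    """f_i: a one-input, one-output 'specialized' sub-network (assumed topology)."""
    def __init__(self, width: int = 8):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(1, width), nn.Tanh(), nn.Linear(width, 1))

    def forward(self, a: torch.Tensor) -> torch.Tensor:
        # a: (batch,) pre-activations -> (batch,) adaptive activations
        return self.net(a.unsqueeze(-1)).squeeze(-1)

class GaussianRelevance(nn.Module):
    """p_i: diagonal-Gaussian likelihood of the input (the unsupervised part)."""
    def __init__(self, in_dim: int):
        super().__init__()
        self.register_buffer("mean", torch.zeros(in_dim))
        self.register_buffer("log_var", torch.zeros(in_dim))

    @torch.no_grad()
    def fit(self, x: torch.Tensor) -> None:
        # Maximum-likelihood estimates from unlabeled inputs.
        self.mean.copy_(x.mean(dim=0))
        self.log_var.copy_(x.var(dim=0, unbiased=False).clamp_min(1e-6).log())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        var = self.log_var.exp()
        log_p = -0.5 * (((x - self.mean) ** 2 / var)
                        + self.log_var + math.log(2 * math.pi)).sum(dim=-1)
        return log_p.exp()  # (batch,) relevance of this unit to each input

class CoTrainedLayer(nn.Module):
    """Hidden layer whose unit i computes p_i(x) * f_i(w_i . x)."""
    def __init__(self, in_dim: int, n_units: int):
        super().__init__()
        self.linear = nn.Linear(in_dim, n_units)
        self.acts = nn.ModuleList([AdaptiveActivation() for _ in range(n_units)])
        self.rels = nn.ModuleList([GaussianRelevance(in_dim) for _ in range(n_units)])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        a = self.linear(x)  # (batch, n_units) pre-activations
        cols = [r(x) * f(a[:, i])
                for i, (f, r) in enumerate(zip(self.acts, self.rels))]
        return torch.stack(cols, dim=-1)

# Toy usage: fit the relevance models on unlabeled inputs, then run forward.
x = torch.randn(32, 4)
layer = CoTrainedLayer(in_dim=4, n_units=3)
for r in layer.rels:
    r.fit(x)
h = layer(x)  # (32, 3) relevance-weighted adaptive activations
```

Note that each AdaptiveActivation instance could be given a different width or depth, reflecting the abstract's point that the topology chosen for learning each f(·) may vary on a unit-by-unit basis.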
https://hdl.handle.net/11365/22235