Unsupervised Learning by Minimal Entropy Encoding

IRIS

Following basic principles of information-theoretic learning, in this paper, we propose a novel approach to data clustering, referred to as minimal entropy encoding (MEE), which is based on a set of functions (features) projecting each input onto a minimum entropy configuration (code). Inspired by traditional parsimony principles, we seek solutions in reproducing kernel Hilbert spaces and then we prove that the encoding functions are expressed in terms of kernel expansion. In order to avoid trivial solutions, the developed features must be as different as possible by means of a soft constraint on the empirical estimation of the entropy associated with the encoding functions. This leads to an unconstrained optimization problem that can be efficiently solved by conjugate gradient. We also investigate an optimization strategy based on concave-convex algorithms. The relationships with maximum margin clustering are studied, showing that MEE overcomes some of its critical issues, such as the lack of a multiclass extension and the need to face problems with a large number of constraints. A massive evaluation on several benchmarks of the proposed approach shows improvements over state-of-the-art techniques, both in terms of accuracy and computational complexity.

Melacci, S., Gori, M. (2012). Unsupervised Learning by Minimal Entropy Encoding. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 23(12), 1849-1861 [10.1109/TNNLS.2012.2216899].

Unsupervised Learning by Minimal Entropy Encoding

MELACCI, STEFANO;GORI, MARCO

2012-01-01

Abstract

Following basic principles of information-theoretic learning, in this paper, we propose a novel approach to data clustering, referred to as minimal entropy encoding (MEE), which is based on a set of functions (features) projecting each input onto a minimum entropy configuration (code). Inspired by traditional parsimony principles, we seek solutions in reproducing kernel Hilbert spaces and then we prove that the encoding functions are expressed in terms of kernel expansion. In order to avoid trivial solutions, the developed features must be as different as possible by means of a soft constraint on the empirical estimation of the entropy associated with the encoding functions. This leads to an unconstrained optimization problem that can be efficiently solved by conjugate gradient. We also investigate an optimization strategy based on concave-convex algorithms. The relationships with maximum margin clustering are studied, showing that MEE overcomes some of its critical issues, such as the lack of a multiclass extension and the need to face problems with a large number of constraints. A massive evaluation on several benchmarks of the proposed approach shows improvements over state-of-the-art techniques, both in terms of accuracy and computational complexity.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
			2012
		
	Rivista su cui è pubblicata l'opera
	
			IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
		
	Citazione
	
			Melacci, S., Gori, M. (2012). Unsupervised Learning by Minimal Entropy Encoding. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 23(12), 1849-1861 [10.1109/TNNLS.2012.2216899].
		
	Appare nelle tipologie:
	
			1.1 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/43940

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo