Semiotic-Based Construction of a Large Emotional Image Dataset with Neutral Samples

Image Visual Sentiment Analysis (VSA) requires large annotated datasets, whose construction presents many challenges. The need to gather a large number of labeled images is at odds both with the rigorous but lengthy process of manual annotation based on psychovisual experiments, and with the automatic gathering of large amounts of data that are only roughly labeled through sentiment analysis of the text accompanying the images, such as captions, tweets and tags. An additional limitation is the scarcity of high-quality datasets with a neutral class, which forces images to be classified into emotions even when observers show no emotional activation. In this work, we present a scalable methodology rooted in semiotics and art theory for the construction of a 3-class (positive, negative and neutral) VSA dataset, which allows a desired quantity of images to be downloaded while maintaining labeling coherence and accuracy. Based on the proposed methodology, we introduce and make publicly available a VSA dataset of over 100,000 images. To validate the quality of the dataset, we used it to train several classifiers and compared their performance with that of classifiers trained on other datasets. The results show that the classifiers trained on the new dataset perform better when tested on independent datasets, including those commonly used for psychovisual experiments.

Blanchini, M., Dimitri, G., Abady, L., Tondi, B., Lancioni, T., Barni, M. (2025). Semiotic-Based Construction of a Large Emotional Image Dataset with Neutral Samples. In 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (pp.7552-7561) [10.1109/wacv61041.2025.00734].

Blanchini, Marco; Dimitri, Giovanna; Abady, Lydia; Tondi, Benedetta; Lancioni, Tarcisio; Barni, Mauro

2025-01-01

Year: 2025

Appears in categories: 4.1 Contribution in conference proceedings

Files in this item:

File	Size	Format
ArticoloWACV2025BlanchiniDimitri.pdf (open access; type: Post-print; license: Public with Copyright)	1.54 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11365/1290837