Attacking and Defending Printer Source Attribution Classifiers in the Physical Domain

IRIS

The security of machine learning classifiers has received increasing attention in the last years. In forensic applications, guaranteeing the security of the tools investigators rely on is crucial, since the gathered evidence may be used to decide about the innocence or the guilt of a suspect. Several adversarial attacks were proposed to assess such security, with a few works focusing on transferring such attacks from the digital to the physical domain. In this work, we focus on physical domain attacks against source attribution of printed documents. We first show how a simple reprinting attack may be sufficient to fool a model trained on images that were printed and scanned only once. Then, we propose a hardened version of the classifier trained on the reprinted attacked images. Finally, we attack the hardened classifier with several attacks, including a new attack based on the Expectation Over Transformation approach, which finds the adversarial perturbations by simulating the physical transformations occurring when the image attacked in the digital domain is printed again. The results we got demonstrate a good capability of the hardened classifier to resist attacks carried out in the physical domain

Ferreira, A., Barni, M. (2022). Attacking and Defending Printer Source Attribution Classifiers in the Physical Domain. In MMFORWILD 2022 [10.5281/zenodo.6899743].

Attacking and Defending Printer Source Attribution Classifiers in the Physical Domain

Ferreira, Anselmo;Barni, Mauro

2022-01-01

Abstract

The security of machine learning classifiers has received increasing attention in the last years. In forensic applications, guaranteeing the security of the tools investigators rely on is crucial, since the gathered evidence may be used to decide about the innocence or the guilt of a suspect. Several adversarial attacks were proposed to assess such security, with a few works focusing on transferring such attacks from the digital to the physical domain. In this work, we focus on physical domain attacks against source attribution of printed documents. We first show how a simple reprinting attack may be sufficient to fool a model trained on images that were printed and scanned only once. Then, we propose a hardened version of the classifier trained on the reprinted attacked images. Finally, we attack the hardened classifier with several attacks, including a new attack based on the Expectation Over Transformation approach, which finds the adversarial perturbations by simulating the physical transformations occurring when the image attacked in the digital domain is printed again. The results we got demonstrate a good capability of the hardened classifier to resist attacks carried out in the physical domain

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Citazione
	
				Ferreira, A., Barni, M. (2022). Attacking and Defending Printer Source Attribution Classifiers in the Physical Domain. In MMFORWILD 2022 [10.5281/zenodo.6899743].
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Paper_MMFORWILD_2022.pdf accesso aperto Tipologia: Pre-print Licenza: PUBBLICO - Pubblico con Copyright Dimensione 289.74 kB Formato Adobe PDF Visualizza/Apri	289.74 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1217154