An Overview of Backdoor Attacks Against Deep Neural Networks and Possible Defences

Guo, W.; Tondi, B.; Barni, M.
2022-01-01

Abstract

Together with impressive advances touching every aspect of our society, AI technology based on Deep Neural Networks (DNN) is bringing increasing security concerns. While attacks operating at test time have monopolised the initial attention of researchers, backdoor attacks, which exploit the possibility of corrupting DNN models by interfering with the training process, represent a further serious threat undermining the dependability of AI techniques. In backdoor attacks, the attacker corrupts the training data to induce an erroneous behaviour at test time. Test-time errors, however, are activated only in the presence of a triggering event. In this way, the corrupted network continues to work as expected for regular inputs, and the malicious behaviour occurs only when the attacker decides to activate the backdoor hidden within the network. Recently, backdoor attacks have been the subject of intense research, focusing both on the development of new classes of attacks and on the proposal of possible countermeasures. The goal of this overview is to review the works published to date, classifying the different types of attacks and defences proposed so far. The classification guiding the analysis is based on the amount of control that the attacker has on the training process, and on the capability of the defender to verify the integrity of the data used for training and to monitor the operations of the DNN at training and test time. Hence, the proposed analysis is suited to highlight the strengths and weaknesses of both attacks and defences with reference to the application scenarios in which they operate.
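
To make the poisoning mechanism described in the abstract concrete, the minimal sketch below assumes an image-classification setting and a BadNets-style pixel-patch trigger; the function names, the patch trigger, and the poisoning rate are illustrative assumptions, not the specific constructions analysed in the paper. It shows how an attacker could corrupt a fraction of the training set so that the same trigger, presented at test time, activates the hidden behaviour.

import numpy as np

def poison_dataset(images, labels, target_label, poison_rate=0.1, patch_size=3):
    # images: float array of shape (N, H, W, C) with values in [0, 1]
    # labels: int array of shape (N,)
    images = images.copy()
    labels = labels.copy()
    n_poison = int(poison_rate * len(images))
    idx = np.random.choice(len(images), size=n_poison, replace=False)
    for i in idx:
        images[i, -patch_size:, -patch_size:, :] = 1.0  # stamp a white patch (the trigger) in the corner
        labels[i] = target_label                        # relabel with the attacker-chosen target class
    return images, labels, idx

def apply_trigger(image, patch_size=3):
    # At test time, stamping the same patch on any input activates the backdoor.
    image = image.copy()
    image[-patch_size:, -patch_size:, :] = 1.0
    return image

A model trained on the poisoned set is expected to classify clean inputs correctly, while any test input stamped with the same patch is mapped to the attacker's target class, which is the stealthy behaviour the abstract refers to.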
Guo, W., Tondi, B., Barni, M. (2022). An Overview of Backdoor Attacks Against Deep Neural Networks and Possible Defences. IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 3, 261-287 [10.1109/OJSP.2022.3190213].
File in this record:
An_Overview_of_Backdoor_Attacks_Against_Deep_Neural_Networks_and_Possible_Defences.pdf
Access: open access
Type: publisher's PDF
Licence: Creative Commons
Size: 3.87 MB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11365/1218756