We investigate if the random feature selection approach proposed in [1] to improve the robustness of forensic detectors to targeted attacks, can be extended to detectors based on deep learning features. In particular, we study the transferability of adversarial examples targeting an original CNN image manipulation detector to other detectors (a fully connected neural network and a linear SVM) that rely on a random subset of the features extracted from the flatten layer of the original network. The results we got by considering three image manipulation detection tasks (resizing, median filtering and adaptive histogram equalization), two original network architectures and three classes of attacks, show that feature randomization helps to hinder attack transferability, even if, in some cases, simply changing the architecture of the detector, or even retraining the detector is enough to prevent the transferability of the attacks.

Barni, M., Nowroozi, E., Tondi, B., Zhang, B. (2020). Effectiveness of random deep feature selection for securing image manipulation detectors against adversarial examples. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp.2977-2981). New York : IEEE [10.1109/ICASSP40776.2020.9053318].

Effectiveness of random deep feature selection for securing image manipulation detectors against adversarial examples

Barni, M.
;
Nowroozi, E.;Tondi, B.;
2020-01-01

Abstract

We investigate if the random feature selection approach proposed in [1] to improve the robustness of forensic detectors to targeted attacks, can be extended to detectors based on deep learning features. In particular, we study the transferability of adversarial examples targeting an original CNN image manipulation detector to other detectors (a fully connected neural network and a linear SVM) that rely on a random subset of the features extracted from the flatten layer of the original network. The results we got by considering three image manipulation detection tasks (resizing, median filtering and adaptive histogram equalization), two original network architectures and three classes of attacks, show that feature randomization helps to hinder attack transferability, even if, in some cases, simply changing the architecture of the detector, or even retraining the detector is enough to prevent the transferability of the attacks.
2020
978-1-5090-6631-5
978-1-5090-6632-2
Barni, M., Nowroozi, E., Tondi, B., Zhang, B. (2020). Effectiveness of random deep feature selection for securing image manipulation detectors against adversarial examples. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp.2977-2981). New York : IEEE [10.1109/ICASSP40776.2020.9053318].
File in questo prodotto:
File Dimensione Formato  
09053318.pdf

non disponibili

Tipologia: PDF editoriale
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 229.38 kB
Formato Adobe PDF
229.38 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1127175