Randomized Smoothing (RS) has been proposed as a method to develop deep learning classifiers with certified robustness, i.e., for which a certain level of robustness can be theoretically guaranteed. In this paper, we explore the application of the RS technique in the context of multimedia forensics, focusing on the prominent task of synthetic image detection. Our experiments, carried out on the task of detection of images generated by StyleGAN2 and Latent Diffusion models, reveal that the input pre-processing, the input size and in particular the network architecture have a noticeable impact on the certification performance. In particular, we achieved the best performance with EfficientNetB4, while we found that the certification achieved by detectors based on general-purpose features, namely CLIP, is poor. We also evaluated the performance of the RS synthetic image detectors against common image post-processing, showing that they exhibit strong robustness against a wide variety of processing, even when the distortion introduced by the processing exceeds the one the detectors can provably withstand.
Zeng, K., Barni, M., Tondi, B. (2025). Certifiably Robust Synthetic Image Detectors. In European Signal Processing Conference (pp.815-819). European Signal Processing Conference, EUSIPCO [10.23919/eusipco63237.2025.11226152].
Certifiably Robust Synthetic Image Detectors
Zeng, Kai;Barni, Mauro;Tondi, Benedetta
2025-01-01
Abstract
Randomized Smoothing (RS) has been proposed as a method to develop deep learning classifiers with certified robustness, i.e., for which a certain level of robustness can be theoretically guaranteed. In this paper, we explore the application of the RS technique in the context of multimedia forensics, focusing on the prominent task of synthetic image detection. Our experiments, carried out on the task of detection of images generated by StyleGAN2 and Latent Diffusion models, reveal that the input pre-processing, the input size and in particular the network architecture have a noticeable impact on the certification performance. In particular, we achieved the best performance with EfficientNetB4, while we found that the certification achieved by detectors based on general-purpose features, namely CLIP, is poor. We also evaluated the performance of the RS synthetic image detectors against common image post-processing, showing that they exhibit strong robustness against a wide variety of processing, even when the distortion introduced by the processing exceeds the one the detectors can provably withstand.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11365/1315942
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo
