Attacking RAG Systems in Multiple Domains with Locally Running and Automatic Procedures

IRIS

Nowadays, services exploiting Large Language Models (LLMs) frequently customize the model responses in function of their specific requirements, thanks to Retrieval-Augmented Generation (RAG). While this is a simple and effective solution, it has been recently shown to suffer from serious LLM-related security issues that, in some circumstances, can be exploited to access the RAG internal knowledge base, possibly containing private/sensitive data. This paper (i) compares different existing recent attacks to RAG systems in three real-world scenarios, with the goal of in-depth evaluating them. Moreover, novel attacks are proposed, in order to study distinct open directions in this field of study: (ii) the first one is based on free-online-API variants of recent black-box attacks, which can be run on a domestic machine, and a fully black-box reformulation of a gray-box method, in both cases simulating more realistic conditions built with open-source tools; (iii) the second one is aimed at building automatic procedures to get the most out of the knowledge-base of a RAG system, introducing a strategy that allows a recent memory-based approach to become automatic. A large experimental comparison is included that, to our best knowledge, is the most extended one in the current scientific literature. Results highlight the sensitivity of the attack procedure to the basic tools over which they are built, but also shows that open-source solutions can be easily exploited to setup attacks. Moreover, the actually feasibility of designing an automatic procedure is proved. In both the cases, this paper raises an important warning on the need of taking specific precautions to setup robust RAG systems.

Di Maio, C., Melacci, S. (2025). Attacking RAG Systems in Multiple Domains with Locally Running and Automatic Procedures. In 2025 International Joint Conference on Neural Networks (IJCNN) (pp.1-8). New York : Institute of Electrical and Electronics Engineers Inc. [10.1109/ijcnn64981.2025.11228722].

Attacking RAG Systems in Multiple Domains with Locally Running and Automatic Procedures

Di Maio, Christian;Melacci, Stefano

2025-01-01

Abstract

Nowadays, services exploiting Large Language Models (LLMs) frequently customize the model responses in function of their specific requirements, thanks to Retrieval-Augmented Generation (RAG). While this is a simple and effective solution, it has been recently shown to suffer from serious LLM-related security issues that, in some circumstances, can be exploited to access the RAG internal knowledge base, possibly containing private/sensitive data. This paper (i) compares different existing recent attacks to RAG systems in three real-world scenarios, with the goal of in-depth evaluating them. Moreover, novel attacks are proposed, in order to study distinct open directions in this field of study: (ii) the first one is based on free-online-API variants of recent black-box attacks, which can be run on a domestic machine, and a fully black-box reformulation of a gray-box method, in both cases simulating more realistic conditions built with open-source tools; (iii) the second one is aimed at building automatic procedures to get the most out of the knowledge-base of a RAG system, introducing a strategy that allows a recent memory-based approach to become automatic. A large experimental comparison is included that, to our best knowledge, is the most extended one in the current scientific literature. Results highlight the sensitivity of the attack procedure to the basic tools over which they are built, but also shows that open-source solutions can be easily exploited to setup attacks. Moreover, the actually feasibility of designing an automatic procedure is proved. In both the cases, this paper raises an important warning on the need of taking specific precautions to setup robust RAG systems.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Codice ISBN
	
				979-8-3315-1042-8
			
	Citazione
	
				Di Maio, C., Melacci, S. (2025). Attacking RAG Systems in Multiple Domains with Locally Running and Automatic Procedures. In 2025 International Joint Conference on Neural Networks (IJCNN) (pp.1-8). New York : Institute of Electrical and Electronics Engineers Inc. [10.1109/ijcnn64981.2025.11228722].
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
melacci_IJCNN2025c.pdf non disponiibile Tipologia: PDF editoriale Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 1.22 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.22 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1315896