Protein–protein interfaces: A Graph Neural Network approach

IRIS

Protein–protein interactions (PPIs) are fundamental processes governing cellular functions, crucial for understanding biological systems at the molecular level. Compared to experimental methods for PPI prediction and site identification, computational deep learning approaches represent an affordable and efficient solution to tackle these problems. Since protein structure can be summarized as a graph, graph neural networks (GNNs) represent the ideal deep learning architecture for the task. In this work, PPI prediction is modeled as a node-focused binary classification task using a GNN to determine whether a generic residue is part of the interface. Biological data were obtained from the Protein Data Bank in Europe (PDBe), leveraging the Protein Interfaces, Surfaces, and Assemblies (PISA) service. To gain a deeper understanding of how proteins interact, the data obtained from PISA were assembled into three datasets: Whole, Interface, and Chain, consisting of data on the whole protein, couples of interacting chains, and single chains, respectively. These three datasets correspond to three different nuances of the problem: identifying interfaces between protein complexes, between chains of the same protein, and interface regions in general. The results indicate that GNNs are capable of solving each of the three tasks with very good performance levels.

Pancino, N., Gallegati, C., Romagnoli, F., Bongini, P., Bianchini, M. (2024). Protein–protein interfaces: A Graph Neural Network approach. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 25(11) [10.3390/ijms25115870].

Protein–protein interfaces: A Graph Neural Network approach

Pancino, Niccolo;Gallegati, Caterina;Romagnoli, Fiamma;Bongini, Pietro;Bianchini, Monica

2024-01-01

Abstract

Protein–protein interactions (PPIs) are fundamental processes governing cellular functions, crucial for understanding biological systems at the molecular level. Compared to experimental methods for PPI prediction and site identification, computational deep learning approaches represent an affordable and efficient solution to tackle these problems. Since protein structure can be summarized as a graph, graph neural networks (GNNs) represent the ideal deep learning architecture for the task. In this work, PPI prediction is modeled as a node-focused binary classification task using a GNN to determine whether a generic residue is part of the interface. Biological data were obtained from the Protein Data Bank in Europe (PDBe), leveraging the Protein Interfaces, Surfaces, and Assemblies (PISA) service. To gain a deeper understanding of how proteins interact, the data obtained from PISA were assembled into three datasets: Whole, Interface, and Chain, consisting of data on the whole protein, couples of interacting chains, and single chains, respectively. These three datasets correspond to three different nuances of the problem: identifying interfaces between protein complexes, between chains of the same protein, and interface regions in general. The results indicate that GNNs are capable of solving each of the three tasks with very good performance levels.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Rivista su cui è pubblicata l'opera
	
				INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES
			
	Citazione
	
				Pancino, N., Gallegati, C., Romagnoli, F., Bongini, P., Bianchini, M. (2024). Protein–protein interfaces: A Graph Neural Network approach. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 25(11) [10.3390/ijms25115870].
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
ijms-25-05870-with-cover.pdf accesso aperto Tipologia: PDF editoriale Licenza: Creative commons Dimensione 2.93 MB Formato Adobe PDF Visualizza/Apri	2.93 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1261854