Discovering NDM-1 inhibitors using molecular substructure embeddings representations

IRIS

NDM-1 (New-Delhi-Metallo-beta-lactamase-1) is an enzyme developed by bacteria that is implicated in bacteria resistance to almost all known antibiotics. In this study, we deliver a new, curated NDM-1 bioactivities database, along with a set of unifying rules for managing different activity properties and inconsistencies. We define the activity classification problem in terms of Multiple Instance Learning, employing embeddings corresponding to molecular substructures and present an ensemble ranking and classification framework, relaying on a k-fold Cross Validation method employing a per fold hyper-parameter optimization procedure, showing promising generalization ability. The MIL paradigm displayed an improvement up to 45.7 %, in terms of Balanced Accuracy, in comparison to the classical Machine Learning paradigm. Moreover, we investigate different compact molecular representations, based on atomic or bi-atomic substructures. Finally, we scanned the Drugbank for strongly active compounds and we present the top-15 ranked compounds.

Papastergiou, T., Azé, J., Bringay, S., Louet, M., Poncelet, P., Rosales-Hurtado, M., et al. (2023). Discovering NDM-1 inhibitors using molecular substructure embeddings representations. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 20(2), 1-20 [10.1515/jib-2022-0050].

Discovering NDM-1 inhibitors using molecular substructure embeddings representations

Papastergiou, Thomas;Azé, Jérôme;Bringay, Sandra;Louet, Maxime;Poncelet, Pascal;Rosales-Hurtado, Miyanou;Vo-Hoang, Yen;Licznar-Fajardo, Patricia;Docquier, Jean-Denis;Gavara, Laurent

2023-01-01

Abstract

NDM-1 (New-Delhi-Metallo-beta-lactamase-1) is an enzyme developed by bacteria that is implicated in bacteria resistance to almost all known antibiotics. In this study, we deliver a new, curated NDM-1 bioactivities database, along with a set of unifying rules for managing different activity properties and inconsistencies. We define the activity classification problem in terms of Multiple Instance Learning, employing embeddings corresponding to molecular substructures and present an ensemble ranking and classification framework, relaying on a k-fold Cross Validation method employing a per fold hyper-parameter optimization procedure, showing promising generalization ability. The MIL paradigm displayed an improvement up to 45.7 %, in terms of Balanced Accuracy, in comparison to the classical Machine Learning paradigm. Moreover, we investigate different compact molecular representations, based on atomic or bi-atomic substructures. Finally, we scanned the Drugbank for strongly active compounds and we present the top-15 ranked compounds.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Rivista su cui è pubblicata l'opera
	
				JOURNAL OF INTEGRATIVE BIOINFORMATICS
			
	Citazione
	
				Papastergiou, T., Azé, J., Bringay, S., Louet, M., Poncelet, P., Rosales-Hurtado, M., et al. (2023). Discovering NDM-1 inhibitors using molecular substructure embeddings representations. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 20(2), 1-20 [10.1515/jib-2022-0050].
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
10.1515_jib-2022-0050.pdf accesso aperto Descrizione: Articolo Tipologia: PDF editoriale Licenza: Creative commons Dimensione 3.5 MB Formato Adobe PDF Visualizza/Apri	3.5 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1244794