Molecule generation has experienced multiple breakthroughs in recent years. While traditional techniques are very reliable they have low space exploration potential; therefore, machine learning techniques are needed to delve deep into the vast space of potential compounds, helping to find new ways of designing candidate molecules. This paper presents a sequential Markovian model based on graph neural networks for molecular generation. The model employs a Breadth–First Search (BFS) ordering strategy and a modular architecture to enhance independence between functions, with a focus on maintaining strict independence at each step of the Markovian process. Experimental results on ZINC (MOSES), ZINC (250 K) and Polymers datasets demonstrate the model’s ability to perform both unconditional and conditional molecule generation while preserving dataset properties. Notably, the step–wise approach achieves state–of–the–art results in terms of uniqueness and validity without using valency masks.

Goupil, B., Joly, A., Pancino, N., Bongini, P., Scarselli, F., Bianchini, M. (2025). MOLGMP: A Markov approach for molecular graph generation with GNNs. NEUROCOMPUTING, 652(1) [10.1016/j.neucom.2025.131066].

MOLGMP: A Markov approach for molecular graph generation with GNNs

Pancino, Niccoló;Bongini, Pietro
;
Scarselli, Franco;Bianchini, Monica
2025-01-01

Abstract

Molecule generation has experienced multiple breakthroughs in recent years. While traditional techniques are very reliable they have low space exploration potential; therefore, machine learning techniques are needed to delve deep into the vast space of potential compounds, helping to find new ways of designing candidate molecules. This paper presents a sequential Markovian model based on graph neural networks for molecular generation. The model employs a Breadth–First Search (BFS) ordering strategy and a modular architecture to enhance independence between functions, with a focus on maintaining strict independence at each step of the Markovian process. Experimental results on ZINC (MOSES), ZINC (250 K) and Polymers datasets demonstrate the model’s ability to perform both unconditional and conditional molecule generation while preserving dataset properties. Notably, the step–wise approach achieves state–of–the–art results in terms of uniqueness and validity without using valency masks.
2025
Goupil, B., Joly, A., Pancino, N., Bongini, P., Scarselli, F., Bianchini, M. (2025). MOLGMP: A Markov approach for molecular graph generation with GNNs. NEUROCOMPUTING, 652(1) [10.1016/j.neucom.2025.131066].
File in questo prodotto:
File Dimensione Formato  
MOLGMP.pdf

accesso aperto

Tipologia: PDF editoriale
Licenza: Creative commons
Dimensione 2.42 MB
Formato Adobe PDF
2.42 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1302215