A deep attention network for predicting amino acid signals in the formation of α-helices

IRIS

The secondary and tertiary structure of a protein has a primary role in determining its function. Even though many folding prediction algorithms have been developed in the past decades — mainly based on the assumption that folding instructions are encoded within the protein sequence — experimental techniques remain the most reliable to establish protein structures. In this paper, we searched for signals related to the formation of α-helices. We carried out a statistical analysis on a large dataset of experimentally characterized secondary structure elements to find over- or under-occurrences of specific amino acids defining the boundaries of helical moieties. To validate our hypothesis, we trained various Machine Learning models, each equipped with an attention mechanism, to predict the occurrence of α-helices. The attention mechanism allows to interpret the model’s decision, weighing the importance the predictor gives to each part of the input. The experimental results show that different models focus on the same subsequences, which can be seen as codes driving the secondary structure formation.

Visibelli, A., Bongini, P., Rossi, A., Niccolai, N., Bianchini, M. (2020). A deep attention network for predicting amino acid signals in the formation of α-helices. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 18(5) [10.1142/S0219720020500286].

A deep attention network for predicting amino acid signals in the formation of α-helices

Visibelli, A.;Bongini, P.;Rossi, A.;Niccolai, N.;Bianchini, M.

2020-01-01

Abstract

The secondary and tertiary structure of a protein has a primary role in determining its function. Even though many folding prediction algorithms have been developed in the past decades — mainly based on the assumption that folding instructions are encoded within the protein sequence — experimental techniques remain the most reliable to establish protein structures. In this paper, we searched for signals related to the formation of α-helices. We carried out a statistical analysis on a large dataset of experimentally characterized secondary structure elements to find over- or under-occurrences of specific amino acids defining the boundaries of helical moieties. To validate our hypothesis, we trained various Machine Learning models, each equipped with an attention mechanism, to predict the occurrence of α-helices. The attention mechanism allows to interpret the model’s decision, weighing the importance the predictor gives to each part of the input. The experimental results show that different models focus on the same subsequences, which can be seen as codes driving the secondary structure formation.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2020
			
	Rivista su cui è pubblicata l'opera
	
				JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY
			
	Citazione
	
				Visibelli, A., Bongini, P., Rossi, A., Niccolai, N., Bianchini, M. (2020). A deep attention network for predicting amino acid signals in the formation of α-helices. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 18(5) [10.1142/S0219720020500286].
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Eliche.pdf non disponibili Tipologia: Post-print Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 862.32 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	862.32 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11365/1117818