Protein structure mining using a structural alphabet. - Inserm - Institut national de la santé et de la recherche médicale Accéder directement au contenu
Article Dans Une Revue Proteins - Structure, Function and Bioinformatics Année : 2007

Protein structure mining using a structural alphabet.

Résumé

We present a comprehensive evaluation of a new structure mining method called PB-ALIGN. It is based on the encoding of protein structure as 1D sequence of a combination of 16 short structural motifs or protein blocks (PBs). PBs are short motifs capable of representing most of the local structural features of a protein backbone. Using derived PB substitution matrix and simple dynamic programming algorithm, PB sequences are aligned the same way amino acid sequences to yield structure alignment. PBs are short motifs capable of representing most of the local structural features of a protein backbone. Alignment of these local features as sequence of symbols enables fast detection of structural similarities between two proteins. Ability of the method to characterize and align regions beyond regular secondary structures, for example, N and C caps of helix and loops connecting regular structures, puts it a step ahead of existing methods, which strongly rely on secondary structure elements. PB-ALIGN achieved efficiency of 85% in extracting true fold from a large database of 7259 SCOP domains and was successful in 82% cases to identify true super-family members. On comparison to 13 existing structure comparison/mining methods, PB-ALIGN emerged as the best on general ability test dataset and was at par with methods like YAKUSA and CE on nontrivial test dataset. Furthermore, the proposed method performed well when compared to flexible structure alignment method like FATCAT and outperforms in processing speed (less than 45 s per database scan). This work also establishes a reliable cut-off value for the demarcation of similar folds. It finally shows that global alignment scores of unrelated structures using PBs follow an extreme value distribution. PB-ALIGN is freely available on web server called Protein Block Expert (PBE) at http://bioinformatics.univ-reunion.fr/PBE/. Proteins 2008. (c) 2007 Wiley-Liss, Inc.
Fichier principal
Vignette du fichier
Tyagi_Proteins_2007.pdf (1.16 Mo) Télécharger le fichier

Dates et versions

inserm-00176443 , version 1 (04-09-2009)

Identifiants

Citer

Manoj Tyagi, Alexandre de Brevern, Narayanaswamy Srinivasan, Bernard Offmann. Protein structure mining using a structural alphabet.: Protein structure mining using a structural alphabet. Proteins - Structure, Function and Bioinformatics, 2007, 71 (2), pp.920-937. ⟨10.1002/prot.21776⟩. ⟨inserm-00176443⟩
222 Consultations
237 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More