Issue December 2010

category image Volume 28
No. 3 (289-441)
December 2010
ISSN 0739-1102

SMpred: A Support Vector Machine Approach to Identify Structural Motifs in Protein Structure Without Using Evolutionary Information (405-414)

Knowledge of three dimensional structure is essential to understand the function of a protein. Although the overall fold is made from the whole details of its sequence, a small group of residues, often called as structural motifs, play a crucial role in determining the protein fold and its stability. Identification of such structural motifs requires sufficient number of sequence and structural homologs to define conservation and evolutionary information. Unfortunately, there are many structures in the protein structure databases have no homologous structures or sequences. In this work, we report an SVM method, SMpred, to identify structural motifs from single protein structure without using sequence and structural homologs. SMpred method was trained and tested using 132 proteins domains containing 581 motifs. SMpred method achieved 78.79% accuracy with 79.06% sensitivity and 78.53% specificity. The performance of SMpred was evaluated with MegaMotifBase using 188 proteins containing 1161 motifs. Out of 1161 motifs, SMpred correctly identified 1503 structural motifs reported in MegaMotifBase. Further, we showed that SMpred is useful approach for the length deviant superfamilies and single member superfamilies. This result suggests the usefulness of our approach for facilitating the identification of structural motifs in protein structure in the absence of sequence and structural homologs. The dataset and executable for the SMpred algorithm is available at http://www3.ntu.edu.sg/home/EPNSugan/index_files/SMpred.htm.

Ganesan Pugalenthi1
Krishna Kumar Kandaswamy2,3
P. N. Suganthan4,*
R. Sowdhamini5
Thomas Martinetz2
Prasanna R. Kolatkar1,*

1Laboratory of Structural Biochemistry, Genome Institute of Singapore, 60 Biopolis Street, Singapore 138672
2Institute for Neuro- and Bioinformatics, University of Lübeck, 23538 Lübeck, Germany
3Graduate School for Computing in Medicine and Life Sciences, University of Lübeck, 23538 Lübeck, Germany
4School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, 639798
5National Centre for Biological Sciences, UAS-GKVK campus, Bellary Road, Bangalore 560 065, India

EPNSugan@ntu.edu.sg


Download Full Text Article in PDF

Purchase Downloadable Full Text PDF of Article

Corporate User

$100.00

University/Academic User

$50.00

Subscription is more cost effective than purchasing PDFs on-the-fly.  Click here for details.