Issue August 2007No. 1 (p 1-118) August 2007 ISSN 0739-110 Predicting Protein N-glycosylation by Combining Functional Domain and Secretion Information (p. 49-54)Protein N-glycosylation plays an important role in protein function. Yet, at present, few computational methods are available for the prediction of this protein modification. This prompted our development of a support vector machine (SVM)-based method for this task, as well as a partial least squares (PLS) regression based prediction method for comparison. A functional domain feature space was used to create SVM and PLS models, which achieved accuracies of 83.91% and 79.89%, respectively, as evaluated by a leave-one-out cross-validation. Subsequently, SVM and PLS models were developed based on functional domain and protein secretion information, which yielded accuracies of 89.13% and 86%, respectively. This analysis demonstrates that the protein functional domain and secretion information are both efficient predictors of N-glycosylation.
Key words: N-glycosylation; SVM, Support Vector Machine; PLS, Partial least squares; Prediction; Bioinformatics; Domain; and Secreted protein. Sujun Li1 1Bioinformatics Center Subscription is more cost effective than purchasing PDFs on-the-fly. Click here for details. |