Issue December 2007No. 3 (p 207-326) December 2007 ISSN 0739-110 Prediction of Side Chain Orientations in Proteins by Statistical Machine Learning Methods (p. 275-288)We develop ways to predict the side chain orientations of residues within a protein structure by using several different statistical machine learning methods. Here side chain orientation of a given residue i is measured by an angle Ωi between the vector pointing from the center of the protein structure to the Ciα atom and the vector pointing from the Ciα atom to the center of its side chain atoms. To predict the Ωi angles, we construct statistical models by using several different methods such as general linear regression, a regression tree and bagging, a neural network, and a support vector machine. The root mean square errors for the different models range only from 36.67 to 37.60 degrees and the correlation coefficients are all between 30% and 34%. The performances of different models in the test set are, thus, quite similar, and show the relative predictive power of these models to be significant in comparison with random side chain orientations.
Key words: Side chain orientations; Statistical machine learning; Protein structure prediction. Aimin Yan1,2 1Laurence H. Baker Center for Bioinformatics and Biological Statistics Subscription is more cost effective than purchasing PDFs on-the-fly. Click here for details. |