Issue December 2007

category image Volume 25
No. 3 (p 207-326)
December 2007
ISSN 0739-110

Prediction of Side Chain Orientations in Proteins by Statistical Machine Learning Methods (p. 275-288)

We develop ways to predict the side chain orientations of residues within a protein structure by using several different statistical machine learning methods. Here side chain orientation of a given residue i is measured by an angle Ωi between the vector pointing from the center of the protein structure to the Ciα atom and the vector pointing from the Ciα atom to the center of its side chain atoms. To predict the Ωi angles, we construct statistical models by using several different methods such as general linear regression, a regression tree and bagging, a neural network, and a support vector machine. The root mean square errors for the different models range only from 36.67 to 37.60 degrees and the correlation coefficients are all between 30% and 34%. The performances of different models in the test set are, thus, quite similar, and show the relative predictive power of these models to be significant in comparison with random side chain orientations.

Key words: Side chain orientations; Statistical machine learning; Protein structure prediction.

Aimin Yan1,2
Andrzej Kloczkowski1,2
Heike Hofmann3
Robert L. Jernigan1,2,*

1Laurence H. Baker Center for Bioinformatics and Biological Statistics
2Department of Biochemistry
Biophysics and Molecular Biology
3Department of Statistics
Iowa State University
Ames, Iowa, USA
*jernigan@iastate.edu

Purchase Downloadable Full Text PDF of Articles

Corporate User

$100.00

University/Academic User

$50.00

Subscription is more cost effective than purchasing PDFs on-the-fly.  Click here for details.