TY - JOUR
T1 - Prediction of protein motions from amino acid sequence and its application to protein-protein interaction
AU - Hirose, Shuichi
AU - Yokota, Kiyonobu
AU - Kuroda, Yutaka
AU - Wako, Hiroshi
AU - Endo, Shigeru
AU - Kanai, Satoru
AU - Noguchi, Tamotsu
N1 - Funding Information:
We thank members of molecular function team in Computational Biology Research Center (CBRC), of Kuroda’s group at TUAT, and Dr. Hiroyuki Toh (Kyushu University) for helpful discussion. This work was supported by PharmaDesign, Inc..
PY - 2010
Y1 - 2010
N2 - Background. Structural flexibility is an important characteristic of proteins because it is often associated with their function. The movement of a polypeptide segment in a protein can be broken down into two types of motions: internal and external ones. The former is deformation of the segment itself, but the latter involves only rotational and translational motions as a rigid body. Normal Model Analysis (NMA) can derive these two motions, but its application remains limited because it necessitates the gathering of complete structural information. Results. In this work, we present a novel method for predicting two kinds of protein motions in ordered structures. The prediction uses only information from the amino acid sequence. We prepared a dataset of the internal and external motions of segments in many proteins by application of NMA. Subsequently, we analyzed the relation between thermal motion assessed from X-ray crystallographic B-factor and internal/external motions calculated by NMA. Results show that attributes of amino acids related to the internal motion have different features from those related to the B-factors, although those related to the external motion are correlated strongly with the B-factors. Next, we developed a method to predict internal and external motions from amino acid sequences based on the Random Forest algorithm. The proposed method uses information associated with adjacent amino acid residues and secondary structures predicted from the amino acid sequence. The proposed method exhibited moderate correlation between predicted internal and external motions with those calculated by NMA. It has the highest prediction accuracy compared to a nave model and three published predictors. Conclusions. Finally, we applied the proposed method predicting the internal motion to a set of 20 proteins that undergo large conformational change upon protein-protein interaction. Results show significant overlaps between the predicted high internal motion regions and the observed conformational change regions.
AB - Background. Structural flexibility is an important characteristic of proteins because it is often associated with their function. The movement of a polypeptide segment in a protein can be broken down into two types of motions: internal and external ones. The former is deformation of the segment itself, but the latter involves only rotational and translational motions as a rigid body. Normal Model Analysis (NMA) can derive these two motions, but its application remains limited because it necessitates the gathering of complete structural information. Results. In this work, we present a novel method for predicting two kinds of protein motions in ordered structures. The prediction uses only information from the amino acid sequence. We prepared a dataset of the internal and external motions of segments in many proteins by application of NMA. Subsequently, we analyzed the relation between thermal motion assessed from X-ray crystallographic B-factor and internal/external motions calculated by NMA. Results show that attributes of amino acids related to the internal motion have different features from those related to the B-factors, although those related to the external motion are correlated strongly with the B-factors. Next, we developed a method to predict internal and external motions from amino acid sequences based on the Random Forest algorithm. The proposed method uses information associated with adjacent amino acid residues and secondary structures predicted from the amino acid sequence. The proposed method exhibited moderate correlation between predicted internal and external motions with those calculated by NMA. It has the highest prediction accuracy compared to a nave model and three published predictors. Conclusions. Finally, we applied the proposed method predicting the internal motion to a set of 20 proteins that undergo large conformational change upon protein-protein interaction. Results show significant overlaps between the predicted high internal motion regions and the observed conformational change regions.
UR - http://www.scopus.com/inward/record.url?scp=77954407194&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77954407194&partnerID=8YFLogxK
U2 - 10.1186/1472-6807-10-20
DO - 10.1186/1472-6807-10-20
M3 - Article
C2 - 20626880
AN - SCOPUS:77954407194
SN - 1472-6807
VL - 10
JO - BMC Structural Biology
JF - BMC Structural Biology
M1 - 20
ER -