TY - JOUR
T1 - Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites
AU - Fang, Chun
AU - Noguchi, Tamotsu
AU - Yamana, Hayato
N1 - Publisher Copyright:
© Imperial College Press.
PY - 2014/10/30
Y1 - 2014/10/30
N2 - Evolutionary conservation information included in position-specific scoring matrix (PSSM) has been widely adopted by sequence-based methods for identifying protein functional sites, because all functional sites, whether in ordered or disordered proteins, are found to be conserved at some extent. However, different functional sites have different conservation patterns, some of them are linear contextual, some of them are mingled with highly variable residues, and some others seem to be conserved independently. Every value in PSSMs is calculated independently of each other, without carrying the contextual information of residues in the sequence. Therefore, adopting the direct output of PSSM for prediction fails to consider the relationship between conservation patterns of residues and the distribution of conservation scores in PSSMs. In order to demonstrate the importance of combining PSSMs with the specific conservation patterns of functional sites for prediction, three different PSSMbased methods for identifying three kinds of functional sites have been analyzed. Results suggest that, different PSSM-based methods differ in their capability to identify different patterns of functional sites, and better combining PSSMs with the specific conservation patterns of residues would largely facilitate the prediction.
AB - Evolutionary conservation information included in position-specific scoring matrix (PSSM) has been widely adopted by sequence-based methods for identifying protein functional sites, because all functional sites, whether in ordered or disordered proteins, are found to be conserved at some extent. However, different functional sites have different conservation patterns, some of them are linear contextual, some of them are mingled with highly variable residues, and some others seem to be conserved independently. Every value in PSSMs is calculated independently of each other, without carrying the contextual information of residues in the sequence. Therefore, adopting the direct output of PSSM for prediction fails to consider the relationship between conservation patterns of residues and the distribution of conservation scores in PSSMs. In order to demonstrate the importance of combining PSSMs with the specific conservation patterns of functional sites for prediction, three different PSSMbased methods for identifying three kinds of functional sites have been analyzed. Results suggest that, different PSSM-based methods differ in their capability to identify different patterns of functional sites, and better combining PSSMs with the specific conservation patterns of residues would largely facilitate the prediction.
KW - Conservation patterns
KW - Functional sites
KW - Position-specific scoring matrix
UR - http://www.scopus.com/inward/record.url?scp=84908637319&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84908637319&partnerID=8YFLogxK
U2 - 10.1142/S0219720014400034
DO - 10.1142/S0219720014400034
M3 - Article
C2 - 25362840
AN - SCOPUS:84908637319
SN - 0219-7200
VL - 12
JO - Journal of Bioinformatics and Computational Biology
JF - Journal of Bioinformatics and Computational Biology
IS - 5
M1 - 1440003
ER -