TY - GEN
T1 - Analyzing conservation patterns and its influence on identifying protein functional
AU - Fang, Chun
AU - Noguchi, Tamotsu
AU - Yamana, Hayato
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Evolutionary conservation information included in position-specific scoring matrix (PSSM) has been adopted by almost all sequence-based methods for identifying protein functional sites, because all functional sites, whether in ordered or disordered proteins, are found to be conserved to some extent. However, different functional sites have different conservation patterns, some of them are linear contextual, some of them are mingled with highly variable residues, and some others seem to be conserved independently. All existing studies used the direct output of PSSM for functional sites prediction, without considering the relationship between conservation patterns of residues and the distribution of conservation scores in PSSMs. In order to demonstrate the importance of analyzing conservation patterns, three PSSM-based methods for identifying 3 kinds of functional sites have been compared. Experiment results show that, although all the methods are based on the same feature - PSSM of protein sequence, they are competent in identifying different patterns of functional sites: the PSSM-based method is competent for identifying functional site which is independently conserved; the smoothed-PSSM is competent for identifying functional site which is continuous conserved; and the masked-smoothed- PSSM based method is competent for identifying functional site which is mingled with intensively highly flexible and highly conserved residues.
AB - Evolutionary conservation information included in position-specific scoring matrix (PSSM) has been adopted by almost all sequence-based methods for identifying protein functional sites, because all functional sites, whether in ordered or disordered proteins, are found to be conserved to some extent. However, different functional sites have different conservation patterns, some of them are linear contextual, some of them are mingled with highly variable residues, and some others seem to be conserved independently. All existing studies used the direct output of PSSM for functional sites prediction, without considering the relationship between conservation patterns of residues and the distribution of conservation scores in PSSMs. In order to demonstrate the importance of analyzing conservation patterns, three PSSM-based methods for identifying 3 kinds of functional sites have been compared. Experiment results show that, although all the methods are based on the same feature - PSSM of protein sequence, they are competent in identifying different patterns of functional sites: the PSSM-based method is competent for identifying functional site which is independently conserved; the smoothed-PSSM is competent for identifying functional site which is continuous conserved; and the masked-smoothed- PSSM based method is competent for identifying functional site which is mingled with intensively highly flexible and highly conserved residues.
UR - http://www.scopus.com/inward/record.url?scp=84905823610&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905823610&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84905823610
SN - 9781632665140
T3 - Proceedings of the 6th International Conference on Bioinformatics and Computational Biology, BICOB 2014
SP - 73
EP - 79
BT - Proceedings of the 6th International Conference on Bioinformatics and Computational Biology, BICOB 2014
PB - International Society for Computers and Their Applications
T2 - 6th International Conference on Bioinformatics and Computational Biology, BICOB 2014
Y2 - 24 March 2014 through 26 March 2014
ER -