Predicting residue contacts for protein-protein interactions by integration of multiple information


Detailed knowledge of interfacial region between interacting proteins is not only helpful in annotating function for proteins, but also very important for structure-based drug design and disease treatment. However, this is one of the most difficult tasks and current methods are constrained by some factors. In this study, we developed a new method to predict residue-residue contacts of two interacting protein domains by integrating information about evolutionary couplings andamino acid pairwise contact potentials, as well as domain-domain interaction interfaces. The experimental results showed that our proposed method outperformed the previous method with the same datasets. Moreover, the method promises an improvement in the source of template-based protein docking.

Share and Cite:

Kien T. Le, T. , Hirose, O. , Tran, V. , Saethang, T. , T. Nguyen, L. , Dang, X. , Ngo, D. , Kubo, M. , Yamada, Y. and Satou, K. (2014) Predicting residue contacts for protein-protein interactions by integration of multiple information. Journal of Biomedical Science and Engineering, 7, 28-37. doi: 10.4236/jbise.2014.71005.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] Chen, C.-T., Peng, H.-P., Jian, J.-W., Tsai, K.-C., Chang, J.-Y., Yang, E.-W., Chen, J.-B., Ho, S.-Y., Hsu, W.-L. and Yang, A.-S. (2012) Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces. PloS ONE, 7, e37706.
[2] Jordan, A.R., El-Manzalawy, Y., Dobbs, D. and Honavar, V. (2012) Predicting protein-protein interface residues using local surface structural similarity. BMC Bioinformatics, 13, 41.
[3] Bradford, J.R. and Westhead, D.R. (2005) Improved prediction of protein-protein binding sites using a support vector machines approach. Bioinformatics, 21, 1487-1494.
[4] Zhou, H.X. and Shan, Y. (2001) Prediction of protein interaction sites from sequence profile and residue neighbor list. Proteins, 44, 336-343.
[5] Burgoyne, N.J. and Jackson, R.M. (2006) Predicting protein interaction sites: Binding hot-spots in protein-protein and protein-ligand interfaces. Bioinformatics, 22, 1335-1342.
[6] Friedrich, T., Pils, B., Dandekar, T. and Muller, T. (2006) Modelling interaction sites in protein domains with interaction profile hidden Markov models. Bioinformatics, 22, 2851-2857.
[7] Ofran, Y. and Rost, B. (2003) Predicted protein-protein interaction sites from local sequence information. FEBS Letters, 544, 236-239.
[8] Zhou, H.-X. and Qin, S. (2007) Interaction-site prediction for protein complexes: A critical assessment. Bioinformatics, 23, 2203-2209.
[9] Ritchie, D.W. (2008) Recent progress and future directions in protein-protein docking. Current Protein and Peptide Science, 9, 1-15.
[10] Li, B. and Kihara, D. (2012) Protein docking prediction using predicted protein-protein interface. BMC Bioinformatics, 13, 7.
[11] Thattai, M., Burak, Y. and Shraiman, B.I. (2007) The origins of specificity in polyketide synthase protein interactions. PLoS Computational Biology, 3, 1827-1835.
[12] Burger, L. and Van Nimwegen, E. (2008) Accurate prediction of protein-protein interactions from sequence alignments using a Bayesian method. Molecular Systems Biology, 4, 1-14.
[13] White, R.A., Szurmant, H., Hoch, J.A. and Hwa, T. (2007) Features of protein-protein interactions in two-component signaling deduced from genomic libraries. Methods in Enzymology, 422, 75-101.
[14] Weigt, M., White, R.A., Szurmant, H., Hoch, J.A. and Hwa, T. (2009) Identification of direct residue contacts in protein-protein interaction by message passing. PNAS, 106, 67-72.
[15] González, A.J., Liao, L. and Wu, C.H. (2013) Prediction of contact matrix for protein-protein interaction. Bioinformatics, 29, 1018-1025.
[16] Ghoorah, A.W., Devignes, M.-D., Smaïl-Tabbone, M. and Ritchie, D.W. (2011) Spatial clustering of protein binding sites for template based protein docking. Bioinformatics, 27, 2820-2827.
[17] Aloy, P., Ceulemans, H., Stark, A. and Russell, R.B. (2003) The relationship between sequence and interaction divergence in proteins. Journal of Molecular Biology, 332, 989-998.
[18] Keskin, O. and Nussinov, R. (2007) Similar binding sites and different partners: Implications to shared proteins in cellular pathways. Structure, 15, 341-354.
[19] Morcos, F., Pagnani, A., Lunt, B., Bertolino, A., Marks, D.S., Sander, C., Zecchina, R., Onuchic, J.N., Hwa, T. and Weigt, M. (2011) Direct-coupling analysis of residue coevolution captures native contacts across many protein families. Proceedings of the National Academy of Sciences of the United States of America, 108, E1293-E1301.
[20] Kawashima, S., Pokarowski, P., Pokarowska, M., Kolinski, A., Katayama, T. and Kanehisa, M. (2008) AAindex: Amino acid index database, progress report 2008. Nucleic Acids Research, 36, D202-D205.
[21] Krogh, A., Brown, M., Mian, I.S., Jokander, K. and David, H. (1994) Hidden Markov Models in Computational Biology Applications to Protein Modeling. Journal of Molecular Biology, 235, 1501-1531.
[22] Eddy, S.R. (1998) Profile hidden Markov models. Bioinformatics Review, 14, 755-763.
[23] González, A.J. and Liao, L. (2010) Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines. BMC Bioinformatics, 11, 537.
[24] González, A.J. and Liao, L. (2009) Constrained fisher scores derived from interaction profile hidden Markov models improve protein to protein interaction. Proceedings of the First International Conference BICoB 2009, New Orleans, 8-10 April 2009, 236-247.
[25] Hopf, T.A., Colwell, L.J., Sheridan, R., Rost, B., Sander, C. and Marks, D.S. (2012) Three-dimensional structures of membrane proteins from genomic sequencing. Cell, 149, 1607-1621.
[26] Marks, D.S., Colwell, L.J., Sheridan, R., Hopf, A.T., Pagnani, A., Zecchina, R. and Sander, C. (2011) Protein 3D structure computed from evolutionary sequence variation. PloS ONE, 6, e28766.
[27] Procaccini, A., Lunt, B., Szurmant, H., Hwa, T. and Weigt, M. (2011) Dissecting the specificity of protein-protein interaction in bacterial two-component signaling: orphans and crosstalks. PloS ONE, 6, e19729.
[28] Punta, M., Coggill, P.C., Eberhardt, R.Y., Mistry, J., Tate, J., Boursnell, C., Pang, N., Forslund, K., Ceric, G., Clements, J, Heger, A., Holm, L., Sonnhammer, E.L.L., Eddy, S.R., Bateman, A. and Finn, R.D. (2012) The Pfam protein families database. Nucleic Acids Research, 40, D290-D301.
[29] Stein, A., Russell, R.B. and Aloy, P. (2005) 3did: Interacting protein domains of known three-dimensional structure. Nucleic Acids Research, 33, D413-D417.

Copyright © 2023 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.