Inferring the Association Network from p53 Sequence Alignment Using Granular Evaluations

  • David K. Y. Chiu
  • Ramya Manjunath
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7986)


The relationship connecting the biomolecular sequence, the molecular structure, and the biological function is of extreme importance in nanostructure analysis such as drug discovery. Previous studies involving multiple sequence alignment of biomolecules have demonstrated that associated sites are indicative of the structural and functional characteristics of biomolecules, comparable to methods such as consensus sequences analysis. In this paper, a new method to detect associated sites in aligned sequence ensembles is proposed. It involves the use of multiple sub-tables (or levels) of two-dimensional contingency table analysis. The idea is to incorporate analysis by using a concept known as granular computing, which represents information at different levels of granularity. The analysis involves two phases. The first phase includes labeling of the molecular sites in the p53 protein multiple sequence alignment according to the detected associated patterns. The sites are consequently labeled into three different types based on their site characteristics: 1) conserved sites, 2) associated sites and 3) hypervariate sites. In the second phase, the significance of the extracted site patterns is evaluated with respect to targeted structural and functional characteristics of the p53 protein. The results indicate that the extracted site patterns are significantly associated with some of the known functionalities of p53, a cancer suppressor. Furthermore, when these sites are aligned with p63 and p73, the homologs of p53 without the same cancer suppressing property, based on the common domains, the sites significantly discriminate between the human sequences of the p53 family. Therefore, the study confirms the importance of these detected sites that could indicate their differences in cancer suppressing property.


Data-mining association network protein sequence alignment granular computing bioinformatics 


  1. 1.
    Chiu, D.K.Y., Chen, X., Wong, A.K.C.: Association Between Statistical and Functional Patterns in Biomolecules. In: Proceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems and Technology, Durham, USA, pp. 64–69 (2001)Google Scholar
  2. 2.
    Chiu, D.K.Y., Cheung, B.: Hierarchical Maximum Entropy Discretization. Computing and Information. In: Proceedings of the International Conference on Computing and Information (ICCI 1989), pp. 237–242. North-Holland, Toronto (1989)Google Scholar
  3. 3.
    Chiu, D.K.Y., Cheung, B., Wong, A.K.C.: Information Synthesis based on Hierarchical Maximum Entropy Discretization. Journal of Experimental and Theoretical Artificial Intelligence 2, 117–129 (1990)CrossRefGoogle Scholar
  4. 4.
    Chiu, D.K.Y., Kolodziejczak, T.: Inferring Consensus Structure from Nucleic Acid Sequences. Computational Applications in Biosciences 7, 347–352 (1991)Google Scholar
  5. 5.
    Chiu, D.K.Y., Lui, T.W.H.: NHOP: A Nested Associative Pattern for Analysis of Consensus Sequence Ensembles. IEEE Trans. on Knowledge and Data Engineering (2012) (in press)Google Scholar
  6. 6.
    Chiu, D.K.Y., Lui, T.W.H.: A Multiple-pattern Biosequence Analysis Method for Diverse Source Association Mining. Applied Bioinformatics 4(2), 85–92 (2005)CrossRefGoogle Scholar
  7. 7.
    Chiu, D.K.Y., Wang, Y.: Multipattern Consensus Regions in Multiple Aligned Protein Sequences and their Segmentation. EURASIP J. Bioinformatics Syst. Biol. 35809, 1–8 (2006)CrossRefGoogle Scholar
  8. 8.
    Chiu, D.K.Y., Wong, A.K.C.: Multiple Pattern Associations for Interpreting Structural and Functional Characteristics of Biomolecules. Information Science 167, 23–39 (2004)MathSciNetzbMATHCrossRefGoogle Scholar
  9. 9.
    Chiu, D.K.Y., Wong, A.K.C., Cheung, B.: Information Discovery through Hierarchical Maximum Entropy Discretization and Synthesis. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 126–140. MIT Press, Cambridge (1991)Google Scholar
  10. 10.
    Chiu, D.K.Y., Xu, P.S.C.: InfoBarcoding: Selection of Non-Contiguous Sites in Molecular Biomarker. In: Proceeding, Computational Advances in Bio. and Medical Sciences (ICCABS), pp. 69–74 (2011)Google Scholar
  11. 11.
    Durston, K., Chiu, D.K.Y., Wong, A.K.C., Li, G.C.L.: Statistical Discovery of Site Inter-Dependencies in Sub-Molecular Hierarchical Protein Structuring. EURASIP J. on Bioinformatics and Systems Biology 2012, 8 (2012)CrossRefGoogle Scholar
  12. 12.
    European Bioinformatics Institute tool for Multiple Sequence Alignment using clustalw2,
  13. 13.
    Frishman, D., Mironov, A., Gelfand, M.: Starts of Bacterial Genes: Estimating the Reliability of Computer Predictions. Gene 234, 257–265 (1999)CrossRefGoogle Scholar
  14. 14.
    Gonzalez, A.J., Liao, L., Wu, C.H.: Predicting Ligand-Binding Residues using Multi-Positional Correlations and Kernel Canonical Correlation Analysis. In: Proc. 2010 IEEE Intern. Conf. of Bioinformatics and Biomedicine (BIBM), pp. 158–163 (2010)Google Scholar
  15. 15.
    Greenblatt, M.S., Bennett, W.P., Hollstein, M., Harris, C.C.: Mutations in the p53 Tumor Suppressor Gene: Clues to Cancer Etiology and Molecular Pathogenesis. Cancer Research 54, 4855–4878 (1994)Google Scholar
  16. 16.
    Haberman, S.J.: The Analysis of Residuals in Cross-Classified Tables. Biometrics 29, 205–220 (1973)CrossRefGoogle Scholar
  17. 17.
    Hollstein, M., Sidransky, D., Vogelstein, B., Harris, C.C.: p53 Mutations in Human Cancers. Science 253(5015), 49–53 (1991)CrossRefGoogle Scholar
  18. 18.
    Joerger, A.C., Fersht, A.R.: Structural Biology of the Tumor Suppressor p53 and Cancer-Associated Mutants. Advanced Cancer Research 97, 1–23 (2007)CrossRefGoogle Scholar
  19. 19.
    Lane, D.P.: Cancer and p53, Guardian of the Genome. Nature 358, 15–16 (1992)CrossRefGoogle Scholar
  20. 20.
    Lane, D.P., Cheok, C.F., Lain, S.: p53-based Cancer Therapy. Cold Spring Harb. Perspect. Biol., 2, a001222 (2010)Google Scholar
  21. 21.
    Lin, T.Y.: Granular computing. In: Wang, G., Liu, Q., Yao, Y., Skowron, A. (eds.) RSFDGrC 2003. LNCS (LNAI), vol. 2639, pp. 16–24. Springer, Heidelberg (2003)Google Scholar
  22. 22.
    Lin, T.Y.: From Rough Sets and Neighborhood Systems to Information Granulation and Computing in Words. In: European Congress on Intelligent Techniques and Soft Computing, pp. 1602–1607 (1997)Google Scholar
  23. 23.
    Melino, G., Lu, X., Gasco, M., Crook, T., Knight, R.A.: Functional Regulation of p73 and p63: Development and Cancer. Trends Biochem. Sci. 28, 663–670 (2003)CrossRefGoogle Scholar
  24. 24.
    Pedrycz, W.: Granular Computing: An Emerging Paradigm. Physica-Verlag, Heidelberg (2003)Google Scholar
  25. 25.
    The p53 website,
  26. 26.
    The UniProtKB database,
  27. 27.
    Wong, A.K.C., Lui, T.S., Wang, C.C.: Statistical Analysis of Residue Variability in Cytochrome C. J. Molecular Biology 102(2), 287–295 (1976)CrossRefGoogle Scholar
  28. 28.
    Wong, A.K.C., Wang, Y.: High-Order Pattern Discovery from Discrete-Valued Data. IEEE Trans. on Knowledge Systems 9(6), 877–893 (1997)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • David K. Y. Chiu
    • 1
  • Ramya Manjunath
    • 1
  1. 1.University of GuelphGuelphCanada

Personalised recommendations