Discriminating Graph Pattern Mining from Gene Expression Data

Fassetti, Fabio; Rombo, Simona E.; Serrao, Cristina

doi:10.1007/978-3-319-63477-7_4

Part of the book series: SpringerBriefs in Computer Science ((BRIEFSCOMPUTER))

364 Accesses

Abstract

Here we consider the problem of mining gene expression data in order to single out interesting features characterizing healthy/ unhealthy samples of an input dataset. The presented approach is based on a network model of the input gene expression data, where there is a labeled graph for each sample. This is the first attempt to build a different graph for each sample and, then, to have a database of graphs for representing a sample set. The main goal is that of singling out interesting differences between healthy and unhealthy samples, through the extraction of discriminative patterns among graphs belonging to the two different sample sets. Differently from the other approaches presented in the literature, this technique is able to take into account important local similarities, and also collaborative effects involving interactions between multiple genes. In particular, edge-labeled graphs are employed and the discriminative power of a pattern is measured on the basis of edge weights, which are representative of how much relevant is the co-expression between two genes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Since there is a one-to-one correspondence between an individual and its representing tuple, for the sake of simplicity, we employ the same symbol t to denote both the individual and its corresponding tuple in the dataset.
2.
The reader is referred to Sect. 4.4.2 for the details.
3.
Note that, due to the symmetry of Eq. (4.2), the same line of reasoning can be followed to find, given values \(\rho _0\) and \(x_0\), the values of y such that the value of \(\rho \) solution of Eq. (4.2) is larger than \(\rho _0\).

References

Allison, D.B., Cui, X., Page, G.P., Sabripour, M.: Microarray data analysis: from disarray to consolidation and consensus. Nat. Rev. Genet. 7(1), 55–65 (2006)
Article Google Scholar
Anastassiou, D.: Computational analysis of the synergy among multiple interacting genes. Mol. Syst. Biol. 3(1), 83 (2007)
Google Scholar
Atias, N., Sharan, R.: Comparative analysis of protein networks: hard problems, practical solutions. Commun. ACM 55(5), 88–97 (2012)
Article Google Scholar
Dehmer, M., Emmert-Streib, F., Graber, A., Salvador, A.: Applied statistics for network biology: methods in systems biology. John Wiley & Sons (2011)
Google Scholar
Emmert-Streib, F., Tripathi, S., de Matos Simoes, R.: Harnessing the complexity of gene expression data from cancer: from single gene to structural pathway methods. Biol. Direct 7(44.10), 1186 (2012)
Google Scholar
Gray, R.M.: Entropy and information theory. Springer Science & Business Media (2011)
Google Scholar
Metzker, M.L.: Sequencing technologies-the next generation. Nat. Rev. Genet. 11(1), 31–46 (2010)
Article Google Scholar
Mitchell, T.M.: Machine Learning, vol. 45. Burr Ridge, IL: McGraw Hill (1997)
Google Scholar
Panni, S., Rombo, S.E.: Searching for repetitions in biological networks: methods, resources and tools. Brief. Bioinform. 16(1), 118–136 (2015)
Article Google Scholar
Quackenbush, J.: Computational analysis of microarray data. Nat. Revi. Genet. 2(6), 418–427 (2001)
Article Google Scholar
Roy, S., Bhattacharyya, D.K., Kalita, J.K.: Reconstruction of gene co-expression network from microarray data using local expression patterns. BMC Bioinform. 15(Suppl 7), S10 (2014)
Article Google Scholar
Rung, J., Brazma, A.: Reuse of public genome-wide gene expression data. Nat. Rev. Genet. 14, 89–99 (2013)
Article Google Scholar
Vidal, M., Cusick, M.E., Barabasi, A.L.: Interactome networks and human disease. Cell 144(6), 986–998 (2011)
Article Google Scholar
Wang, Z., Gerstein, M., Snyder, M.: Rna-seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 10(1), 57–63 (2009)
Article Google Scholar
Watkinson, J., Wang, X., Zheng, T., Anastassiou, D.: Identification of gene interactions associated with disease from gene expression data using synergy networks. BMC Syst. Biol. 2(1), 10 (2008)
Article Google Scholar
Yan, X., Cheng, H., Han, J., Yu, P.S.: Mining significant graph patterns by leap search. In: ACM SIGMOD International Conference on Management of data, pp. 433–444. ACM (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Calabria, Calabria, Cosenza, Italy
Fabio Fassetti
University of Palermo, Palermo, Italy
Simona E. Rombo
University of Calabria, Calabria, Cosenza, Italy
Cristina Serrao

Authors

Fabio Fassetti
View author publications
You can also search for this author in PubMed Google Scholar
Simona E. Rombo
View author publications
You can also search for this author in PubMed Google Scholar
Cristina Serrao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabio Fassetti .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Fassetti, F., Rombo, S.E., Serrao, C. (2017). Discriminating Graph Pattern Mining from Gene Expression Data. In: Discriminative Pattern Discovery on Biological Networks. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-63477-7_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-63477-7_4
Published: 02 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63476-0
Online ISBN: 978-3-319-63477-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics