Abstract
Various genetic events during the process of natural evolution shape the landscape of the genomes. In this chapter, we explore an approach to investigating multiple genomes in order to unravel their complex relationships that go beyond their placement on a phylogeny. To this end, we treat genes as the smallest syntactic unit on the genome and explore their relative organization across multiple genomes. In the first half of the chapter, we discuss mathematical models to capture the combinatorial structures of this relative organization and statistical models to study their distributions. In the second half of the chapter, we apply these models to analyze the relationship between three closely related plant genomes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Laxmi Parida. (2007) Statistical significance of large gene clusters. Journal of Computational Biology, 14(9):1145–1149.
M Zhang, H W Leong. (2009) Gene Team Tree: A Hierarchical Representation of Gene Teams for All Gap Lengths. J. Comp. Biol, 16(10):1383–1389.
Q Yang, G Yi, F Zhang, M R Thon, S-H Sze. (2010) Identifying Gene Clusters within Localized Regions in Multiple Genomes. J. Comp. Biol., 17(5):657–668.
M Zhang, H W Leong. (2010) Bidirectional best hit r-window gene clusters. BMC Bioinf., 11(Suppl 1):s63.
Gad Landau, Laxmi Parida, and Oren Weimann. (2005) Using PQ trees for comparative genomics. In Proc. of the Symp. on Comp. Pattern Matching, volume 3537 of Lecture Notes in Computer Science, pages 128–143. Springer-Verlag.
Laxmi Parida. (2007) Pattern Discovery in Bioinformatics: Theory and Algorithms. Chapman Hall.
Rakesh Agrawal and Ramakrishnan Srikant. (1994) Fast algorithms for mining association rules in large databases. Proceedings of the 20th International Conference on Very Large Data Bases, VLDB, pp 487–499, Santiago, Chile.
R Eres, G M Landau, L Parida. (2004) Permutation pattern discovery in biosequences. J. Comp. Biol., 11(6):1050–1060.
D Sankoff, L Haque. (2005) Power boosts for cluster tests. LNCS 3678:121–130.
Z Yang, D Sankoff. (2009) Natural parameter values for generalized gene adjacency. LNCS, 5817:13–23.
O. Jaillon et al. (2007) The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature, 449(5793):463–467.
G. A. Tuskan et al. (2006) The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray). Science, 313(5793): 1596–1604.
The Arabidopsis Genome Initiative. (2000) Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature, 408(6814):796–815.
P. J. Kersey et al. (2010) Ensembl Genomes: Extending Ensembl across the taxonomic space. Nucleic Acids Research, 38(suppl 1):D563–D569.
Albert J. Vilella, Jessica Severin, Abel Ureta-Vidal, Li Heng, Richard Durbin, and Ewan Birney. (2009) EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates. Genome Research, 19(2):327–335.
The Gene Ontology Consortium. (2000) Gene ontology: tool for the unification of biology. Nat. Genet., 25(1):25–9.
Acknowledgments
We would like to thank Alex Feltus for providing the genomic characteristics for the three plant species (Fig. 3).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Parida, L., Haiminen, N. (2012). Discovering Patterns in Gene Order. In: Anisimova, M. (eds) Evolutionary Genomics. Methods in Molecular Biology, vol 855. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-61779-582-4_16
Download citation
DOI: https://doi.org/10.1007/978-1-61779-582-4_16
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-61779-581-7
Online ISBN: 978-1-61779-582-4
eBook Packages: Springer Protocols