Abstract
In Bioinformatics there is a lack of software tools that fit with the requirements demanded by biologists. For instance, when a DNA sample is sequenced, a lot of work have to be performed manually and several tools are used. The application of Information Systems (IS) principles into the development of bioinformatics tools opens a new interesting research path. One of the most promising approaches is the use of conceptual models in order to precisely define how genomic data is represented into an IS. This work introduces how to build a Genome Information System (GIS) using these principles. As a first step to achieve this goal, a conceptual model to formally describe genomic mutations is presented. In addition, as a proof of concept of this approach, a variation analysis prototype has been implemented using this conceptual model as a development core.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Watson, J., Crick, F.: A structure for deoxyribose nucleic acid. Nature 171, 737–738 (1953)
Jordan, E.: The American Journal of Human Genetics 51, 1–6 (1992)
Craig, J., Venter, J.C., Adams, M.D., Myers, E., Li, P.W., Mural, R.J., Sutton, G.G., Smith, H.O., Yandell, M., Evans, C.A., Holt, R.A., Gocayne, J.D., Amanatides, P., Ballew, R.M., Huson, D.H., Wortman, J.R., Zhang, Q., Kodira, C.D., Zheng, X.H., Chen, L., Skupski, M., Subramanian, G., Thomas, P.D., Zhang, J., Gabor Miklos, G.L., Nelson, C., Broder, S., Clark, A.G., Nadeau, J., McKusick, V.A., Zinder, N., Levine, A.J., Roberts, R.J., Simon, M., Slayman, C., Hunkapiller, M., Bolanos, R., Delcher, A., Dew, I., Fasulo, D., Flanigan, M., Florea, L., Halpern, A., Hannenhalli, S., Kravitz, S., Levy, S., Mobarry, C., Reinert, K., Remington, K., Abu-Threideh, J., Beasley, E., Biddick, K., Bonazzi, V., Brandon, R., Cargill, M., Chandramouliswaran, I., Charlab, R., Chaturvedi, K., Deng, Z., Di Francesco, V., Dunn, P., Eilbeck, K., Evangelista, C., Gabrielian, A.E., Gan, W., Ge, W., Gong, F., Gu, Z., Guan, P., Heiman, T.J., Higgins, M.E., Ji, R.R., Ke, Z., Ketchum, K.A., Lai, Z., Lei, Y., Li, Z., Li, J., Liang, Y., Lin, X., Lu, F., Merkulov, G.V., Milshina, N., Moore, H.M., Naik, A.K., Narayan, V.A., Neelam, B., Nusskern, D., Rusch, D.B., Salzberg, S., Shao, W., Shue, B., Sun, J., Wang, Z., Wang, A., Wang, X., Wang, J., Wei, M., Wides, R., Xiao, C., Yao, A., Ye, J., Zhan, M., Zhang, W., Zhang, H., Zhao, Q., Zheng, L., Zhong, F., Zhong, W., Zhu, S., Zhao, S., Gilbert, D., Baumhueter, S., Spier, G., Carter, C., Cravchik, A., Woodage, T., Ali, F., An, H., Awe, A., Baldwin, D., Baden, H., Barnstead, M., Barrow, I., Beeson, K., Busam, D., Carver, A., Center, A., Cheng, M.L., Curry, L., Danaher, S., Davenport, L., Desilets, R., Dietz, S., Dodson, K., Doup, L., Ferriera, S., Garg, N., Gluecksmann, A., Hart, B., Haynes, J., Haynes, C., Heiner, C., Hladun, S., Hostin, D., Houck, J., Howland, T., Ibegwam, C., Johnson, J., Kalush, F., Kline, L., Koduru, S., Love, A., Mann, F., May, D., McCawley, S., McIntosh, T., McMullen, I., Moy, M., Moy, L., Murphy, B., Nelson, K., Pfannkoch, C., Pratts, E., Puri, V., Qureshi, H., Reardon, M., Rodriguez, R., Rogers, Y.H., Romblad, D., Ruhfel, B., Scott, R., Sitter, C., Smallwood, M., Stewart, E., Strong, R., Suh, E., Thomas, R., Tint, N.N., Tse, S., Vech, C., Wang, G., Wetter, J., Williams, S., Williams, M., Windsor, S., Winn-Deen, E., Wolfe, K., Zaveri, J., Zaveri, K., Abril, J.F., Guigó, R., Campbell, M.J., Sjolander, K.V., Karlak, B., Kejariwal, A., Mi, H., Lazareva, B., Hatton, T., Narechania, A., Diemer, K., Muruganujan, A., Guo, N., Sato, S., Bafna, V., Istrail, S., Lippert, R., Schwartz, R., Walenz, B., Yooseph, S., Allen, D., Basu, A., Baxendale, J., Blick, L., Caminha, M., Carnes-Stine, J., Caulk, P., Chiang, Y.H., Coyne, M., Dahlke, C., Mays, A., Dombroski, M., Donnelly, M., Ely, D., Esparham, S., Fosler, C., Gire, H., Glanowski, S., Glasser, K., Glodek, A., Gorokhov, M., Graham, K., Gropman, B., Harris, M., Heil, J., Henderson, S., Hoover, J., Jennings, D., Jordan, C., Jordan, J., Kasha, J., Kagan, L., Kraft, C., Levitsky, A., Lewis, M., Liu, X., Lopez, J., Ma, D., Majoros, W., McDaniel, J., Murphy, S., Newman, M., Nguyen, T., Nguyen, N., Nodell, M., Pan, S., Peck, J., Peterson, M., Rowe, W., Sanders, R., Scott, J., Simpson, M., Smith, T., Sprague, A., Stockwell, T., Turner, R., Venter, E., Wang, M., Wen, M., Wu, D., Wu, M., Xia, A., Zandieh, A., Zhu, X.: The Sequence of the Human Genome Science 291, 1304–1351 (2001)
Collins, F.S., Green, E.D., Guttmacher, A.E., Guyer, M.S.: A vision for the future of genomics research Nature 422, 835–847 (2003)
Gilbert, D.G.: Eugenes: a eukaryote genome information system. Nucleic Acids Research 30, 145–148 (2002)
Navigenics (2010), http://www.navigenics.com
23andme (2010), https://www.23andme.com
Decodeme (2010), http://www.decodeme.com
Medco acquires leading genetics healthcare company, DNA Direct (2005), http://www.dnadirect.com/web/
Knome (2010), http://www.knome.com
Irizarry, R.A., Bolstad, B.M., Collin, F., Cope, L.M., Hobbs, B., Speed, T.P.: Summaries of affymetrix genechip probe level data. Nucleic Acids Research 31, e15 (2003)
Klein, R.: Power analysis for genome-wide association studies. BMC Genetics 8, 58 (2007)
Tiwari, A., Sekhar, A.K.: Workflow based framework for life science informatics. Computational Biology and Chemistry 31, 305–319 (2007)
Hassan, M., Brown, R.D., Varma-O’brien, S., Rogers, D.: Cheminformatics analysis and learning in a data pipelining environment. Molecular Diversity 10, 283–299 (2006)
Watson, C., Guo, Y., Sheldonb, J.: Yike Guo and Jonathan Sheldon of InforSense discuss the impact of workflow technology on drug discovery. Drug Discovery Today 10, 1211–1212 (2005)
Shah, S., He, D., Sawkins, J., Druce, J., Quon, G., Lett, D., Zheng, G., Xu, T., Ouellette, B.: Pegasys: software for executing and integrating analyses of biological sequences. BMC Bioinformatics 5 (2004)
Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., Harris, M.A., Hill, D.P., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, J.C., Richardson, J.E., Ringwald, M., Rubin, G.M., Sherlock, G.: Gene ontology: tool for the unification of biology. Nature Genetics 25, 25–29 (2000)
BioPax-Consortium: Biological pathways exchange (2005), http://www.biopax.org/
Stevens, R., Baker, P., Bechhofer, S., Ng, G., Jacoby, A., Paton, N.W., Goble, C.A., Brass, A.: Tambis: Transparent access to multiple bioinformatics information sources. Bioinformatics 16, 184–186 (2000)
Paton, N.W., Khan, S.A., Hayes, A., Moussouni, F., Brass, A., Eilbeck, K., Goble, C.A., Hubbard, S.J., Oliver, S.G.: Conceptual modelling of genomic information. Bioinformatics 16, 548–557 (2000)
Brookes, A., Lehvaslaiho, H., Muilu, J., Shigemoto, Y., Oroguchi, T., Tomiki, T., Mukaiyama, A., Konagaya, A., Kojima, T., Inoue, I., Kuroda, M., Mizushima, H., Thorisson, G., Dash, D., Rajeevan, H., Darlison, M.W., Woon, M., Fredman, D., Smith, A.V., Senger, M., Naito, K., Sugawara, H.: The phenotype and genotype experiment object model (PaGE-OM): a robust data structure for information related to DNA variation. Human Mutation 30, 968–977 (2009)
Medigue, C., Rechenmann, F., Danchin, A., Viari, A.: Imagene: an integrated computer environment for sequence annotation and analysis. Bioinformatics 15, 2–15 (1999)
den Dunnen, J.T., Antonarakis, E.: Nomenclature for the description of human sequence variations. Human Genetics 109, 121–124 (2001)
Richesson, R., Turley, J.P.: Conceptual models: Definitions, construction, and applications in public health surveillance. Journal of Urban Health 80, i128 (2006)
Pastor, O., Levin, A., Casamayor, J., Celma, M., Virueta, A., Eraso, L., Pérez-Alonso, M.: Enforcing conceptual modeling to improve the understanding of human genome. In: Procs. of the IVth Int. Conference on Research Challenges in Information Science (2010)
NCBI: The refseqgene project (2010), http://www.ncbi.nlm.nih.gov/RefSeq/RSG
Stevens, R., Goble, C., Baker, P., Brass, A.: A classification of tasks in bioinformatics. Bioinformatics 17, 180–188 (2001)
Kent, W.J.: Blat, the blast-like alignment tool. Genome Research 12, 656–664 (2002)
Altschul, S., Gish, W., Miller, W., Myers, E.W., Lipman, D.: Basic local alignment search tool. Journal of Molecular Biology 215, 403–410 (1990)
Ram, S.: Toward Semantic Interoperability of Heterogeneous Biological Data Sources. In: Pastor, Ó., Falcão e Cunha, J. (eds.) CAiSE 2005. LNCS, vol. 3520, pp. 32–32. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martínez, A.M., Martín, A., Villanueva, M.J., Valverde, F., Levin, A.M., Pastor, O. (2011). Facing the Challenges of Genome Information Systems: A Variation Analysis Prototype. In: Soffer, P., Proper, E. (eds) Information Systems Evolution. CAiSE 2010. Lecture Notes in Business Information Processing, vol 72. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17722-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-17722-4_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17721-7
Online ISBN: 978-3-642-17722-4
eBook Packages: Computer ScienceComputer Science (R0)