Accelerating Disease Gene Identification Through Integrated SNP Data Analysis
Information about small genetic variations in organisms, known as single nucleotide polymorphisms (SNPs), is crucial for identifying candidate genes that play a role in disease susceptibility, a long-standing research goal in biology. While a number of established public SNP databases are available, specifying effective techniques for SNP analysis remains an open issue. We describe a secondary SNP database that integrates data from multiple public sources and is designed to support various experimental ranking models for SNPs. By prioritizing SNPs within large regions of the genome, scientists can rapidly narrow their search for candidate genes. In this paper we describe the ranking models, the data integration architecture, and preliminary experimental results.
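The abstract does not specify the ranking models or the database schema, so the following is only a minimal sketch of the general idea: merge SNP records from several sources, score each SNP, and rank the SNPs that fall within a genomic region. Every name here (`Snp`, `merge_sources`, `rank_region`), the annotation flags, and the weights are hypothetical and chosen purely for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass, field


@dataclass
class Snp:
    snp_id: str
    chrom: str
    pos: int
    # Hypothetical boolean annotation flags, e.g. {"coding": True}
    features: dict = field(default_factory=dict)


# Hypothetical weights for an additive scoring model (assumption, not from the paper)
WEIGHTS = {"coding": 3.0, "nonsynonymous": 5.0, "conserved": 2.0}


def merge_sources(*sources):
    """Combine records from several sources, keyed by SNP identifier."""
    merged = {}
    for source in sources:
        for snp in source:
            if snp.snp_id in merged:
                # Same SNP seen in another source: pool its annotations
                merged[snp.snp_id].features.update(snp.features)
            else:
                merged[snp.snp_id] = Snp(snp.snp_id, snp.chrom, snp.pos, dict(snp.features))
    return list(merged.values())


def score(snp, weights=WEIGHTS):
    """Sum the weights of every annotation flag that is set on the SNP."""
    return sum(w for name, w in weights.items() if snp.features.get(name))


def rank_region(snps, chrom, start, end, weights=WEIGHTS):
    """Return SNPs inside [start, end] on chrom, highest score first."""
    in_region = [s for s in snps if s.chrom == chrom and start <= s.pos <= end]
    return sorted(in_region, key=lambda s: score(s, weights), reverse=True)


# Toy data from two made-up sources
source_a = [Snp("snp_a", "1", 100, {"coding": True}),
            Snp("snp_b", "1", 200, {"conserved": True})]
source_b = [Snp("snp_a", "1", 100, {"nonsynonymous": True}),
            Snp("snp_c", "2", 50, {"coding": True})]

ranked = rank_region(merge_sources(source_a, source_b), "1", 1, 1000)
ranked_ids = [s.snp_id for s in ranked]
```

Swapping in a different `weights` dictionary stands in for trying a different experimental ranking model over the same integrated data.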
Keywords: Single Nucleotide Polymorphism; Score Model; Single Nucleotide Polymorphism Data; Single Nucleotide Polymorphism Analysis; Single Nucleotide Polymorphism Density