Estimation of Missing Values in SNP Array

  • Conference paper
Modern Advances in Applied Intelligence (IEA/AIE 2014)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8482)

Abstract

The use of DNA microarrays in genetics is growing rapidly and generates huge amounts of data. An estimated 5–20% of measurements fail, leaving missing values in the data destined for further analysis. Missing values reduce the reliability of downstream microarray analysis, so effective and efficient methods for estimating them are needed.

This report presents a method for estimating missing values in SNP microarrays using k-Nearest Neighbors (kNN) among similar individuals. The use of preliminary imputation is proposed and discussed. It is shown that introducing multiple passes of kNN improves the quality of missing value estimation.
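The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the author's algorithm. It assumes genotypes are encoded as 0, 1, 2 with -1 marking a failed call, uses the per-SNP mode as the preliminary imputation, measures similarity between individuals by Hamming distance, and fills each gap by majority vote among the k nearest individuals. The names `column_mode`, `knn_impute`, and `MISSING`, as well as every one of these choices, are illustrative assumptions.

```python
import numpy as np

MISSING = -1  # assumed marker for a failed genotype call


def column_mode(column, missing=MISSING):
    """Most frequent observed genotype in one SNP column (0 if none observed)."""
    observed = column[column != missing]
    if observed.size == 0:
        return 0
    values, counts = np.unique(observed, return_counts=True)
    return values[np.argmax(counts)]


def knn_impute(genotypes, k=3, passes=2, missing=MISSING):
    """Fill missing genotypes; rows are individuals, columns are SNPs."""
    data = np.asarray(genotypes)
    mask = data == missing
    filled = data.copy()

    # Preliminary imputation: replace each gap with the per-SNP mode so that
    # distances between individuals can be computed on complete rows.
    for j in range(filled.shape[1]):
        filled[mask[:, j], j] = column_mode(data[:, j], missing)

    rows_with_gaps = np.flatnonzero(mask.any(axis=1))
    for _ in range(passes):
        updated = filled.copy()
        for i in rows_with_gaps:
            # Hamming distance from individual i to every individual.
            dist = (filled != filled[i]).sum(axis=1)
            dist[i] = filled.shape[1] + 1  # never pick the row itself
            neighbours = np.argsort(dist, kind="stable")[:k]
            # Re-estimate only the originally missing positions.
            for j in np.flatnonzero(mask[i]):
                values, counts = np.unique(filled[neighbours, j],
                                           return_counts=True)
                updated[i, j] = values[np.argmax(counts)]
        filled = updated  # later passes use the refined estimates
    return filled
```

Each additional pass recomputes the neighbours from the estimates of the previous pass rather than from the crude preliminary fill, which is the intuition behind running kNN more than once. Whether this matches the paper's exact distance measure, voting rule, or pass schedule cannot be determined from the abstract.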

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Podsiadly, P. (2014). Estimation of Missing Values in SNP Array. In: Ali, M., Pan, JS., Chen, SM., Horng, MF. (eds) Modern Advances in Applied Intelligence. IEA/AIE 2014. Lecture Notes in Computer Science, vol 8482. Springer, Cham. https://doi.org/10.1007/978-3-319-07467-2_45

  • DOI: https://doi.org/10.1007/978-3-319-07467-2_45

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07466-5

  • Online ISBN: 978-3-319-07467-2

  • eBook Packages: Computer Science, Computer Science (R0)