Skip to main content

Ethics and Data Mining in Biomedical Engineering

  • Chapter
  • First Online:
  • 964 Accesses

Abstract

Data mining is a generic term given to the process of analysing data, usually high volumes of data contained in large databases, in order to discover previously unknown patterns and trends. Data mining utilizes and combines methods from statistics, machine learning, pattern recognition and database management. Typical data mining tasks involve detecting data subsets that are similar in some way, unusual or anomalous or have features that are associated or dependent. Although data mining is not traditionally focused on the development of predictive models that generalise known patterns to new (unseen) data, this is often an extremely valuable way of verifying the efficacy of the derived models. That is, if these models accurately predict unseen data, then is more likely that these models truly represent the underlying patterns in the data rather than being purely by-chance occurrences. In addition, in biomedical applications the purpose of the data mining is often to better understand the patterns of disease so that improved diagnoses, prognoses and treatments can be developed in the future.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  • Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit 30(7):1145–1159

    Article  Google Scholar 

  • Bradley AP, Longstaff ID (2004) Sample size estimation using the receiver operating characteristic curve. In: International Conference on Pattern Recognition, Cambridge, vol. 4, pp. 428–431

    Google Scholar 

  • Breiman L, Friedman J, Olshen R, Stone C (1984) Classification and regression trees. Wadsworth, Belmont, CA

    MATH  Google Scholar 

  • Dietterich TG, Lathrop RH, Lozano-Pérez T (1997) Solving the multiple instance problem with axis-parallel rectangles. Artif Intell 89(1–2):31–71

    Article  MATH  Google Scholar 

  • Duda RO, Hart PE, Stork DG (2001) Pattern classification. Wiley, New York, NY

    MATH  Google Scholar 

  • Efron B (1982) The jackknife, the bootstrap, and other resampling plans. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA

    Book  Google Scholar 

  • Fawcett T (2006) An introduction to ROC analysis. Pattern Recognit Lett 27:861–874

    Article  Google Scholar 

  • Friedman JH. (1995) Introduction to computational learning and statistical prediction. In: Tutorial at the International Conference on Machine Learning, Lake Tahoe, CA

    Google Scholar 

  • Fukunaga K (1990) Introduction to statistical pattern recognition, 2nd edn. New York, NY, Academic

    MATH  Google Scholar 

  • Hand DJ (1981) Discrimination and classification. Wiley, Chichester

    MATH  Google Scholar 

  • Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36

    Google Scholar 

  • Hanley JA, McNeil BJ (1983) A method of comparing the areas under receiver operating ­characteristic curves derived from the same cases. Radiology 148:601–611

    Google Scholar 

  • Landgrebe TCW, Duin RPW (2008) Efficient multiclass ROC approximation by decomposition via confusion matrix perturbation analysis. IEEE Trans Pattern Anal Mach Intell 30(5):810–822

    Article  Google Scholar 

  • Landgrebe TCW, Paclik P, Duin RPW, Bradley AP (2006) Precision-recall operating characteristic curves in imprecise environments. In: International Conference on Pattern Recognition, Hong Kong, vol. 4, pp. 123–127

    Google Scholar 

  • McLachlan GJ (1992) Discriminant analysis and statistical pattern recognition. Wiley, New York, NY

    Book  Google Scholar 

  • Old F (1993) Inventions, patents, brands and designs. Patent Press, Sydney

    Google Scholar 

  • Preston N (1996) Understanding ethics. Federation Press, Sydney

    Google Scholar 

  • Seltzer W (2005) The promise and pitfalls of data mining: ethical issues. In Proceedings of the American Statistical Association, Section on Government Statistics, Alexandria, VA: American Statistical Association, pp. 1441–1445

    Google Scholar 

  • Swets JA, Dawes RM, Monahan J (2000) Better Decisions Through Science. Scientific American, pp. 82–87

    Article  Google Scholar 

  • Swets JA, Pickets RM (1982) Evaluation of Diagnostic Systems: Methods from Signal Detection Theory, Academic Press, New York

    Article  Google Scholar 

  • Vandewalle P, Kovacevic J, Vetterli M (2009) Reproducible research in signal processing. IEEE Signal Process Mag 26(3):37–47

    Article  Google Scholar 

  • Weiss S, Kulikowski C (1991) Computer systems that learn: classification and prediction methods from statistics, neural networks, machine learning, and expert systems. Morgan Kaufmann, San Mateo, CA

    Google Scholar 

  • Zar JH (1998) Biostatistical analysis, 4th edn. Prentice-Hall, Upper Saddle River, NJ

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Andrew P. Bradley .

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media New York

About this chapter

Cite this chapter

Bradley, A.P. (2013). Ethics and Data Mining in Biomedical Engineering. In: Ethics for Biomedical Engineers. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6913-1_5

Download citation

Publish with us

Policies and ethics