Skip to main content

Outlier Labeling Methods for Medical Data

  • Chapter
  • First Online:
Logistics, Supply Chain and Financial Predictive Analytics

Part of the book series: Asset Analytics ((ASAN))

Abstract

Outlier detection should be considered as preliminary step to avoid misinterpretation of results in data analysis. In this paper, the performance indices of the outlier labeling methods such as SD method, Median method, MADe method, Z−Score, Modified Z−Score, Tukey’s method for univariate data set were compared. Each labeling method has different measures and based on which an interval was constructed to detect outliers. We have attempted to find the appropriate choice of outlier detection methods. The advantages and disadvantages of each method were discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Ferguson TS (1961) On the rejection of outliers. In: Proceedings of the fourth berkeley symposium on mathematical statistics and probability, vol 1, pp 377–381

    Google Scholar 

  2. Anscombe FJ, Guttman I (1960) Rejection of outliers. Technometrics 2(2):123–147

    Article  Google Scholar 

  3. Gumbel EJ (1960) Discussion of the Papers by Messrs. Anscombe and Daniel. Technometrics 2(2):165–166

    Google Scholar 

  4. Kruskal WH (1960) Some remarks on wild observations. Technometrics 2(1):1–3

    Article  Google Scholar 

  5. Hampel F (1971) A general qualitative definition of robustness. Ann Math Stat 42(6):1887–1896

    Article  Google Scholar 

  6. Barnett V (1978) The study of outliers: purpose and model. Appl Stat 27(3):242–250

    Article  Google Scholar 

  7. Bain JL, Engelhardt M (1992) Introduction to probability and mathematical statistics, 1st edn. Duxbury Press, California

    Google Scholar 

  8. Rousseeuw PJ, Croux C (1993) Alternatives to median absolute deviation. Am Stat Assoc 88(424):1273–1283

    Google Scholar 

  9. Olewuezi NP (2011) Note on the comparison of some outlier labeling techniques. J Math Stat 7(4):353–355

    Article  Google Scholar 

  10. Barbato G, Barini EM, Genta G, Levi R (2011) Features and performance of some outlier detection methods. J Appl Stat 38(10):2133–2149

    Article  Google Scholar 

  11. Leys C, Ley C, Klein O, Bernard P, Licata L (2013) Detecting outliers: do not use standard deviation around the mean, use absolute deviation around the median. J Exp Soc Psychol 49(4), 764–766

    Google Scholar 

  12. Khaleelur Rahman SMA, Mohamed Sathik M, Senthamarai Kannan K (2014) A novel approach for univariate outlier detection. Int J Sci Eng 5(2), 1594–1599

    Google Scholar 

  13. Obikee A, Ebuh G, Obiora-Ilouno H (2014) Comparison of outlier techniques based on simulated data. Open J Stat 4:536–561

    Article  Google Scholar 

  14. Senthamarai Kannan K, Manoj K, Arumugam S (2015) Labeling methods for identifying outliers. Int J Stat Syst 10(2):231–238

    Google Scholar 

  15. Iglewicz B, Hoaglin DC (1993) How to detect and handle outliers, 1st edn. ASQC Quality Press, Milwaukee

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Stephen Raj .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Kannan, K.S., Raj, S.S. (2019). Outlier Labeling Methods for Medical Data. In: Deep, K., Jain, M., Salhi, S. (eds) Logistics, Supply Chain and Financial Predictive Analytics. Asset Analytics. Springer, Singapore. https://doi.org/10.1007/978-981-13-0872-7_6

Download citation

Publish with us

Policies and ethics