Advances in Predictive Data Mining Methods

Hong, Se June; Weiss, Sholom M.

doi:10.1007/3-540-48097-8_2

Se June Hong³ &
Sholom M. Weiss³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1715))

Included in the following conference series:

International Workshop on Machine Learning and Data Mining in Pattern Recognition

728 Accesses
4 Citations

Abstract

Predictive models have been widely used long before the development of the new field that we call data mining. Expanding application demand for data mining of ever increasing data warehouses, and the need for understandability of predictive models with increased accuracy of prediction, all have fueled recent advances in automated predictive methods. We first examine a few successful application areas and technical challenges they present. We discuss some theoretical developments in PAC learning and statistical learning theory leading to the emergence of support vector machines. We then examine some technical advances made in enhancing the performance of the models both in accuracy (boosting, bagging, stacking) and scalability of modeling through distributed model generation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Gallagher C., “Risk Classication Aided by New Software Tool (CHAID Chi Squared Automatic Interaction Detector”, National Underwriter Property & Casualty Risk and Benets Management, Vol. 17, No. 19, April 1992.
Google Scholar
Breiman L., Friedman J.H., Olshen R.A. & Stone C.J., Classication and Regression Trees, Wadsworth International Group, 1984.
Google Scholar
Quinlan J.R., C4.5 programs for machine learning, Morgan Kaufmann, 1993.
Google Scholar
Shafer J., Agrawal R, Mehta M., “SPRINT: A Scalable Parallel Classier for data Mining”, Procc. of the 22nd ICVLDB, pp. 544–555, 1996.
Google Scholar
Apte C., Grossman E., Pednault E., Rosen B., Tipu F., White B, “Insurance Risk Modeling Using Data Mining Technology”, Tech. Report RC-21314, IBMResearch Division, 1998. To appear in Proc. of PADD99.
Google Scholar
Stolfo S.J., Prodromidis A., Tselepis S., Lee W., Fan W. & Chan P., “JAM: Java Agents for Meta-Learning over Distributed Databases”, Proc. of KDDM97, pp. 74–81, 1997.
Google Scholar
Hayes P.J. & Weinstein S.,“Adding Value to Financial News by Computer”, Proc. of the First International Conference on Artificial Intelligence Applications on Wall Street, pp. 2–8, 1991.
Google Scholar
Hayes P.J., Andersen P.M., Nirenburg I.B., & Schmandt L.M., “TCS: A Shell for Content-Based Text Categorization”, Proc. of the Sixth IEEE CAIA, pp. 320–326, 1990.
Google Scholar
Weiss S. & Indurkhya N., Predictive Data Mining: A Practical guide,Morgan Kaufmann, 1998.
Google Scholar
Hosking J.R.M., Pednault E.P.D. & Sudan M., “A Statistical Perspective on Data Mining”, Future Generation Computer Systems: Special issue on Data Mining, Vol. 3, Nos. 2-3, pp. 117–134., 1997.
Article Google Scholar
Vapnik V.N., Statistical Learning Theory, Wiley, 1998
Google Scholar
Breiman L., “Bagging Predictors”,Machine Learning, Vol. 24, pp.123–140, 1996.
Google Scholar
Freund Y. & Schapire R., “Experiments with a New Boosting Algorithm”, Proc. of the International Machine Learning Conference, Morgan Kaufmann, pp. 148–156, 1996.
Google Scholar
Wolpert D., “Stacked Generalization”,Neural Networks, Vol. 5, No. 2, pp. 241–260, 1992.
Article Google Scholar
Dietterich, T.D., “Machine learning Research: Four Current Directions”, AI Magazine, Vol. 18, No. 4, pp. 97–136, 1997.
Google Scholar
Domingos P. & Pazzani M., “on the Optimality of the Simple Bayesian Classifier under Zero-One Loss”, Machine Learning, Vol. 29, pp. 103–130, 1997.
Article Google Scholar

Download references

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY, 10598, USA
Se June Hong & Sholom M. Weiss

Authors

Se June Hong
View author publications
You can also search for this author in PubMed Google Scholar
Sholom M. Weiss
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut für Bildverarbeitung und angewandte Informatik, Arno-Nitzsche-Str. 45, D-04277, Leipzig, Germany
Petra Perner
School of Electronic Engineering, Information Technology and Mathematics, University of Surrey, Guilford, GU2 5XH, UK
Maria Petrou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hong, S.J., Weiss, S.M. (1999). Advances in Predictive Data Mining Methods. In: Perner, P., Petrou, M. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 1999. Lecture Notes in Computer Science(), vol 1715. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48097-8_2

Download citation

DOI: https://doi.org/10.1007/3-540-48097-8_2
Published: 24 March 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66599-1
Online ISBN: 978-3-540-48097-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics