Efficient Search of Reliable Exceptions

Liu, Huan; Lu, Hongjun; Feng, Ling; Hussain, Farhad

doi:10.1007/3-540-48912-6_27

Conference paper
First Online: 01 January 2002

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1574)

Abstract

Finding patterns in data sets is a fundamental task of data mining. If we categorize all patterns into strong, weak, and random, conventional data mining techniques are designed only to find strong patterns, which hold for numerous objects and are usually consistent with the expectations of experts. While such strong patterns are helpful in prediction, the unexpectedness and contradiction exhibited by weak patterns are also very useful, although they represent a relatively small number of objects. In this paper, we address the problem of finding weak patterns (i.e., reliable exceptions) in databases. We propose a simple and efficient approach that uses deviation analysis to identify interesting exceptions and explore reliable ones. The approach is also flexible in handling both subjective and objective exceptions. We demonstrate its effectiveness on a set of real-life data sets and present interesting findings.
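
The abstract does not spell out the procedure, so the following is only a minimal sketch of one common form of deviation analysis, not the authors' algorithm. It assumes a single categorical attribute paired with a class label, estimates the count expected for each (value, label) cell if the two were independent, and reports cells that occur far less often than expected yet still occur. The function names, the threshold `min_drop`, and the toy records are all hypothetical.

```python
from collections import Counter


def deviation_table(pairs):
    """Map each (value, label) cell to (observed, expected, deviation).

    expected  = row_total * column_total / n   (independence assumption)
    deviation = (observed - expected) / expected
    """
    pairs = list(pairs)
    n = len(pairs)
    observed = Counter(pairs)
    value_totals = Counter(value for value, _ in pairs)
    label_totals = Counter(label for _, label in pairs)

    table = {}
    for value, value_total in value_totals.items():
        for label, label_total in label_totals.items():
            expected = value_total * label_total / n
            obs = observed[(value, label)]
            table[(value, label)] = (obs, expected, (obs - expected) / expected)
    return table


def candidate_exceptions(pairs, min_drop=0.5):
    """Return cells that are present but much rarer than expected.

    A cell qualifies when it has at least one object and its deviation is
    at or below -min_drop. This criterion is an illustrative assumption.
    Results are sorted from the most negative deviation upward.
    """
    table = deviation_table(pairs)
    hits = [
        (cell, dev)
        for cell, (obs, _expected, dev) in table.items()
        if obs > 0 and dev <= -min_drop
    ]
    hits.sort(key=lambda item: item[1])
    return [cell for cell, _dev in hits]


# Hypothetical toy data: (has_job, credit_rating) for 100 applicants.
records = (
    [("no", "bad")] * 40
    + [("no", "good")] * 10
    + [("yes", "good")] * 45
    + [("yes", "bad")] * 5
)

if __name__ == "__main__":
    for cell in candidate_exceptions(records):
        obs, expected, dev = deviation_table(records)[cell]
        print(cell, obs, expected, round(dev, 3))
```

In this toy data the strong patterns are "no job, bad credit" and "job, good credit"; the cells the sketch reports are the small groups that contradict them. Checking whether such a group is reliable rather than noise would be a separate step that this sketch does not attempt.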

on leave from School of Computing, National University of Singapore.

Author information

Authors and Affiliations

School of Computing, National University of Singapore, Singapore, 117599
Huan Liu & Farhad Hussain
Department of Computer Science, Hong Kong University of Science and Technology, Hong Kong
Hongjun Lu
Department of Computing, Hong Kong Polytechnic University, Hong Kong
Ling Feng

Editor information

Editors and Affiliations

Department of Computer Science and Systems Engineering, Yamaguchi University, Tokiwa-Dai, 2557, Ube, 755, Japan
Ning Zhong
Department of Computer Science and Technology, Tsinghua University, Beijing, China
Lizhu Zhou

About this paper

Cite this paper

Liu, H., Lu, H., Feng, L., Hussain, F. (1999). Efficient Search of Reliable Exceptions. In: Zhong, N., Zhou, L. (eds) Methodologies for Knowledge Discovery and Data Mining. PAKDD 1999. Lecture Notes in Computer Science, vol 1574. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48912-6_27

DOI: https://doi.org/10.1007/3-540-48912-6_27
Published: 24 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65866-5
Online ISBN: 978-3-540-48912-2
eBook Packages: Springer Book Archive