The Representation Race — Preprocessing for Handling Time Phenomena

Morik, Katharina

doi:10.1007/3-540-45164-1_2

Katharina Morik⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1810))

Included in the following conference series:

European Conference on Machine Learning

1497 Accesses
12 Citations

Abstract

Designing the representation languages for the input, L _E, and output, L _H, of a learning algorithm is the hardest task within machine learning applications. This paper emphasizes the importance of constructing an appropriate representation L _E for knowledge discovery applications using the example of time related phenomena. Given the same raw data — most frequently a database with time-stamped data — rather different representations have to be produced for the learning methods that handle time. In this paper, a set of learning tasks dealing with time is given together with the input required by learning methods which solve the tasks. Transformations from raw data to the desired representation are illustrated by three case studies.

Download to read the full chapter text

Chapter PDF

Some Machine Learning Approaches to the Analysis of Temporal Data

On the Role of Time in Learning

Time in Data Models

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proceedings of the ACM SIGMOD Conference on Management of Data, Washington, D. C., may 1993. 11
Google Scholar
Rakesh Agrawal, Heikki Mannila, Ramakrishnan Srikant, Hannu Toivonen, and A. Inkeri Verkamo. Fast discovery of association rules. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining, chapter 12, pages 307–328. AAAI Press/The MIT Press, Cambridge Massachusetts, London England, 1996. 11
Google Scholar
Rakesh Agrawal and Ramakrishnan Srikant. Mining sequential patterns. In International Conference on Data Engineering, Taipei, Taiwan, mar 1995. 11
Google Scholar
J. F. Allen. Towards a general theory of action and time. Artificial Intelligence, 23:123–154, 1984. 8
Article MATH Google Scholar
James F. Allen. Maintaining knowledge about temporal intervals. In R. J. Brachman and H. J. Levesque, editors, Readings in Knowledge Representation, chapter VII, pages 509–523. Morgan Kaufman, Los Altos, CA, 1985. 8
Google Scholar
M. Bauer, U. Gather, and M. Imhoff. Analysis of high dimensional data from intensive care medicine. Technical Report 13/1998, Sonderforschungsbereich 475, Universität Dortmund, 1998. 10, 15
Google Scholar
Francesco Bergadano and Daniele Gunetti. Inductive logic programming:from machine learning to software engineering. The MIT Press, Cambridge, Mass., 1996. 4
Google Scholar
Pvael Brazdil. Data transformation and model selection by experimentation and meta-learning. In C. Giraud-Carrier and M. Hilario, editors, Workshop Notes — Upgrading Learning to the Meta-Level: Model Selection and Data Transformation, number CSR-98-02 in Technical Report, pages 11–17. Technical University Chemnitz, April 1998. 6
Google Scholar
Gautam Das, King-Ip Lin, Heikki Mannila, Gopal Renganathan, and Padhraic Smyth. Rule Discovery from Time Series. In Rakesh Agrawal, Paul E. Stolorz, and Gregory Piatetsky-Shapiro, editors, Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98), pages 16–22, Ney York City, 1998. AAAI Press. 10, 11, 14, 16
Google Scholar
Luc Dehaspe and Hannu Toivonen. Discovery of Frequent DATALOG Patterns. Data Mining and Knowledge Discovery, 3(1):7–36, 1999. 11
Article Google Scholar
Luc DeRaedt. Interactive Theory Revision: an Inductive Logic Programming Approach. Acad. Press, London [u.a.], 1992. 4
Google Scholar
G. Dong and S. Ginsburg. On the decomposition of chain datalog programs into p (left-)linear l-rule components. Logic Programming, 23:203–236, 1995. 11
Article MATH MathSciNet Google Scholar
Robert Engels. Planning tasks for knowledge discovery in databases; performing task-oriented user-guidance. In Proc. of th 2nd Int. Conf. on Knowledge Discovery in Databases, aug 1996. 6
Google Scholar
Robert Engels, Guido Lindner, and Rudi Studer. A guided tour through the data mining jungle. In Proceedings of the 3nd International Conference on Knowledge Discovery in Databases (KDD-97), August 14–17 1997. 6
Google Scholar
Gerald Gazdar and Chris Mellish. Natural Language Processing in PROLOG. Addison Wesley, Workingham u.a., 1989. 7
Google Scholar
Valery Guralnik and Jaideep Srivastava. Event detection from time series data. In Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 33–42, San Diego, USA, 1999. 11
Google Scholar
T. Joachims. Making large-scale SVM learning practical. In B. Schölkopf, C. Burges, and A. Smola, editors, Advances in Kernel Methods-Support Vector Learning, chapter 11. MIT-Press, 1999. 15
Google Scholar
J.-U. Kietz and S. Wrobel. Controlling the complexity of learning in logic through syntactic and task-oriented models. In Stephen Muggleton, editor, Inductive Logic Programming., number 38 in The A.P.I.C. Series, chapter 16, pages 335–360. Academic Press, London [u.a.], 1992. 4
Google Scholar
Jörg-Uwe Kietz and Marcus Lübbe. An efficient subsumption algorithm for inductive logic programming. In W. Cohen and H. Hirsh, editors, Proceedings of the 11th International Conference on Machine Learning IML-94, San Francisco, CA, 1994. Morgan Kaufmann. 5
Google Scholar
Volker Klingspor. Reaktives Planen mit gelernten Begriffen. PhD thesis, Univ. Dortmund, 1998. 16
Google Scholar
Volker Klingspor and Katharina Morik. Learning understandable concepts for robot navigation. In K. Morik, V. Klingspor, and M. Kaiser, editors, Making Robots Smarter — Combining Sensing and Action through Robot Learning. Kluwer, 1999. 15
Google Scholar
S. Kramer, B. Pfahringer, and C. Helma. Stochastic propositionalization of nondeterminate background knowledge. In D. Page, editor, Procs. 8th International Workshop on Inductive Logic Programming, pages 80–94. Springer, 1998. 4
Google Scholar
Nada Lavrač and Sašo Džeroski. Inductive Logic Programming — Techniques and Applications. Number 148 in Artificial Intelligence. Ellis Horwood, Hertfortshire, 1994. 4
MATH Google Scholar
H. Liu and H. Motoda. Feature Extraction, Construction, and Selection: A Data Mining Perspective. Kluwer, 1998. 5
Google Scholar
R.S. Michalski and T.G. Dietterich. Learning to predict sequences. In R.S. Michalski, J.G. Carbonell, and T.M. Mitchell, editors, Machine Learning-An Artificial Intelligence Approach Vol II, pages 63–106. Tioga Publishing Company, Los Altos, 1986. 8
Google Scholar
Ryszard Michalski. Inferential learning theory as a basis for multistrategy task-adaptive learning. In Michalski and Tecuci, editors, Multistrategy Learning. George Mason University, USA, 1991. 5
Google Scholar
D. Michie, D. J. Spiegelhalter, and C. C. Taylor. Machine Learning, Neural and Statistical Classification. Ellis Horwood, New York u.a., 1994. 6
MATH Google Scholar
Katharina Morik. Tailoring representations to different requirements. In Osamu Watanabe and Takashi Yokomori, editors, Algorithmic Learning Theory — Procs. 10th Int. Conf. ALT99, Lecture Notes in Artificial Intelligence, pages 1–12. Springer, 1999. 15
Google Scholar
Katharina Morik, Peter Brockhausen, and Thorsten Joachims. Combining statistical learning with a knowledge-based approach — A case study in intensive care monitoring. In Proc. 16th Int’l Conf. on Machine Learning (ICML-99), Bled, Slowenien, 1999. 15
Google Scholar
Katharina Morik and Stephanie Wessel. Incremental signal to symbol processing. In K. Morik, M. Kaiser, and V. Klingspor, editors, Making Robots Smarter — Combining Sensing and Action through Robot Learning, chapter 11, pages 185–198. Kluwer Academic Publ., 1999. 9, 14, 16
Google Scholar
S. Muggleton, A. Srinivasan, R. King, and M. Sternberg. Biochemical knowledge discovery using inductive logic programming. In Hiroshi Motoda, editor, Procs. First International Conference on Discovery Science. Springer, 1998. 8
Google Scholar
Dorian Pyle. Data Preparation for Data Mining. Morgan Kaufmann Publishers, 1999. 9
Google Scholar
Anke D. Rieger. Program Optimization for Temporal Reasoning within a Logic Programming Framework. PhD thesis, Universität Dortmund, Germany, Dortmund, FRG, 1998. 11
Google Scholar
D. Sleeman, R. Oehlman, and R. Davidge. Specification of Consultant-0 and a Comparision of Several Learning Algorithms. Deliverable D5.1, Esprit Project P2154, 1989. 6
Google Scholar
Devika Subramanian. A theory of justified reformulations. In D. Benjamin, Paul, editor, Change of Representation and Inductive Bias, pages 147–167. Kluwer, 1990. 4
Google Scholar
C. Theusinger and G. Lindner. Benutzerunterstützung eines KDD-Prozesses anhand von Datencharackteristiken. In F. Wysotzki, P. Geibel, and K. Schädler, editors, Beiträge zum Treffen der GI-Fachgruppe 1.1.3 Machinelles Lernen (FGML-98), volume 98/11 of Technical Report. Technical University Berlin, 1998. 6
Google Scholar
Vladimir N. Vapnik. The Nature of Statistical Learning Theory. Springer, New York, 1995. 4, 13
MATH Google Scholar
D.H. Wolpert and W.G. Macready. No free lunch theorems for search. Technical Report SFI-TR-95-02-010, Santa Fé Institute, Santa Fé, CA., 1995. 4
Google Scholar
Stefan Wrobel. Concept Formation and Knowledge Revision. Kluwer Academic Publishers, Dordrecht, 1994. 4
MATH Google Scholar
Wei Zhang. A region-based approach to discovering temporal structures in data. In Ivan Bratko and Saso Dzeroski, editors, Proc. of 16th Int. Conf. on Machine Learning, pages 484–492. Morgan Kaufmann, 1999. 12
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science VIII, Univ. Dortmund, D-44221, Dortmund, Germany
Katharina Morik

Authors

Katharina Morik
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut d’Investigació en Intelligència Artificial, IIIA, Spanish Council for Scientific Research, CSIC, Campus, U.A.B., 08193, Bellaterra, Catalonia, Spain
Ramon López de Mántaras & Enric Plaza &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Morik, K. (2000). The Representation Race — Preprocessing for Handling Time Phenomena. In: López de Mántaras, R., Plaza, E. (eds) Machine Learning: ECML 2000. ECML 2000. Lecture Notes in Computer Science(), vol 1810. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45164-1_2

Download citation

DOI: https://doi.org/10.1007/3-540-45164-1_2
Published: 14 January 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67602-7
Online ISBN: 978-3-540-45164-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

The Representation Race — Preprocessing for Handling Time Phenomena

Abstract

Chapter PDF

Similar content being viewed by others

Some Machine Learning Approaches to the Analysis of Temporal Data

On the Role of Time in Learning

Time in Data Models

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

The Representation Race — Preprocessing for Handling Time Phenomena

Abstract

Chapter PDF

Similar content being viewed by others

Some Machine Learning Approaches to the Analysis of Temporal Data

On the Role of Time in Learning

Time in Data Models

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation