Skip to main content

Proposal of the Methodology for Identification of Repetitive Sequences in Big Data

  • Conference paper
  • First Online:
Software Engineering and Algorithms in Intelligent Systems (CSOC2018 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 763))

Included in the following conference series:

  • 845 Accesses

Abstract

The aim of this paper is to propose and describe methodology for identification of repetitive sequences in big data sets. These repetitive sequences can represent for example sequences of failures that emerge in industrial processes. Proposed methodology deals with sequences which are based on time, when the elements of particular sequence emerged. One way to approach such identification is to use so called brute-force scanning, but this approach is very demanding on computational power and computational time for big data sets cases. Our methodology approaches this issue from the side of data mining and data analysis point of view.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tanuška P.: Tézy inauguračnej prednášky, MTF STU (2013)

    Google Scholar 

  2. Friedman, J.H.: Data Mining and Statistics: What’s the Connection? Stanford University, Stanford, CA 94305, 10 November 2016. http://statweb.stanford.edu/~jhf/ftp/dm-stat.pdf

  3. Babcock, B., Datar, M., Motwani, R., O’Callaghan, L.: Maintaining variance and k-medians over data stream windows. In: Proceedings of ACM Symposium on Principles of Database Systems (2003)

    Google Scholar 

  4. Kamath, C.: On the role of data mining techniques in uncertainty quantification. Int. J. Uncertain. Quantification 2(1), 73–94 (2012)

    Article  MathSciNet  Google Scholar 

  5. Nazari, Z., et al.: A new hierarchical clustering algorithm. In: ICIIBMS 2015, Track2: Artificial Intelligence, Robotics, and Human-Computer Interaction, Okinawa, Japan

    Google Scholar 

  6. Alpydin, E.: Introduction to Machine Learning, pp. 143–158. The MIT Press, Cambridge (2010)

    Google Scholar 

Download references

Acknowledgments

This publication was written with financial support of the KEGA agency in the frame of the project 040STU-4/2016 “Modernization of the Automatic Control Hardware course by applying the concept Industry 4.0”.

This publication is the result of implementation of the project: “UNIVERSITY SCIENTIFIC PARK: CAMPUS MTF STU - CAMBO” (ITMS: 26220220179) supported by the Research & Development Operational Program funded by the EFRR.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Martin Nemeth .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nemeth, M., Michalconok, G. (2019). Proposal of the Methodology for Identification of Repetitive Sequences in Big Data. In: Silhavy, R. (eds) Software Engineering and Algorithms in Intelligent Systems. CSOC2018 2018. Advances in Intelligent Systems and Computing, vol 763. Springer, Cham. https://doi.org/10.1007/978-3-319-91186-1_40

Download citation

Publish with us

Policies and ethics