Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Text Stream Processing

  • Jeong-Hyon HwangEmail author
  • Alan G. Labouseur
  • Paul W. Olsen
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_80751


Document stream processing


A text stream is a continuously generated series of comments or small text documents. Each comment or text document may be associated with a time stamp indicating when it was produced or received by a certain device or system. Text stream processing refers to real-time extraction of desired information from text streams (through categorizing and clustering documents in text streams, detecting and tracking topics, matching patterns, and discovering events). Streaming text media (e.g., Twitter, WeChat, Facebook, news feeds, etc.) have fresher content with richer attributes and tend to have broader coverage compared to traditional electronic media (e.g., forums, blogs, and web sites). These advantages make them ripe for use in many engaging, innovative, and empowering applications (see Key Applications, below). In contrast to offline text mining, which analyzes a static collection of text documents (see  “Text Mining”), text stream...

This is a preview of subscription content, log in to check access.


  1. 1.
    Banerjee A, Basu S. Topic models over text streams: a study of batch and online unsupervised learning. In: Proceedings of the 7th SIAM International Conference on Data Mining; 2007. p. 437–42.Google Scholar
  2. 2.
    Bhide M, Chakaravarthy VT, Ramamritham K, Roy P. Keyword search over dynamic categorized information. In: Proceedings of the IEEE International Conference on Data Engineering; 2009. p. 258–69.Google Scholar
  3. 3.
    Chen C, Li F, Ooi BC, Wu S. TI: an efficient indexing mechanism for real-time search on tweets. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2011. p. 649–60.Google Scholar
  4. 4.
    Elkhalifa L, Adaikkalavan R, Chakravarthy S. InfoFilter: a system for expressive pattern specification and detection over text streams. In: Proceedings of the 2005 ACM Symposium on Applied Computing; 2005. p. 1084–8.Google Scholar
  5. 5.
    Fung GPC, Yu JX, Yu PS, Lu H. Parameter free bursty events detection in text streams. In: Proceedings of the 31st International Conference on Very Large Data Bases; 2005. p. 181–92.Google Scholar
  6. 6.
    He Q, Chang K, Lim E-P, Zhang J. Bursty feature representation for clustering text streams. In: Proceedings of the 7th SIAM International Conference on Data Mining; 2007. p. 491–6.Google Scholar
  7. 7.
    Kleinberg J. Bursty and hierarchical structure in streams. Data Min Knowl Disc. 2003;7(4):373–97.MathSciNetCrossRefGoogle Scholar
  8. 8.
    Li R, Wang S, Chang KC-C. Towards social data platform: automatic topic-focused monitor for twitter stream. Proc VLDB Endow. 2013;6(14):1966–77.CrossRefGoogle Scholar
  9. 9.
    Mei Q, Zhai C. Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2005. p. 198–207.Google Scholar
  10. 10.
    Mouratidis K, Pang H. An incremental threshold method for continuous text search queries. In: Proceedings of the 25th International Conference on Data Engineering; 2009. p. 1187–90.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Jeong-Hyon Hwang
    • 1
    Email author
  • Alan G. Labouseur
    • 2
  • Paul W. Olsen
    • 3
  1. 1.Department of Computer ScienceUniversity at Albany – State University of New YorkAlbanyUSA
  2. 2.School of Computer Science and MathematicsMarist CollegePoughkeepsieUSA
  3. 3.Department of Computer ScienceThe College of Saint RoseAlbanyUSA