Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6277))

Abstract

This work presents a survey of methods that visualize text streams. Existing methods are classified and compared from the aspect of visualization process. We introduce new aspects of method comparison: data type, text representation, and the temporal drawing approach. The subjectivity of visualization is described, and evaluation methodologies are explained. Related research areas are discussed and some future trends in the field anticipated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Scott Owen, G., Domik, G., Rhyne, T.M., Brodlie, K.W., Santos, B.S.: Definitions and rationale for visualization, http://www.siggraph.org/education/materials/HyperVis/visgoals/visgoal2.htm (accessed in February 2010)

  2. Friendly, M., Denis, D.: Milestones in the history of thematic cartography, statistical graphics, and data visualization, vol. 9 (2008)

    Google Scholar 

  3. Thomas, J.J., Cook, K.A.: Illuminating the Path: The Research and Development Agenda for Visual Analytics. National Visualization and Analytics Ctr (2005)

    Google Scholar 

  4. Risch, J., Kao, A., Poteet, S., Wu, Y.: Text Visualization for Visual Text Analytics. In: Simoff, S.J., Böhlen, M.H., Mazeika, A. (eds.) Visual Data Mining. LNCS, vol. 4404, pp. 154–171. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  5. Keim, D.A., Mansmann, F., Thomas, J.: Visual Analytics: How Much Visualization and How Much Analytics? In: ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery - VAKD 2009 (2009)

    Google Scholar 

  6. Tukey, J.W.: Exploratory Data Analysis. Addison-Wesley, Reading (1977)

    MATH  Google Scholar 

  7. Allan, J.: Tracking, Event- based Information Organization. Kluwer Academic Publishers, Dordrecht (2002)

    MATH  Google Scholar 

  8. Luo, D., Yang, J., Fan, J., Ribarsky, W., Luo, H.: EventRiver: Interactive Visual Exploration of Streaming Text. EG/IEEE EuroVis 2009 (2009) (to be published)

    Google Scholar 

  9. Carpendale, S.: Evaluating information visualizations. In: Information Visualization: Human-Centered Issues and Perspectives, pp. 19–45. Springer, Heidelberg (2008)

    Google Scholar 

  10. Tufte, E.R.: Visual Explanations. Graphics Press (1997)

    Google Scholar 

  11. Benson, J., Crist, D., Lafleur, P.: Agent-based visualization of streaming text. In: Proc. IEEE Info. Vis. Conf., Raleigh (2008)

    Google Scholar 

  12. Weskamp, M.: (2004), http://marumushi.com/projects/newsmap (acc. in Apr. 2010)

  13. Albrecht-Buehler, C., Watson, B., Shamma, D.A.: Visualizing live text streams using motion and temporal pooling. IEEE Comp. Graph. App. 25(3), 52–59 (2005)

    Article  Google Scholar 

  14. Mao, Y., Dillon, J., Lebanon, G.: Sequential document visualization. IEEE Transactions on Visualization and Computer Graphics 13(6), 1208–1215 (2007)

    Article  Google Scholar 

  15. Linguistic Data Consortium: The New York Times Annotated Corpus, http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2008T19

  16. Leskovec, J., Backstrom, L., Kleinberg, J.M.: Meme-tracking and the dynamics of the news cycle. In: Proc. of the 15th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 497–506 (2009)

    Google Scholar 

  17. Wise, J.A., Thomas, J.J., Pennock, K., Lantrip, D., Pottier, M., Schur, A., Crow, V.: Visualizing the non-visual: Spatial analysis and interaction with information from text documents. In: Proc. IEEE Symp. Info. Vis., pp. 51–58 (1995)

    Google Scholar 

  18. Miller, N.E., Wong, P.C., Brewster, M., Foote, H.: TOPIC ISLANDS - a wavelet-based text visualization system. In: Proc. 9th IEEE Conf. on Vis., pp. 189–196 (1998)

    Google Scholar 

  19. Berendt, B., Subasic, I.: STORIES in time: A graph-based interface for news tracking and discovery. In: Web Intel./IAT Workshops, pp. 531–534. IEEE, Los Alamitos (2009)

    Google Scholar 

  20. Kontostathis, A., Galitsky, L., Pottenger, W.M., Roy, S., Phelps, D.J.: A Survey of Emerging Trend Detection in Textual Data Mining (2003)

    Google Scholar 

  21. Yang, Y., Akers, L., Klose, T., Yang, C.B.: Text mining and visualization tools - impressions of emerging capabilities. World Patent Info. 30(4), 280–293 (2008)

    Article  Google Scholar 

  22. Salton, G., Wong, A., Yang, A.C.S.: A vector space model for automatic indexing. Communications of the ACM 18, 229–237 (1975)

    Article  Google Scholar 

  23. Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)

    Article  Google Scholar 

  24. Mei, Q., Zhai, C.: Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In: Grossman, R., Bayardo, R.J., Bennett, K.P. (eds.) Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, August 21-24, pp. 198–207. ACM, New York (2005)

    Chapter  Google Scholar 

  25. Moens, M.F.: Information Extraction, Algorithms and Prospects in a Retrieval Context. Springer, Heidelberg (2006)

    MATH  Google Scholar 

  26. Gregory, M., Chinchor, N., Whitney, P.: User-directed sentiment analysis: Visualizing the affective content of documents. In: Proc. of the Workshop on Sentiment and Subjectivity in Text. Association for Computational Linguistics (2006)

    Google Scholar 

  27. Sammon, J.: A nonlinear mapping for data structure analysis. IEEE Transactions on Computing 5(18), 401–409 (1969)

    Article  Google Scholar 

  28. Lin, X., Soergel, D., Marchionini, G.: A Self-organizing semantic map for information retrieval. In: Proc. 14th. Ann. Int. ACM SIGIR Conf. on R&D In Information Retrieval, pp. 262–269 (1991)

    Google Scholar 

  29. Chalmers, M., Chitson, P.: Bead: Explorations in information visualization. In: Proc. of the 15th ACM SIGIR Conf. on R&D in Information Retrieval (1992)

    Google Scholar 

  30. Rennison, E.: Galaxy of news: An approach to visualizing and understanding expansive news landscapes. In: ACM User Interface Soft. and Tech., pp. 3–12 (1994)

    Google Scholar 

  31. Davidson, G.S., Hendrickson, B., Johnson, D.K., Meyers, C.E., Wylie, B.N.: Knowledge mining with VxInsight: Discovery through interaction. J. Intell. Inf. Syst. 11(3), 259–285 (1998)

    Article  Google Scholar 

  32. Kaski, S., Lagus, K., Kohonen, T.: WEBSOM - Self-organizing maps of document collections. Neurocomputing 21, 101–117 (1998)

    Article  MATH  Google Scholar 

  33. Risch, J., Rex, D., Dowson, S., Walters, T., May, R., Moon, B.: The starlight information visualization system. In: Proc. IEEE Conf. Info. Vis., pp. 42–49 (1997)

    Google Scholar 

  34. Havre, S., Hetzler, E.G., Nowell, L.T.: Themeriver: Visualizing theme changes over time. In: Proc. IEEE Conf. Info. Vis., pp. 115–124 (2000)

    Google Scholar 

  35. Kabán, A., Girolami, M.: A dynamic probabilistic model to visualise topic evolution in text streams. J. Intelligent Information Systems 18(2-3), 107–125 (2002)

    Article  Google Scholar 

  36. Andrews, K., Kienreich, W., Sabol, V., Becker, J., Droschl, G., Kappe, F., Granitzer, M., Auer, P., Tochtermann, K.: The infosky visual explorer: exploiting hierarchical structure and document similarities. Info. Vis. 1(3-4), 166–181 (2002)

    Article  Google Scholar 

  37. Wong, P.C., Foote, H., Adams, D., Cowley, W., Thomas, J.: Dynamic visualization of transient data streams. In: Proc. IEEE Symp. Info. Vis. (2003)

    Google Scholar 

  38. Fortuna, B., Grobelnik, M., Mladenic, D.: Visualization of text document corpus. Informatica (Slovenia) 29(4), 497–504 (2005)

    Google Scholar 

  39. Paulovich, F.V., Minghim, R.: Text map explorer: a tool to create and explore document maps. In: IV, pp. 245–251. IEEE Computer Society, Los Alamitos (2006)

    Google Scholar 

  40. Don, A., Zheleva, E., Gregory, M., Tarkan, S., Auvil, L., Clement, T., Shneiderman, B., Plaisant, C.: Discovering interesting usage patterns in text collections: integrating text mining with visualization. In: Proc. 16th ACM Conf. Information and Knowledge Management, CIKM, pp. 213–222 (2007)

    Google Scholar 

  41. Ghoniem, M., Luo, D., Yang, J., Ribarsky, W.: NewsLab: Exploratory Broadcast News Video Analysis. In: IEEE Symp. Vis. Analytics Sci. and Tech., pp. 123–130 (2007)

    Google Scholar 

  42. Paulovich, F.V., de Oliveira, M.C.F., Minghim, R.: The projection explorer: A flexible tool for projection-based multidimensional visualization. In: Proc. 20th Brazilian Symp. Comp. Graph. and Image Processing (SIBGRAPI), pp. 27–36 (2007)

    Google Scholar 

  43. Alencar, A.B., de Oliveira, M.C.F., Paulovich, F.V., Minghim, R., Andrade, M.G.: Temporal-PEx: Similarity-based visualization of time series. In: Proc. 20th Brazilian Symp. Comp. Graph. and Image Processing, SIBGRAPI (2007)

    Google Scholar 

  44. Ishikawa, Y., Hasegawa, M.: T-Scroll: Visualizing trends in a time-series of documents for interactive user exploration. In: Kovács, L., Fuhr, N., Meghini, C. (eds.) ECDL 2007. LNCS, vol. 4675, pp. 235–246. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  45. Terachi, M., Saga, R., Sheng, Z., Tsuji, H.: Visualized technique for trend analysis of news articles. In: Nguyen, N.T., Borzemski, L., Grzech, A., Ali, M. (eds.) IEA/AIE 2008. LNCS (LNAI), vol. 5027, pp. 659–668. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  46. Petrović, S., Dalbelo Bašić, B., Morin, A., Zupan, B., Chauchat, J.H.: Textual features for corpus visualization using correspondence analysis. Intell. Data Anal. 13(5), 795–813 (2009)

    Google Scholar 

  47. Strobelt, H., Oelke, D., Rohrdantz, C., Stoffel, A., Keim, D.A., Deussen, O.: Document cards: A top trumps visualization for documents. IEEE Trans. Vis. Comput. Graph 15(6), 1145–1152 (2009)

    Article  Google Scholar 

  48. Prabowo, R., Thelwall, M., Alexandrov, M.: Generating overview timelines for major events in an RSS corpus. J. Informetrics 1(2), 131–144 (2007)

    Article  Google Scholar 

  49. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. American Society for Info. Science 41 (1990)

    Google Scholar 

  50. Jackson, J.E.: A User’s Guide to Principal Components. John Willey, New York (1991)

    Book  MATH  Google Scholar 

  51. Greenacre, M.J.: Correspondence analysis in practice. Chapman and Hall, Boca Raton (2007)

    Book  MATH  Google Scholar 

  52. Kruskal, J.B., Wish, M.: Multidimensional Scaling. Sage Publications, CA (1978)

    Google Scholar 

  53. York, J., Bohn, S., Pennock, K., Lantrip, D.: Clustering and dimensionality reduction in spire. In: AIPA Steering Group. Proc. Symp. Advanced Intelligence Processing and Analysis. Office of R&D, Washington (1995)

    Google Scholar 

  54. Paulovich, F.V., Nonato, L.G., Minghim, R., Levkowitz, H.: Least square projection: A fast high-precision multidimensional projection technique and its application to document mapping. IEEE T. Vis. Comp. Graph. 14(3), 564–575 (2008)

    Article  Google Scholar 

  55. Kohonen, T.: Self-Organizing Maps. Springer, Berlin (1995)

    Google Scholar 

  56. Fruchterman, T.M.J., Reingold, E.M.: Graph drawing by force-directed placement. Software: Practice and Experience 21(11), 1129–1164 (1991)

    Article  Google Scholar 

  57. Minghim, R., Paulovich, F.V., Lopes, A.A.: Content-based text mapping using multidimensional projections for exploration of document collections. In: IS&T/SPIE Symp. on Elect. Imag. - Vis. and Data Anal., San Jose (2006)

    Google Scholar 

  58. Shneiderman, B.: Treemaps for space-constrained visualization of hierarchies, http://www.cs.umd.edu/hcil/treemap-history/index.shtml (accessed in April 2010)

  59. Berndt, D.J., Clifford, J.: Using dynamic time warping to find patterns in time series. In: AAAI Workshop on Knowledge Discovery in Databases, pp. 359–370 (1994)

    Google Scholar 

  60. Keogh, E.J., Lonardi, S., Ratanamahatana, C.A., Wei, L., Lee, S.H., Handley, J.: Compression-based data mining of sequential data. Data Min. Knowl. Discov. 14(1), 99–129 (2007)

    Article  MathSciNet  Google Scholar 

  61. Hearst, M.: User Interfaces and Visualization. Addison-Wesley Longman, Amsterdam (1999)

    Google Scholar 

  62. Eler, D.M., Paulovich, F.V., de Oliveira, M.C.F., Minghim, R.: Coordinated and multiple views for visualizing text collections. In: IEEE 12th Conf. Info. Vis., pp. 246–251 (2008)

    Google Scholar 

  63. van Wijk, J.J.: Views on visualization. IEEE T. Vis. Comp. Graph. 12(4) (2006)

    Google Scholar 

  64. McGrath, J.E.: Methodology matters: doing research in the behavioral and social sciences. Morgan Kaufmann Publishers Inc., San Francisco (1995)

    Google Scholar 

  65. Microsoft: Pivot project, http://www.getpivot.com/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Šilić, A., Bašić, B.D. (2010). Visualization of Text Streams: A Survey. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science(), vol 6277. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15390-7_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15390-7_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15389-1

  • Online ISBN: 978-3-642-15390-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics