Advertisement

Text Visualization for Visual Text Analytics

  • John Risch
  • Anne Kao
  • Stephen R. Poteet
  • Y. -J. Jason Wu
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4404)

Abstract

The termvisual text analytics describes a class of information analysis techniques and processes that enable knowledge discovery via the use of interactive graphical representations of textual data. These techniques enable discovery and understanding via the recruitment of human visual pattern recognition and spatial reasoning capabilities. Visual text analytics is a subclass of visual data mining / visual analytics, which more generally encompasses analytical techniques that employ visualization of non-physically-based (or “abstract”) data of all types. Text visualization is a key component in visual text analytics. While the term “text visualization” has been used to describe a variety of methods for visualizing both structured and unstructured characteristics of text-based data, it is most closely associated with techniques for depicting the semantic characteristics of the free-text components of documents in large document collections. In contrast with text clustering techniques which serve only to partition text corpora into sets of related items, these so-called semantic mapping methods also typically strive to depict detailed inter- and intra-set similarity structure. Text analytics software typically couples semantic mapping techniques with additional visualization techniques to enable interactive comparison of semantic structure with other characteristics of the information, such as publication date or citation information. In this way, value can be derived from the material in the form of multidimensional relationship patterns existing among the discrete items in the collection. The ultimate goal of these techniques is to enable human understanding and reasoning about the contents of large and complexly related text collections.

Keywords

Pacific Northwest National Laboratory Text Unit Document Vector Text Collection Text Summarization 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284(5), 34–43 (2001)CrossRefGoogle Scholar
  2. 2.
    Feldman, R., Dagan, I.: KDT - Knowledge discovery in text. In: Proceedings of the First International Conference on Knowledge Discovery (KDD 1995), Montreal (1995)Google Scholar
  3. 3.
    Thomas, J.J., Cook, K.A. (eds.): Illuminating the Path: The Research and Development Agenda for Visual Analytics. IEEE Computer Society Press, Los Alamitos (2005)Google Scholar
  4. 4.
    Eick, S., et al.: WEB-Based Text Visualization. 1997, National Institute of Statistical Sciences: Technical Report 64. p. http://www.niss.org/technicalreports/tr64.pdf
  5. 5.
    Rohrer, R.M., Ebert, D.S., Sibert, J.L.: The shape of Shakespeare: Visualizing text using implicit surfaces. In: Proceedings of the IEEE Symposium on Information Visualization 1998, North Carolina (1998)Google Scholar
  6. 6.
    Miller, N.E., et al.: Topic Islands: A wavelet-based text visualization system. In: Proceedings of the 9th IEEE Conference on Visualization VIS 1998, Research Triangle Park, North Carolina (1998)Google Scholar
  7. 7.
    Lin, X., Soergel, D., Marchionini, D.: A self-organizing semantic map for information retrieval. In: Proceedings of the Fourteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Chicago (1991)Google Scholar
  8. 8.
    Chalmers, M., Chitson, P.: Bead: Explorations in information visualization. In: Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Copenhagen (1992)Google Scholar
  9. 9.
    Wise, J.A., et al.: Visualizing the non-visual: Spatial analysis and interaction with information from text documents. In: Proceedings of the IEEE Information Visualization Symposium InfoVis 1995, Atlanta, Georgia (1995)Google Scholar
  10. 10.
    Kohonen, T., et al.: Self organization of a massive document collection. IEEE Transactions on Neural Networks 11(3), 574–585 (2000)CrossRefGoogle Scholar
  11. 11.
    Doyle, L.: Semantic roadmaps for literature searchers. Journal of the Association for Computing Machinery 8(4), 367–391 (1961)MathSciNetGoogle Scholar
  12. 12.
    Sammon, J.W.: A nonlinear mapping for data structure analysis. IEEE Transactions on Computing 18(5), 401–409 (1969)CrossRefGoogle Scholar
  13. 13.
    Crouch, D.: The visual display of information in an information retrieval environment. In: Proceedings of the 9th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, Pisa (1986)Google Scholar
  14. 14.
    York, J., et al.: Clustering and dimensionality reduction in SPIRE. In: Proceedings of the Symposium on Advanced Information Processing and Analysis, AIPA 1995, Tysons Corner, VA (1995)Google Scholar
  15. 15.
    Salton, G.: Automatic Text Processing. Addison-Wesley, Reading (1989)Google Scholar
  16. 16.
    Wall, M.E., Rechtsteiner, A., Rocha, L.M.: Singular value decomposition and principal component analysis. In: Berrar, D.P., Dubitzky, W., Granzow, M. (eds.) A Practical Approach to Microarray Data Analysis, pp. 91–109. Kluwer, Norwell (2003)CrossRefGoogle Scholar
  17. 17.
    Deerwester, S.C., et al.: Indexing by latent semantic analysis. Journal of American Society for Information Science 41(6), 391–407 (1990)CrossRefGoogle Scholar
  18. 18.
    Booker, A., et al.: Visualizing text data sets. Computing in Science & Engineering 1(4), 26–35 (1999)CrossRefGoogle Scholar
  19. 19.
    Huang, S., Ward, M., Rundensteiner, E.: Exploration of dimensionality reduction for text visualization. Worcester Polytechnic Institute, Computer Science Department: Technical Report TR-03-14, Worcester (2003)Google Scholar
  20. 20.
    Chatfield, C., Collins, A.: Introduction to Multivariate Analysis. Chapman & Hall, London (1980)zbMATHGoogle Scholar
  21. 21.
    Kruskal, J.: Multidimensional scaling by optimizing goodness-of-fit to a nonmetric hypothesis. Psychometrika 29, 1–27 (1964); Reprinted in Key Texts in Multidimensional Scaling, In: Davies, P.M., Coxon, A.P.M.(eds.). Heinemann Educational Books, Exeter, N.H., pp. 59–83 (1982)Google Scholar
  22. 22.
    Risch, J.S., et al.: The STARLIGHT information visualization system. In: Card, S., Mackinlay, J., Shneiderman, B. (eds.) Readings in Information Visualization: Using Vision To Think, pp. 551–560. Morgan Kaufmann, San Francisco (1999)Google Scholar
  23. 23.
    Hartigan, J.A.W., A., M.: Algorithm AS136: A k-Means Clustering Algorithm. Applied Statistics 28, 100–108 (1979)zbMATHCrossRefGoogle Scholar
  24. 24.
    Bertin, J.: Semiology of Graphics: Diagrams, Networks, Maps. University of Wisconsin Press, Madison (1983)Google Scholar
  25. 25.
    Lagus, K., Kaski, S., Kohonen, T.: Mining massive document collections by the WEBSOM method. Information Sciences 163(1-3), 135–156 (2004)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • John Risch
    • 1
  • Anne Kao
    • 2
  • Stephen R. Poteet
    • 2
  • Y. -J. Jason Wu
    • 2
  1. 1.Future Point Systems, Inc. 
  2. 2.Boeing Phantom Works 

Personalised recommendations