Unlocking the Semantics of Roget’s Thesaurus Using Formal Concept Analysis

  • L. John Old
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2961)


Roget’s Thesaurus is a semantic dictionary that is organized by concepts rather than words. It has an elaborate implicit structure that has not, in the 150 years since its inception, been made explicit. Formal Concept Analysis (FCA) is a tool that can be used by researchers for the organization, analysis and visualization of complex hidden structures. In this paper we illustrate two ways in which FCA is being used to explicate the implicit structures in Roget’s Thesaurus: implications and Type-10 chain components.
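As a rough illustration of what FCA computes, the sketch below builds a tiny formal context in which words are the objects and thesaurus senses are the attributes. It then enumerates the formal concepts and tests one attribute implication. The words, sense labels, and function names are hypothetical and are not taken from the paper or from Roget's Thesaurus. The brute-force enumeration is an assumption made for brevity and only suits very small contexts.

```python
from itertools import combinations

# Hypothetical toy context: words (objects) mapped to senses (attributes).
# Illustrative data only; not drawn from Roget's Thesaurus.
CONTEXT = {
    "over": {"covering", "excess"},
    "above": {"covering", "height"},
    "surplus": {"excess"},
    "atop": {"covering", "height"},
}


def common_attributes(objects, context):
    """Derivation operator: attributes shared by every object in `objects`."""
    shared = set().union(*context.values())  # start from all attributes
    for obj in objects:
        shared &= context[obj]
    return shared


def common_objects(attributes, context):
    """Derivation operator: objects that carry every attribute in `attributes`."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}


def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every subset of objects.

    Exponential in the number of objects, so usable only for toy contexts.
    """
    concepts = set()
    objects = sorted(context)
    for size in range(len(objects) + 1):
        for subset in combinations(objects, size):
            intent = common_attributes(subset, context)
            extent = common_objects(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


def implication_holds(premise, conclusion, context):
    """True if every object that has all of `premise` also has all of `conclusion`."""
    return all(conclusion <= attrs
               for attrs in context.values() if premise <= attrs)


if __name__ == "__main__":
    for extent, intent in sorted(formal_concepts(CONTEXT),
                                 key=lambda c: (len(c[0]), sorted(c[0]))):
        print(sorted(extent), sorted(intent))
    print(implication_holds({"height"}, {"covering"}, CONTEXT))
```

In this toy context every word with the sense "height" also has the sense "covering", so the implication height → covering holds, whereas excess → covering fails because "surplus" lacks "covering".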


Word Sense Disambiguation; Formal Concept Analysis; Neighbourhood Lattice; Word Index; Semantic Neighbourhood
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • L. John Old (1)
  1. School of Computing, Napier University
