Skip to main content

Improving on Popularity as a Proxy for Generality When Building Tag Hierarchies from Folksonomies

  • Chapter
Social Informatics (SocInfo 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8851))

Included in the following conference series:

  • International Conference on Social Informatics

Abstract

Building taxonomies for Web content manually is costly and timeconsuming. An alternative is to allow users to create folksonomies: collective social classifications. However, folksonomies have inconsistent structures and their use for searching and browsing is limited. Approaches have been proposed for acquiring implicit hierarchical structures from folksonomies, but these approaches suffer from the “generality-popularity” problem, in that they assume that popularity is a proxy for generality (that high level taxonomic terms will occur more often than low level ones). In this paper we test this assumption, and propose an improved approach (based on the Heymann-Benz algorithm) for tackling this problem by direction checking relations against a corpus of text. Our results show that popularity works as a proxy for generality in at most 77 of cases, but that this can be improved to 81% using our approach. This improvement will translate to higher quality tag hierarchy structures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Vander Wal, T.: Folksonomy Coinage and Definition (2007), http://vanderwal.net/folksonomy.html

  2. O’Reilly, T.: What is web 2.0: design patterns and business models for the next generation of software (2005), http://oreilly.com/web2/archive/what-is-web-20.html (June 20, 2013)

  3. Gupta, M., Li, R., Yin, Z., Han, J.: An Overview of Social Tagging and Applications. In: Aggarwal, C. (ed.) Social Network Data Analytics, pp. 447–497. Springer, New York (2011)

    Chapter  Google Scholar 

  4. Strohmaier, M., Helic, D., Benz, D., Körner, C., Kern, R.: Evaluation of Folksonomy Induction Algorithms. ACM Transactions on Intelligent Systems and Technology 3(4), Article 74 (2012)

    Google Scholar 

  5. Mathes, A.: Folksonomies-cooperative classification and communication through shared metadata. Computer Mediated Communication 47(10) (2004), http://adammathes.com/academic/computer-mediated-communication/folksonomies.pdf

  6. Golder, S., Huberman, B.: Usage patterns of collaborative tagging systems. Journal of Information Science 32(2), 198–208 (2006)

    Article  Google Scholar 

  7. Guy, M., Tonkin, E.: Tidying up tags. D-Lib Magazine 12(1) (January 2006), ISSN 1082-9873

    Google Scholar 

  8. Heymann, P., Garcia-Molinay, H.: Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems. InfoLab Technical Report, Stanford (2006)

    Google Scholar 

  9. Solskinnsbakk, G., Gulla, J.: A Hybrid Approach to Constructing Tag Hierarchies. In: International conference on: On the move to meaningful internet systems: Part II, Hersonissos, Crete, Greece, pp. 975–982 (2010)

    Chapter  Google Scholar 

  10. Benz, D., Hotho, A., Stutzer, S.: Semantics made by you and me: Self-emerging ontologies cancapture the diversity of shared knowledge. In: 2nd Web Science Conference (WebSci 2010), Raleigh, NC, USA (2010)

    Google Scholar 

  11. Begelman, G., Keller, P., Smadja, F.: Automated tag clustering: Improving search and exploration in the tag space. In: Collaborative Web Tagging Workshop at WWW 2006, Edinburgh, Scotland, pp.15-33 (2006)

    Google Scholar 

  12. Lin, H., Davis, J.: Computational and crowdsourcing methods for extracting ontological structure from folksonomy. In: 7th Extended Semantic Web Conference (ESWC 2010), Heraklion, Greece, pp.472-477 (2010)

    Google Scholar 

  13. Schmitz, P.: Inducing ontology from flickr tags. In: Collaborative Web Tagging Workshop at WWW 2006, Edinburgh, Scotland (2006)

    Google Scholar 

  14. Angeletou, S., Sabou, M., Specia, L., Motta, E.: Bridging the gap between folksonomies and the semantic web: An experience report. In : 4th European Semantic Web Conference (ESWC 2007), Innsbruck, Austria, pp.30-43 (2007)

    Google Scholar 

  15. Laniado, D., Eynard, D., Colombetti, M.: Using WordNet to turn a folksonomy into a hierarchy of concepts. In: 4th italian semantic web workshop: Semantic web application and perspectives, Bari, Italy, pp. 192–201 (2007)

    Google Scholar 

  16. Morrison, P.: Tagging and searching: Search retrieval effectiveness of folksonomies on the world wide web. Information Processing and Management 44(4), 1562–1579 (2008)

    Article  MathSciNet  Google Scholar 

  17. Berners-Lee, T., Hendler, J., Lassila, O.: The Semantic Web. Scientific American 284(5), 28–37 (2001)

    Article  Google Scholar 

  18. Park, Y., Byrd, R., Boguraev, B.: Towards Ontologies On Demand. In: Workshop on Semantic Web Technologies for Searching and Retrieving Scientific Data (ISWC-03), Florida, USA (2003)

    Google Scholar 

  19. Mika, P.: Ontologies are us: A unified model of social networks and semantics. Web Semantics: Science, Services and Agents on the World Wide Web 5(1), 5–15 (2007)

    Article  Google Scholar 

  20. Kiu, C.-C., Tsui, E.: TaxoFolk: a hybrid taxonomy–folksonomy classification for enhanced knowledge navigation. Knowledge Management Research & Practice 8(1), 24–32 (2010)

    Article  Google Scholar 

  21. Zheng, H., Wu, X., Yu, Y.: Enriching WordNet with Folksonomies. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 1075–1080. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  22. Van Damme, C., Hepp, M., Siorpaes, K.: Folksontology: An integrated approach for turning folksonomies into ontologies. In: ESWC Workshop Bridging the Gap between Semantic Web and Web 2, pp. 57–70. Innsbruck, Austria (2007)

    Google Scholar 

  23. Solskinnsbakk, G., Gulla, J.: Mining tag similarity in folksonomies. In: 3rd international workshop on Search and mining user-generated contents (SMUC 2011), Glasgow, Scotland, pp. 53–60 (2011)

    Google Scholar 

  24. Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: 14th Conference on Computational Linguistics, Morristown, NJ, USA, pp. 539–545 (1992)

    Google Scholar 

  25. Cimiano, P., Hotho, A., Staab, S.: Learning concept hierarchies from text corpora using formal concept analysis. Journal of Artificial Intelligence Research 24(1), 305–339 (2005)

    MATH  Google Scholar 

  26. Harris, Z.: Mathematical structures of language. John Wiley and Son (1968)

    Google Scholar 

  27. Faure, D., Nedellec, C.: A corpus-based conceptual clustering method for verb frames and ontology. In: The LREC Workshop on Adapting Lexical and Corpus Resources to Sublanguages and Applications, pp. 5–12 (1998)

    Google Scholar 

  28. Berland, M., Charniak, E.: Finding parts in very large corpora. In: the 37th Annual Meeting of the Association for Computational Linguistics (ACL), Stroudsburg, PA, USA, pp. 57–64 (1999)

    Google Scholar 

  29. Snow, R., Jurafsky, D., Ng., A.: Learning syntactic patterns for automatic hypernym discovery. In: The Eighteenth Annual Conference on Neural Information Processing Systems (NIPS 2004), Vancouver, Canada, vol. 17 (2004)

    Google Scholar 

  30. Hearst, M.: Automated discovery of wordnet relations. In: Fellbaum, C. (ed.) WordNet: An Electronic Lexical Database, MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  31. Wu, H., Zubair, M., Maly, K.: Harvesting social knowledge from folksonomies. In: 17th Conference on Hypertext and Hypermedia, Odense, Denmark, pp. 111–114 (2006)

    Google Scholar 

  32. Hoser, B., Hotho, A., Jäschke, R., Schmitz, C., Stumme, G.: Semantic network analysis of ontologies. In: Sure, Y., Domingue, J. (eds.) ESWC 2006. LNCS, vol. 4011, pp. 514–529. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  33. Schmitz, C., Hotho, A., Jäschke, R., Stumme, G.: Mining association rules in folksonomies. In: 10th IFCS Conference: Studies in Classification, Data Analysis and Knowledge Organization, Ljubljana, Slovenia, pp. 261–270 (2006)

    Google Scholar 

  34. Sanderson, M., Croft, B.: Deriving concept hierarchies from text. In: 22nd ACM Conference of the Special Interest Group in Information Retrieval, Berkeley, California, USA, pp. 206–213 (1999)

    Google Scholar 

  35. Schwarzkopf, E., Heckmann, D., Dengler, D., Kröner, A.: Mining the structure of tag spaces for user modeling. In: Workshop on Data Mining for User Modeling at the 11th International Conference on User Modeling, Corfu, Griechenland, pp. 63–75 (2007)

    Google Scholar 

  36. Hamasaki, M., Matsuo, Y., Nishimura, T., Takeda, H.: Ontology extraction using social network. In: International Workshop on the Semantic Web for Collaborative Knowledge Acquisition, Hyderabad, India (2007)

    Google Scholar 

  37. Plangprasopchok, A., Lerman, K., Getoor, L.: From saplings to a tree: Integrating structured metadata via relational affinity propagation. In: Proceedings of the AAAI Workshop on Statistical Relational AI, Menlo Park, CA, USA (2010)

    Google Scholar 

  38. Frey, B., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)

    Article  MathSciNet  Google Scholar 

  39. Angeletou, S., Sabou, M., Motta, E.: Semantically Enriching Folksonomies with FLOR. In: 1st International Workshop on Collective Semantics: Collective Intelligence & the Semantic Web (CISWeb 2008), Tenerife, Spain (2008)

    Google Scholar 

  40. Cantador, I., Szomszor, M., Alani, H., Fernández, M., Castells, P.: Enriching ontological user profiles with tagging history for multi-domain recommendations. In: 1st International Workshop on Collective Semantics: Collective Intelligence & the Semantic Web (CISWeb 2008), Tenerife, Spain (2008)

    Google Scholar 

  41. Tesconi, M., Ronzano, F., Marchetti, A., Minutoli, S.: Semantify del.icio.us: Automatically Turn your Tags into Senses. In: Social Data on the Web Workshop at the 7th International Semantic Web Conference, Karlsruhe, Germany (2008)

    Google Scholar 

  42. Garcia, A., Szomszor, M., Alani, H., Corcho, O.: Preliminary results in tag disambiguation using dbpedia. In: 1st International Workshop in Collective Knowledge Capturing and Representation, California, USA (2009)

    Google Scholar 

  43. Specia, L., Motta, E.: Integrating Folksonomies with the Semantic Web. In: 4th European Conference on The Semantic Web: Research and Applications, Innsbruck, Austria, pp. 624–639 (2007)

    Google Scholar 

  44. Giannakidou, E., Koutsonikola, V., Vakali, A., Kompatsiaris, Y.: Co-clustering tags and social data sources. In: 9th International Conference on Web-Age Information Management, Zhangjiajie, China, pp. 317–324 (2008)

    Google Scholar 

  45. Lin, H., Davis, J., Zhou, Y.: An integrated approach to extracting ontological structures from folksonomies. In: 6th European Semantic Web Conference, Heraklion, Greece, pp. 654–668 (2009)

    Google Scholar 

  46. Cimiano, P.: Ontology Learning and Population from Text: Algorithms, Evaluation and Applications, vol. 27. Springer (2006)

    Google Scholar 

  47. Plangprasopchok, A., Lerman, K., Getoor, L.: Growing a Tree in the Forest: Constructing Folksonomies by Integrating Structured Metadata. In: 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, pp. 949–958 (2010)

    Google Scholar 

  48. Plangprasopchok, A., Lerman, K.: Constructing Folksonomies from User-Specified Relations on Flickr. In: 18th International World Wide Web Conference, Madrid, Spain, pp. 781–790 (2009)

    Google Scholar 

  49. Almoqhim, F., Millard, D.E., Shadbolt, N.: An approach to building high-quality tag hierarchies from crowdsourced taxonomic tag pairs. In: Jatowt, A., Lim, E.-P., Ding, Y., Miura, A., Tezuka, T., Dias, G., Tanaka, K., Flanagin, A., Dai, B.T. (eds.) SocInfo 2013. LNCS, vol. 8238, pp. 129–138. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  50. Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)

    Article  Google Scholar 

  51. Miller, G.: WordNet: a lexical database for English. Communications of the ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Almoqhim, F., Millard, D.E., Shadbolt, N. (2014). Improving on Popularity as a Proxy for Generality When Building Tag Hierarchies from Folksonomies. In: Aiello, L.M., McFarland, D. (eds) Social Informatics. SocInfo 2014. Lecture Notes in Computer Science, vol 8851. Springer, Cham. https://doi.org/10.1007/978-3-319-13734-6_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-13734-6_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-13733-9

  • Online ISBN: 978-3-319-13734-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics