Skip to main content

An Exhaustive and Edge-Removal Algorithm to Find Cores in Implicit Communities

  • Conference paper
Book cover Advances in Data and Web Management (APWeb 2007, WAIM 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4505))

Abstract

Web community is intensely studied in web resource discovery. Many literatures use core as the signature of a community. A core is a complete bipartite graphs, denoted as Ci,j. But discovery of all possible Ci,j in the web is a challenging job. This work has been investigated by trawling [1][2]. Trawling employs repeated elimination/generation procedure until the graph is pruned to a satisfied state and then enumerate all possible Ci,j. We proposed a new method that uses exhaustive and edge removal method. Our algorithm avoids scanning dataset many times. Also, we improve crawling method by only recording potential fans to save disk space. The experiment result show that the new algorithm works properly and many new Ci,j can be found by our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kumar, R., Raghavan, P., et al.: Trawling the web for emerging cyber-communities. In: Proceedings of the 8th WWW Conference, Toronto, Canada, pp. 403–415 (1999)

    Google Scholar 

  2. Kumar, R., Raghavan, P., et al.: Extracting large-scale knowledge base from the web. In: Proceedings of 25th VLDB Conference, Edinburgh, Scotland, pp. 639–650 (1999)

    Google Scholar 

  3. Broder, A., Kumar, R., et al.: Graph structure in the web. Computer Networks 33(1-6), 309–320 (2000)

    Article  Google Scholar 

  4. Gibson, D., Kleinberg, J., et al.: Inferring Web Communities from Link Topology. In: Proceedings of the 9th ACM Conference on Hypertext and Hypermedia, Pittsburgh, PA, USA, pp. 225–234 (1998)

    Google Scholar 

  5. Chakrabarti, S., Dom, B.E., et al.: Automatic resource compilation by analyzing hyperlink structure and associated text. Computer Networks 30(1-7), 65–74 (1998)

    Google Scholar 

  6. Dean, J., Henzinger, M.R.: Finding Related Pages in the World Wide Web. In: Proceedings of the 8th WWW Conference, Toronto, Canada, pp. 389–401 (1999)

    Google Scholar 

  7. Reddy, P.K., Kitsuregawa, M.: Inferring Web Community through relaxed-cocition and power-law. Annual Report of KITSUREGAWA Lab., pp. 27–40 (2001)

    Google Scholar 

  8. Flake, G.W., Lawrence, S., et al.: Efficient Identification of Web Communities. In: Proceedings of the 6th ACM SIGKDD Conference on Knowledge discovery and data mining, Boston, MA, USA, pp. 150–160 (2000)

    Google Scholar 

  9. Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA 99, 7821–7826 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  10. Newman, M.E.J.: Fast algorithm for detecting community structure in networks. Phys. Rev. E 69, 066133 (2004)

    Google Scholar 

  11. Newman, M.E.J.: Detecting community structure in networks. Europe. Phys. J. B 38, 321–330 (2004)

    Article  Google Scholar 

  12. http://www.cs.cornell.edu/home/kleinber/web-graph.ps

  13. http://perso.wanadoo.fr/sebastien.ailleret/index-eng.html

  14. Broder, A.Z., Glassman, S.C., et al.: Syntactic Clustering of the Web. Computer Networks 29(8-13), 1157–1166 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Guozhu Dong Xuemin Lin Wei Wang Yun Yang Jeffrey Xu Yu

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Yang, N., Lin, S., Gao, Q. (2007). An Exhaustive and Edge-Removal Algorithm to Find Cores in Implicit Communities. In: Dong, G., Lin, X., Wang, W., Yang, Y., Yu, J.X. (eds) Advances in Data and Web Management. APWeb WAIM 2007 2007. Lecture Notes in Computer Science, vol 4505. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72524-4_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-72524-4_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72483-4

  • Online ISBN: 978-3-540-72524-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics