Skip to main content

Abstract

One of the important elements of the new generation of the Web is the emergence of blogs. Currently a considerable number of users are creating content using blogs. Although Persian blogs have a short history, they have improved significantly during this short period. Because of fundamental differences between Persian and other languages, limited work has been done to analyze Persian blogs. In this work, a system named BlogDisc for automatic discovery and accumulation of Persian blogs is developed. This system uses content as well as link structure of the blogs. As an important part of this research, we propose an algorithm to recognize blogs that are not hosted on special blog hosts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Chakrabarti, M. van den Berg, and B. E. Dom. “Focused crawling: a new approach to topic-specific web resource discovery”. In Proceedings of the Eighth International World Wide Web Conference, Toronto, Canada, May 1999.

    Google Scholar 

  2. K. Balog and M. de Rijke, “Decomposing Bloggers’ Moods. Towards a Time Series Analysis of Moods in the Blogsphere”, In Proceedings of WWW2006 Workshop on Blogging Ecosystem, Edinburgh, Scotland, Apr. 2006.

    Google Scholar 

  3. S. Nakajima, J. Tatemura, Y. Hino, Y. Hara, and K. Tanaka, “Discovering Important Bloggers Based on a Blog Thread Analysis”, In Proceedings of WWW2005 Workshop on Blogging Ecosystem, Chiba , Japan, May 2006.

    Google Scholar 

  4. T. Nanno, “Automatic Collection and Monitoring of Japanese Blogs”, WWW 2004 Workshop on the Blogging Ecosystem: Aggregation, Analysis and Dynamics, New York, May 18th 2004

    Google Scholar 

  5. R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. On the bursty evolution of blogsphere. In Proc. Of the 12th International World Wide Web Conference, pages 568- 576, 2003.

    Google Scholar 

  6. T. Nanno, T. Fujiki, Y. Suzuki, M. Okumura, “Automatically collecting, monitoring, and mining japanese blogs”. WWW (Alternate Track Papers & Posters) 2004: 320-321

    Google Scholar 

  7. K. Sheykh Esmaili, M. Jamali, M. Neshati, and H. Abolhassani, Y. Soltan-Zadeh, “Experiments on Persian Blogs”, WWW2006 Workshop on Blogging Ecosystem, Edinburgh, Scotland, Apr. 2006.

    Google Scholar 

  8. K. Sheykh Esmaili, M. Neshati, M. Jamali, H. Abolhassani, J. Habibi, “Comparing Performance of Recommendation Techniques in the Blogsphere”, Proceedings of ECAI2006 Workshop on Recommender Systems, Trento, Italy, August 2006.

    Google Scholar 

  9. R. Kumar, J. Novak, P. Raghavan, and A. Tomkins, “Structure and Evolution of Blogsphere”, Communications of the ACM, Volume 47, Issue 12 (December 2004).

    Google Scholar 

  10. Webstats4u ,http://www.Webstats4u.com.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer

About this paper

Cite this paper

Esmaili, K.S., Abolhassani, H., Abbassi, Z. (2007). BlogDisc: A System for Automatic Discovery and Accumulation of Persian Blogs. In: Elleithy, K. (eds) Advances and Innovations in Systems, Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6264-3_48

Download citation

  • DOI: https://doi.org/10.1007/978-1-4020-6264-3_48

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-6263-6

  • Online ISBN: 978-1-4020-6264-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics