Abstract
Blogging is a popular and prominent application of the Web 2.0 era. According to recent measurements, often considered conservative, there are currently more than 152 million blogs worldwide, with content spanning every aspect of life and science, which necessitates long-term blog preservation and knowledge management. In this work, we present a range of issues that arise in blog preservation. We argue that current web archiving solutions cannot capture the dynamic and continuously evolving nature of blogs, their network and social structure, or the exchange of concepts and ideas that they foster. Furthermore, we provide directions and objectives for realizing robust digital preservation, management and dissemination facilities for blogs. Finally, we introduce the EC-funded BlogForever project, its main motivation, and its findings towards widening the scope of blog preservation.
Acknowledgements
The research leading to these results has received funding from the European Commission Framework Programme 7 (FP7), BlogForever project, grant agreement No. 269963. We would also like to thank all BlogForever project partners for their invaluable contributions to the project.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Banos, V., Baltas, N., Manolopoulos, Y. (2013). Blog Preservation: Current Challenges and a New Paradigm. In: Cordeiro, J., Maciaszek, L.A., Filipe, J. (eds) Enterprise Information Systems. Lecture Notes in Business Information Processing, vol 141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40654-6_3
DOI: https://doi.org/10.1007/978-3-642-40654-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40653-9
Online ISBN: 978-3-642-40654-6
eBook Packages: Computer Science (R0)