Skip to main content

Using Web Crawlers for Feature Extraction of Social Nets for Analysis

  • Conference paper
  • First Online:
Information Technology - New Generations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 558))

  • 3212 Accesses

Abstract

This paper presents a crawler based feature extraction technique for social network analysis. This technique crawl a predefined actor and his associated activities in social network space. From the activities, a set of features are extracted that can be used for a broad spectrum of social network analysis. The utility can act as a middle ware providing a level of abstraction to researchers involved in social network analysis. The tools provide a formatted set of ready features with open APIs that can be easily integrated in any application.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wilson, R. E., Gosling, S. D., & Graham, L. T. (2012). A review of Facebook research in the social sciences. Perspectives on Psychological Science, 3, 203–220.

    Article  Google Scholar 

  2. https://www.informatica.com/products/data-integration/connectors- powerexchange.html#fbid=UAP0D-8ts7R. Acessed 10 Jan 2017.

  3. NODEXL: https://nodexl.codeplex.com/. Acessed 10 Jan 2017.

  4. Mislove, A., Marcon, M., Gummadi, K. P., Druschel, P., Bhattacharjee, B. (2007). Measurement and analysis of online social networks. In Proceedings of the 7th ACM SIGCOMM conference on Internet measurement (IMC’07), San Diego.

    Google Scholar 

  5. Gjoka, M., Kurant, M., Butts, C., Markopoulou, A. (2010). Walking in facebook: a case study of unbiased sampling of OSNs. INFOCOM, San Diego, Proceedings IEEE, (pp. 1–9)

    Google Scholar 

  6. Frank, O. (2011). Survey sampling in networks. In The SAGE handbook of social network analysis, London, Sage, (pp. 389–403).

    Google Scholar 

  7. Shkapenyuk, V., & Suel, T. (2002). Design and implementation of a high-performance distributed web crawler. In: Proceedings of the International Conference on Data Engineering, San Jose, CA, United States, IEEE CS Press (pp. 357–368).

    Google Scholar 

  8. Boldi, P., Codenotti, B., Santini, M., & Vigna, S. (2004). Ubicrawler: a scalable fully distributed web crawler. Software Practice and Experience, 34, 711–726.

    Article  Google Scholar 

  9. Yakushev, A. V., Alexander, V. B., Sloot. P. M. A. Topic crawler for social networks monitoring Knowledge Engineering and the Semantic Web Volume 394 of the series Communications in Computer and Information Science, vol 394. Springer, Berlin, Heidelberg, (pp. 214–227).

    Google Scholar 

  10. Gjoka, M., Kurant, M., Butts, C., Markopoulou, A.(2010). Walking in facebook: a case study of unbiased sampling of osns In: Proceedings of the 2010 I.E. Conference on Computer Communications, (pp. 1–9). San Diego, IEEE.

    Google Scholar 

  11. Wilson, C., Boe, B., Sala, A., Puttaswamy, K. P., Zhao, B. Y. (2009). User interactions in social networks and their implications. In Proceedings of the 4th ACM European conference on Computer systems (EuroSys’09), Nuremberg.

    Google Scholar 

  12. Saif, H., He, Y., Alani, H. (2012). Alleviating data sparsity for twitter sentiment analysis. In The 2nd workshop on Making Sense of Microposts. (#MSM ’12): Big Things Come in Small Packages at the 21st International Conference on the World Wide Web (WWW ’12), pp. 2–9, Lyon, France.

    Google Scholar 

  13. Abel, F., Gao, Q., Houben, G. J., Tao, K. (2011). Semantic enrichment of twitter posts for user profile construction on the social web. In The Semanic Web: Research and Applications, (pp. 375–389). Springer-Verlag Berlin, Heidelberg.

    Google Scholar 

  14. Steiner, T., Verborgh, R., Vallés, J. G., de Walle, R. (2011). Adding meaning to Facebook microposts via a mash-up API and tracking its data provenance. In Next Generation Web Services Practices (NWeSP), 7th International Conference on, 2011, (pp. 342–345).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fozia Noor .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Noor, F., Shah, A., Gill, W., Khan, S.A. (2018). Using Web Crawlers for Feature Extraction of Social Nets for Analysis. In: Latifi, S. (eds) Information Technology - New Generations. Advances in Intelligent Systems and Computing, vol 558. Springer, Cham. https://doi.org/10.1007/978-3-319-54978-1_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54978-1_40

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54977-4

  • Online ISBN: 978-3-319-54978-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics