Skip to main content

Mining Credible and Relevant News from Social Networks

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10721))

Abstract

Today, people are increasingly accessing news through social networks like Twitter. This is regardless of the fact that whether the news is regarding a parliamentary election, or a famous entertainment celebrity. Moreover, these platforms allow people to like, retweet and comment on the shared news article. This shapes the opinions and beliefs of the people who read it along with the news article itself. However, a major problem we face today is the misuse of these networks for spreading rumors and misleading news content. This is the practice of yellow journalism which aims at disrupting public sentiment.

To address this problem, we present a methodology to find credible and relevant tweets that refer to actual news articles published on news websites. Our methodology scores each tweet based on the reputation of the users sharing it, the news publisher which published the news article, and the popularity of the news concepts mentioned in the article. We model the interaction between these three entities in the form of a tripartite graph and propose a Co-HITS algorithm based formulation to score all the entities involved. The scores of individual entities is used to assign a score for each tweet that indicates the credibility and relevance of the news mentioned in it. We find that the presence of many bots is also a big problem in these networks and can affect the results of such explorations. Thus, we use existing bot detection techniques to identify bots and propose an approach to limit their influence on the system in an efficient manner. Finally, we present a qualitative evaluation of our proposed system on a set of approximately 8000 tweets.

A. Garg–This author is now at Adobe Research.

V. Syal–This author is now at University of California, San Diego, USA.

P. Gudlani–This author is now at Google Inc.

D. Patel–Work performed when the author was affiliated to Indian Institute of Technology, Roorkee, India.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Visentin L.: Facebook wages war on click-bait. http://www.smh.com.au/digital-life/digital-life-news/facebook-wages-war-on-clickbait-20140825-108dd8.html

  2. Viner, K.: How technology disrupted the truth. https://www.theguardian.com/media/2016/jul/12/how-technology-disrupted-the-truth

  3. Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 675–684. ACM, New York (2011)

    Google Scholar 

  4. Mukherjee, S., Weikum, G.: Leveraging joint interactions for credibility analysis in news communities. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM 2015, pp. 353–362. ACM, New York (2015)

    Google Scholar 

  5. Ferrara, E., Varol, O., Davis, C.A., Menczer, F., Flammini, A.: The rise of social bots. CoRR, abs/1407.5225 (2014)

    Google Scholar 

  6. Twitter: Twitter developer documentation. https://dev.twitter.com/rest/public.

  7. Chu, Z., Gianvecchio, S., Wang, H., Jajodia, S.: Detecting automation of twitter accounts: Are you a human, bot, or cyborg? IEEE Trans. Dependable Secure Comput. 9(6), 811–824 (2012)

    Article  Google Scholar 

  8. Dickerson, J.P., Kagan, V., Subrahmanian, V.S.: Using sentiment to detect bots on twitter: Are humans more opinionated than bots? In: 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 620–627. IEEE (2014)

    Google Scholar 

  9. Zhang, C.M., Paxson, V.: Detecting and analyzing automated activity on twitter. In: Spring, N., Riley, G.F. (eds.) PAM 2011. LNCS, vol. 6579, pp. 102–111. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-19260-9_11

    Chapter  Google Scholar 

  10. Deng, H., Lyu, M.R., King, I.: A generalized co-hits algorithm and its application to bipartite graphs. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2009, pp. 239–248. ACM, New York (2009)

    Google Scholar 

  11. Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in twitter: The million follower fallacy. In: ICWSM, vol. 10, pp. 10–17 (2010)

    Google Scholar 

  12. Hannon, J., Bennett, M., Smyth, B.: Recommending twitter users to follow using content and collaborative filtering approaches. In: Proceedings of the Fourth ACM Conference on Recommender Systems, RecSys 2010, pp. 199–206. ACM, New York (2010)

    Google Scholar 

  13. Tao, K., Abel, F., Gao, Q., Houben, G.-J.: TUMS: Twitter-based user modeling service. In: García-Castro, R., Fensel, D., Antoniou, G. (eds.) ESWC 2011. LNCS, vol. 7117, pp. 269–283. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-25953-1_22

    Chapter  Google Scholar 

  14. Mazumder, S., Mehta, S., Patel, D.: Identifying top-k consistent news-casters on twitter. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM 2015, pp. 1875–1878. ACM, New York (2015)

    Google Scholar 

  15. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM (JACM) 46(5), 604–632 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  16. Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, NAACL 2003, Vol. 1, pp. 173–180, Stroudsburg. Association for Computational Linguistics (2003)

    Google Scholar 

  17. Alexa: Alexa - top sites by category: News. http://www.alexa.com/topsites/category/News.

  18. Widman, J.: Edgerank. http://edgerank.net/.

  19. Salihefendic, A.: How hacker news ranking algorithm works. https://medium.com/hacking-and-gonzo/how-hacker-news-ranking-algorithm-works-1d9b0cf2c08d.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ankur Garg .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Garg, A., Syal, V., Gudlani, P., Patel, D. (2017). Mining Credible and Relevant News from Social Networks. In: Reddy, P., Sureka, A., Chakravarthy, S., Bhalla, S. (eds) Big Data Analytics. BDA 2017. Lecture Notes in Computer Science(), vol 10721. Springer, Cham. https://doi.org/10.1007/978-3-319-72413-3_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-72413-3_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-72412-6

  • Online ISBN: 978-3-319-72413-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics