Mining Credible and Relevant News from Social Networks

Garg, Ankur; Syal, Varun; Gudlani, Pankaj; Patel, Dhaval

doi:10.1007/978-3-319-72413-3_6

Mining Credible and Relevant News from Social Networks

Ankur Garg¹⁷,
Varun Syal¹⁷,
Pankaj Gudlani¹⁷ &
…
Dhaval Patel¹⁸

Conference paper
First Online: 25 November 2017

2223 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10721))

Abstract

Today, people are increasingly accessing news through social networks like Twitter. This is regardless of the fact that whether the news is regarding a parliamentary election, or a famous entertainment celebrity. Moreover, these platforms allow people to like, retweet and comment on the shared news article. This shapes the opinions and beliefs of the people who read it along with the news article itself. However, a major problem we face today is the misuse of these networks for spreading rumors and misleading news content. This is the practice of yellow journalism which aims at disrupting public sentiment.

To address this problem, we present a methodology to find credible and relevant tweets that refer to actual news articles published on news websites. Our methodology scores each tweet based on the reputation of the users sharing it, the news publisher which published the news article, and the popularity of the news concepts mentioned in the article. We model the interaction between these three entities in the form of a tripartite graph and propose a Co-HITS algorithm based formulation to score all the entities involved. The scores of individual entities is used to assign a score for each tweet that indicates the credibility and relevance of the news mentioned in it. We find that the presence of many bots is also a big problem in these networks and can affect the results of such explorations. Thus, we use existing bot detection techniques to identify bots and propose an approach to limit their influence on the system in an efficient manner. Finally, we present a qualitative evaluation of our proposed system on a set of approximately 8000 tweets.

A. Garg–This author is now at Adobe Research.

V. Syal–This author is now at University of California, San Diego, USA.

P. Gudlani–This author is now at Google Inc.

D. Patel–Work performed when the author was affiliated to Indian Institute of Technology, Roorkee, India.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Visentin L.: Facebook wages war on click-bait. http://www.smh.com.au/digital-life/digital-life-news/facebook-wages-war-on-clickbait-20140825-108dd8.html
Viner, K.: How technology disrupted the truth. https://www.theguardian.com/media/2016/jul/12/how-technology-disrupted-the-truth
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on twitter. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, pp. 675–684. ACM, New York (2011)
Google Scholar
Mukherjee, S., Weikum, G.: Leveraging joint interactions for credibility analysis in news communities. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM 2015, pp. 353–362. ACM, New York (2015)
Google Scholar
Ferrara, E., Varol, O., Davis, C.A., Menczer, F., Flammini, A.: The rise of social bots. CoRR, abs/1407.5225 (2014)
Google Scholar
Twitter: Twitter developer documentation. https://dev.twitter.com/rest/public.
Chu, Z., Gianvecchio, S., Wang, H., Jajodia, S.: Detecting automation of twitter accounts: Are you a human, bot, or cyborg? IEEE Trans. Dependable Secure Comput. 9(6), 811–824 (2012)
Article Google Scholar
Dickerson, J.P., Kagan, V., Subrahmanian, V.S.: Using sentiment to detect bots on twitter: Are humans more opinionated than bots? In: 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pp. 620–627. IEEE (2014)
Google Scholar
Zhang, C.M., Paxson, V.: Detecting and analyzing automated activity on twitter. In: Spring, N., Riley, G.F. (eds.) PAM 2011. LNCS, vol. 6579, pp. 102–111. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-19260-9_11
Chapter Google Scholar
Deng, H., Lyu, M.R., King, I.: A generalized co-hits algorithm and its application to bipartite graphs. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2009, pp. 239–248. ACM, New York (2009)
Google Scholar
Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in twitter: The million follower fallacy. In: ICWSM, vol. 10, pp. 10–17 (2010)
Google Scholar
Hannon, J., Bennett, M., Smyth, B.: Recommending twitter users to follow using content and collaborative filtering approaches. In: Proceedings of the Fourth ACM Conference on Recommender Systems, RecSys 2010, pp. 199–206. ACM, New York (2010)
Google Scholar
Tao, K., Abel, F., Gao, Q., Houben, G.-J.: TUMS: Twitter-based user modeling service. In: García-Castro, R., Fensel, D., Antoniou, G. (eds.) ESWC 2011. LNCS, vol. 7117, pp. 269–283. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-25953-1_22
Chapter Google Scholar
Mazumder, S., Mehta, S., Patel, D.: Identifying top-k consistent news-casters on twitter. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, CIKM 2015, pp. 1875–1878. ACM, New York (2015)
Google Scholar
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM (JACM) 46(5), 604–632 (1999)
Article MathSciNet MATH Google Scholar
Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, NAACL 2003, Vol. 1, pp. 173–180, Stroudsburg. Association for Computational Linguistics (2003)
Google Scholar
Alexa: Alexa - top sites by category: News. http://www.alexa.com/topsites/category/News.
Widman, J.: Edgerank. http://edgerank.net/.
Salihefendic, A.: How hacker news ranking algorithm works. https://medium.com/hacking-and-gonzo/how-hacker-news-ranking-algorithm-works-1d9b0cf2c08d.

Download references

Author information

Authors and Affiliations

Indian Institute of Technology, Roorkee, India
Ankur Garg, Varun Syal & Pankaj Gudlani
IBM TJ Watson Research Center, Yorktown Heights, USA
Dhaval Patel

Authors

Ankur Garg
View author publications
You can also search for this author in PubMed Google Scholar
Varun Syal
View author publications
You can also search for this author in PubMed Google Scholar
Pankaj Gudlani
View author publications
You can also search for this author in PubMed Google Scholar
Dhaval Patel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ankur Garg .

Editor information

Editors and Affiliations

International Institute of Information Technology, Hyderabad, India
P. Krishna Reddy
Rajiv Gandhi Education City, Sonepat, India
Ashish Sureka
University of Texas at Arlington, Arlington, Texas, USA
Sharma Chakravarthy
University of Aizu, Aizu-Wakamatsu, Japan
Subhash Bhalla

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Garg, A., Syal, V., Gudlani, P., Patel, D. (2017). Mining Credible and Relevant News from Social Networks. In: Reddy, P., Sureka, A., Chakravarthy, S., Bhalla, S. (eds) Big Data Analytics. BDA 2017. Lecture Notes in Computer Science(), vol 10721. Springer, Cham. https://doi.org/10.1007/978-3-319-72413-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-72413-3_6
Published: 25 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72412-6
Online ISBN: 978-3-319-72413-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics