Skip to main content
Log in

Learning to predict opinion share and detect anti-majority opinionists in social networks

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

We address the problem of detecting anti-majority opinionists using the value-weighted mixture voter (VwMV) model. This problem is motivated by the fact that 1) each opinion has its own value and an opinion with a higher value propagates more easily/rapidly and 2) there are always people who have a tendency to disagree with any opinion expressed by the majority. We extend the basic voter model to include these two factors with the value of each opinion and the anti-majoritarian tendency of each node as new parameters, and learn these parameters from a sequence of observed opinion data over a social network. We experimentally show that it is possible to learn the opinion values correctly using a short observed opinion propagation data and to predict the opinion share in the near future correctly even in the presence of anti-majoritarians, and also show that it is possible to learn the anti-majoritarian tendency of each node if longer observation data is available. Indeed, the learned model can predict the future opinion share much more accurately than a simple polynomial extrapolation can do. Ignoring these two factors substantially degrade the performance of share prediction. We also show theoretically that, in a situation where the local opinion share can be approximated by the average opinion share, 1) when there are no anti-majoritarians, the opinion with the highest value eventually takes over, but 2) when there are a certain fraction of anti-majoritarians, it is not necessarily the case that the opinion with the highest value prevails and wins, and further, 3) in both cases, when the opinion values are uniform, the opinion share prediction problem becomes ill-defined and any opinion can win. The simulation results support that this holds for typical real world social networks. These theoretical results help understand the long term behavior of opinion propagation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Notes

  1. This may look a rather unnatural assumption because it is unlikely that all the different opinions are initiated at the same time. Since each opinion is initiated by a single person and the goal is to see how it is propagated, it should be allowed that each opinion is assigned to only one node and all the remaining nodes are in neutral states, i.e., unaffected by any opinion yet. We could have changed the timing of each opinion’s initial utterance, but chose the simplest case.

  2. This assumes that the average delay time is 1.

  3. If the goal is to predict which opinion wins eventually, it is sufficient to identify which opinion has the highest value, but if we want to estimate the share of each opinion, we need to estimate the values accurately.

  4. This makes the analysis drastically simpler, but the results remains valid qualitatively.

  5. Their results is that the basic voter model converges after O(n 3logn) steps with probability 1- o(1) where n is the number of nodes.

  6. Opinion propagation is directional. Choosing bidirectional networks means that opinion can propagate in both directions.

  7. It would be the best if we can use the real opinion propagation data. However, as we are not able to find such data, the next best is to use the network structures constructed from the real world social media data (not synthetic networks).

  8. It may sound more reasonable to weight each difference by the share itself, but we decided not to do so. We rather considered the prediction problem as the classification problem.

References

  • Agarwal, N., & Liu, H. (2008). Blogosphere: research issues, tools, and applications. SIGKDD Explorations, 10, 18–31.

    Article  Google Scholar 

  • Arenson, A.J. (1996). Rejection of the power of judicial review in britain. Deakin Law Review, 3, 37–53.

    Google Scholar 

  • Bakshy, E., Hofman, J., Mason, W., Watts, D. (2011). Everyone’s an influencer: quantifying influences on twitter. In Proceedings of the 4th international conference on web search and data mining (WSDM’11) (pp. 65–74).

  • Castellano, C., Munoz, M.A., Pastor-Satorras, R. (2009). Nonlinear q-voter model. Physical Review E, 80, 041129:1–041129:8.

    Article  Google Scholar 

  • Chen, W., Wang, Y., Yang, S. (2009). Efficient influence maximization in social networks. In Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining (KDD’09) (pp. 199–208).

  • Chen, W., Yuan, Y., Zhang, L. (2010). Scalable influence maximization in social networks under the linear threshold model. In Proceedings of the 10th IEEE international conference on data mining (ICDM’10) (pp. 88–97).

  • Crandall, D., Cosley, D., Huttenlocner, D., Kleinberg, J., Suri, S. (2008). Feedback effects between similarity and social influence in online communities. In Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining (KDD’08) (pp. 160–168).

  • Domingos, P. (2005). Mining social networks for viral marketing. IEEE Intelligent Systems, 20, 80–82.

    Article  Google Scholar 

  • Domingos, P., & Richardson, M. (2001). Mining the network value of customers. In Proceedings of the 7th ACM SIGKDD international conference on knowledge discovery and data mining (KDD’01) (pp. 57–66).

  • Donnelly, P., & Welsh, D. (1984). The antivoter problem: random 2-colourings of graphs. In Graph theory and combinatorics (pp. 133–144).

  • Even-Dar, E., & Shapira, A. (2007). A note on maximizing the spread of influence in social networks. In Proceedings of the 3rd international workshop on internet and network economics (WINE’07) (pp 281–286).

  • Gill, J., & Gainous, J. (2002). Why does voting get so complicated? A review of theories for analyzing democratic participation. Statistical Science, 17, 383–404.

    Article  MathSciNet  MATH  Google Scholar 

  • Gruhl, D., Guha, R., Liben-Nowell, D., Tomkins, A. (2004). Information diffusion through blogspace. SIGKDD Explorations, 6, 43–52.

    Article  Google Scholar 

  • Holme, P., & Newman, M.E.J. (2006). Nonequilibrium phase transition in the coevolution of networks and opinions. Physical Review E, 74, 056108:1–056108:5.

    Google Scholar 

  • Huber, M., & Reinert, G. (2004). The stationary distribution in the antivoter model: exact sampling and approximations. In Stein’s method: Expository lectures and applications. IMS lecture notes (Vol. 46, pp. 75–92).

  • Kempe, D., Kleinberg, J., Tardos, E. (2003). Maximizing the spread of influence through a social network. In Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining (KDD’03) (pp. 137–146).

  • Kimura, M., Saito, K., Motoda, H. (2009). Blocking links to minimize contamination spread in a social network. ACM Transactions on Knowledge Discovery from Data, 3, 9:1–9:23.

    Article  Google Scholar 

  • Kimura, M., Saito, K., Nakano, R., Motoda, H. (2010a). Extracting influential nodes on a social network for information diffusion. Data Mining and Knowledge Discovery, 20, 70–97.

    Article  MathSciNet  Google Scholar 

  • Kimura, M., Saito, K., Ohara, K., Motoda, H. (2010b). Learning to predict opinion share in social networks. In Proceedings of the 24th AAAI conference on artificial intelligence (AAAI’10) (pp. 1364–1370).

  • Kimura, M., Saito, K., Ohara, K., Motoda, H. (2011). Detecting anti-majority opinionists using value-weighted mixture voter model. In Proceedings of the 14th international conference on discovery science (DS’11). LNAI (Vol. 6926, pp. 150–164).

  • Klimt, B., & Yang, Y. (2004). The enron corpus: a new dataset for email classification research. In Proceedings of the 15th European conference on machine learning (ECML’04) (pp. 217–226).

  • Leskovec, J., Adamic, L.A., Huberman, B.A. (2007a). The dynamics of viral marketing. ACM Transactions on the Web, 1, 5:1–5:39.

    Article  Google Scholar 

  • Leskovec, J., Krause, A., Guestrin, C., Faloutsos, C., VanBriesen, J., Glance, N. (2007b). Cost-effective outbreak detection in networks. In Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining (KDD’07) (pp. 420–429).

  • Liggett, T.M. (1999). Stochastic interacting systems: Contact, voter, and exclusion processes. New York: Springer.

    Book  MATH  Google Scholar 

  • Mathioudakis, M., Bonch, F., Castillo, C., Gionis, A., Ukkonen, A. (2011). Sparsification of influence networks. In Proceedings of the 17th ACM SIGKDD conference on knowledge discovery and data mining (KDD’11) (pp. 529–537).

  • Matloff, N. (1977). Ergodicity conditions for a dissonant voting model. Annals of Probability, 5, 371–386.

    Article  MathSciNet  MATH  Google Scholar 

  • Newman, M.E.J. (2003). The structure and function of complex networks. SIAM Review, 45, 167–256.

    Article  MathSciNet  MATH  Google Scholar 

  • Newman, M.E.J., Forrest, S., Balthrop, J. (2002). Email networks and the spread of computer viruses. Physical Review E, 66, 035101:1–035101:4.

    Google Scholar 

  • Palla, G., Derényi, I., Farkas, I., Vicsek, T. (2005). Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435, 814–818.

    Article  Google Scholar 

  • Richardson, M., & Domingos, P. (2002). Mining knowledge-sharing sites for viral marketing. In Proceedings of the 8th ACM SIGKDD international conference on knowledge discovery and data mining (KDD’02) (pp. 61–70).

  • Röllin, A. (2007). Translated poisson approximation using exchangeable pair couplings. Annals of Applied Probablity, 17, 1596–1614.

    Article  MATH  Google Scholar 

  • Romero, D., Meeder, B., Kleinberg, J. (2011). Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In Proceedings of the 20th international world wide web conference (WWW’11) (pp. 695–704).

  • Sood, V., & Redner, S. (2005). Voter model on heterogeneous graphs. Physical Review Letters, 94, 178701:1–178701:4.

    Article  Google Scholar 

  • Wu, F., & Huberman, B.A. (2008). How public opinion forms. In Proceedings of the 4th international workshop on internet and network economics (WINE’08) (pp. 334–341).

  • Yang, H., Wu, Z., Zhou, C., Zhou, T., Wang, B. (2009). Effects of social diversity on the emergence of global consensus in opinion dynamics. Physical Review E, 80, 046108:1–046108:5.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Masahiro Kimura.

Additional information

This work was partly supported by Asian Office of Aerospace Research and Development, Air Force Office of Scientific Research under Grant No. AOARD-10-4053, and JSPS Grant-in-Aid for Scientific Research (C) (No. 23500194).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kimura, M., Saito, K., Ohara, K. et al. Learning to predict opinion share and detect anti-majority opinionists in social networks. J Intell Inf Syst 41, 5–37 (2013). https://doi.org/10.1007/s10844-012-0222-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-012-0222-7

Keywords

Navigation