Identifying Key Opinion Leaders in Evolving Co-authorship Networks—A Descriptive Study of a Proxy Variable for Betweenness Centrality

Putzke, Johannes; Takeda, Hideaki

doi:10.1007/978-3-319-30569-1_24

Johannes Putzke⁶ &
Hideaki Takeda⁶

Part of the book series: Studies in Computational Intelligence ((SCI,volume 644))

1087 Accesses

Abstract

Many researchers identify influentials in a network by their betweenness centrality. Whereas betweenness centrality can be calculated in small, static, connected networks, its calculation in complex, large, evolving networks frequently causes some problems. Hence, we propose a proxy variable for a node’s betweenness centrality that can be calculated in large, evolving networks. We illustrate our approach using the example of Key Opinion Leader (KOL) identification in an evolving co-authorship network of researchers who have published articles about PCSK9 (a protein that regulates cholesterol levels).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The execution times are too high even when applying improved algorithms to calculate betweenness centrality (e.g. [3, 6, 30]).
2.
DBLP is a database of computer science publications; http://dblp.org, accessed on July 22nd 2015.
3.
PCSK9 is a protein which regulates LDL cholesterol levels. By blocking PCSK9, cholesterol levels can be brought substantially down. Hence, drugs can be developed that reduce the risk of cardiovascular diseases by blocking PCSK9.
4.
http://www.ncbi.nlm.nih.gov/pubmed, accessed on June 4th 2015.
5.
http://gephi.org, accessed on July 14th 2015.
6.
In the analyses, we left out the years 1994–2002, since no papers about PCSK9 were published then.
7.
Although Freeman [13] proposed a standardised measure of betweenness centrality that can theoretically be used for comparing centrality scores between components of different size, we think that it is, for example, not meaningful to compare the maximal betweenness centrality of a node in a component with three actors to that of a node in the main component of a co-authorship network.
8.
The main component of a network is also sometimes referred to as the “giant component”.
9.
There were no other meaningful big components in the network. For example, the second (third) biggest component in the network comprised 1.23 % (0.69 %) of all authors.
10.
These influential people include basic researchers as well as researchers conducting clinical trials. Hence, some context knowledge is helpful for reading the tables.
11.
We suppose that node A will have a high betweenness centrality in the final network, although node B and node C are more likely to co-author a paper in the future than two random nodes if both have co-authored a paper with node A. In the literature, this fact has been termed the “forbidden triad” [14].
12.
This is true for all years except 2004 and 2005. However, there were only very few publications in these years (14 and 18 respectively, compare Table 1), and the high correlation coefficients between an author’s degree centrality and this author’s betweenness centrality for those two years can be explained by chance. Furthermore, the differences in the correlation coefficients between the number of an author’s unclosed triads and betweennness centrality and author’s degree centrality and betweenness centrality for the years 2004 and 2005 are not very large (0.4428995 vs. 0.5012240 and 0.4435424 vs. 0.4919271).
13.
The “Florentine families network” is a very small network (with 16 nodes only).
14.
Encouraged by a literature review and interviews with marketing managers from the pharmaceutical industry, we assumed that authors with a high betweenness centrality have a high influence as well. Although we think that this is a reasonable assumption for co-authorship networks, we want to be clear that structural importance and dynamic influence of nodes do not necessarily have to be the same.

References

Aggarwal, C., Subbian, K.: Evolutionary network analysis: a survey. ACM Comput. Surv. 47(1), 10 (2014). doi:10.1145/2601412
Google Scholar
Backstrom, L., Huttenlocher, D., Kleinberg, J., Lan, X.: Group formation in large social networks: membership, growth, and evolution. In: Paper presented at the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA
Google Scholar
Bader, D.A., Kintali, S., Madduri, K., Mihail, M.: Approximating betweenness centrality. In: Algorithms and Models for the Web-Graph, pp. 124–137. Springer, (2007)
Google Scholar
Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), 509–512 (1999). doi:10.1126/science.286.5439.509
Article MathSciNet MATH Google Scholar
Barabási, A.L., Jeong, H., Neda, Z., Ravasz, E., Schubert, A., Vicsek, T.: Evolution of the social network of scientific collaborations. Physica A 311(3–4), 590–614 (2002)
Article MathSciNet MATH Google Scholar
Brandes, U.: A faster algorithm for betweenness centrality. J. Math. Sociol. 25(2), 163–177 (2001)
Article MATH Google Scholar
Breiger, R.L., Pattison, P.E.: Cumulated social roles—the duality of persons and their algebras. Soc. Netw. 8(3), 215–256 (1986). doi:10.1016/0378-8733(86)90006-7
Article Google Scholar
Butts, C.T.: Social network analysis with sna. J. Stat. Softw. 24(6), 1–51 (2008)
Article Google Scholar
Czárdi, G., Nepusz, T.: The igraph software package for complex network research. InterJournal, Complex Syst. 1695 (2006)
Google Scholar
Ediger, D., Jiang, K., Riedy, J., Bader, D., Corley, C., Farber, R., Reynolds, W.N.: Massive social network analysis: mining twitter for social good. In: Paper presented at the 39th International Conference on Parallel Processing (ICPP) San Diego, CA
Google Scholar
Franceschet, M.: Collaboration in computer science: a network science approach. J. Am. Soc. Inform. Sci. Technol. 62(10), 1992–2012 (2011). doi:10.1002/asi.21614
Article Google Scholar
Freeman, L.C.: Set of measures of centrality based on betweenness. Sociometry 40(1), 35–41 (1977). doi:10.2307/3033543
Article Google Scholar
Freeman, L.C.: Centrality in social networks conceptual clarification. Soc. Netw. 1(3), 215–239 (1979). doi:10.1016/0378-8733(78)90021-7
Article Google Scholar
Granovetter, M.S.: The strength of weak ties. Am. J. Sociol. 78(6), 1360–1380 (1973). doi:10.1086/225469
Article Google Scholar
Jeong, H., Mason, S.P., Barabási, A.L., Oltvai, Z.N.: Lethality and centrality in protein networks. Nature 411(6833), 41–42 (2001). doi:10.1038/35075138
Article Google Scholar
Knuth, D.E.: The Stanford GraphBase: A Platform for Combinatorial Computing. Addison-Wesley Reading (1993)
Google Scholar
Liu, P., Xia, H.: Structure and evolution of co-authorship network in an interdisciplinary research field. Scientometrics 103(1), 101–134 (2015). doi:10.1007/s11192-014-1525-y
Article MathSciNet Google Scholar
McLaughlin, A., Bader, D.A.: Scalable and high performance betweenness centrality on the GPU. In: Paper presented at the International Conference for High Performance Computing, Networking, Storage and Analysis, New Orleans, Louisana
Google Scholar
Morstatter, F., Pfeffer, J., Liu, H., Carley, K.M.: Is the sample good enough? comparing data from Twitter’s streaming API with Twitter’s firehose. In: Paper presented at the International AAAI Conference on Web and Social Media, Cambridge, Massachusetts
Google Scholar
Newman, M.E.: Scientific collaboration networks—I. Network construction and fundamental results. Phys. Rev. E 64, 016131 (2001)
Google Scholar
Newman, M.E.: Scientific collaboration networks—II. Shortest paths, weighted networks, and centrality. Phys. Rev. E 64, 016132 (2001)
Google Scholar
Newman, M.E.J.: The structure of scientific collaboration networks. Proc. Nat. Acad. Sci. USA 98(2), 404–409 (2001). doi:10.1073/pnas.021544898
Article MathSciNet MATH Google Scholar
Newman, M.E.J.: Coauthorship networks and patterns of scientific collaboration. Proc. Nat. Acad. Sci. 101(suppl 1), 5200–5205 (2004). doi:10.1073/pnas.0307545100
Article Google Scholar
Shi, X., Bonner, M., Adamic, L.A., Gilbert, A.C.: The very small world of the well-connected. In: Paper presented at the Nineteenth ACM Conference on Hypertext and Hypermedia, Pittsburgh, PA, USA
Google Scholar
Spearman, C.: The proof and measurement of association between 2 Things (Reprinted from Amer. J. Psychol. vol. 15, pp. 72–101, 1904). Am. J. Psychol 100(3–4), 441–471 (1987)
Article Google Scholar
Tomassini, M., Luthi, L.: Empirical analysis of the evolution of a scientific collaboration network. Physica A 385(2), 750–764 (2007)
Article Google Scholar
Vidgen, R., Henneberg, S., Naude, P.: What sort of community is the European Conference on information systems? a social network analysis 1993–2005. Eur. J. Inf. Syst. 16(1), 5–19 (2007)
Google Scholar
Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998). doi:10.1038/30918
Article Google Scholar
Yan, E., Ding, Y.: Applying centrality measures to impact analysis: a coauthorship network analysis. J. Am. Soc. Inform. Sci. Technol. 60(10), 2107–2118 (2009). doi:10.1002/asi.21128
Article Google Scholar
Yang, J., Chen, Y.: Fast computing betweenness centrality with virtual nodes on large sparse networks. PLoS ONE 6(7), e22557 (2011). doi:10.1371/journal.pone.0022557
Article Google Scholar
Yang, Y., Dong, Y., Chawla, N.V.: Predicting node degree centrality with the node prominence profile. Sci. Rep. 4(7236) (2014). doi:10.1038/srep07236

Download references

Acknowledgments

This work was supported by a fellowship within the FITweltweit programme of the German Academic Exchange Service (DAAD).

Author information

Authors and Affiliations

National Institute of Informatics, 2-1-2 Hitotsubashi, Chiyoda, Tokyo, 101-8430, Japan
Johannes Putzke & Hideaki Takeda

Authors

Johannes Putzke
View author publications
You can also search for this author in PubMed Google Scholar
Hideaki Takeda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Johannes Putzke .

Editor information

Editors and Affiliations

LE2I , UFR Sciences et Techniques, Université de Bourgogne, Dijon, France
Hocine Cherifi
New York University, Center for Data Science, New York, USA
Bruno Gonçalves
BioComplex Laboratory, Computer Sciences, Florida Institute of Technology, Melbourne, FL, Florida, USA
Ronaldo Menezes
111 Dana, Physics Department, Northeastern University, Boston, Massachusetts, USA
Roberta Sinatra

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Putzke, J., Takeda, H. (2016). Identifying Key Opinion Leaders in Evolving Co-authorship Networks—A Descriptive Study of a Proxy Variable for Betweenness Centrality. In: Cherifi, H., Gonçalves, B., Menezes, R., Sinatra, R. (eds) Complex Networks VII. Studies in Computational Intelligence, vol 644. Springer, Cham. https://doi.org/10.1007/978-3-319-30569-1_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-30569-1_24
Published: 03 March 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-30568-4
Online ISBN: 978-3-319-30569-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics