Skip to main content

GitHub and Stack Overflow: Analyzing Developer Interests Across Multiple Social Collaborative Platforms

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10540))

Included in the following conference series:

Abstract

Increasingly, software developers are using a wide array of social collaborative platforms for software development and learning. In this work, we examined the similarities in developer’s interests within and across GitHub and Stack Overflow. Our study finds that developers share common interests in GitHub and Stack Overflow; on average, 39% of the GitHub repositories and Stack Overflow questions that a developer had participated fall in the common interests. Also, developers do share similar interests with other developers who co-participated activities in the two platforms. In particular, developers who co-commit and co-pull-request same GitHub repositories and co-answer same Stack Overflow questions, share more common interests compare to other developers who co-participate in other platform activities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Notes

  1. 1.

    https://github.com/.

  2. 2.

    http://stackoverflow.com/.

  3. 3.

    https://archive.org/details/stackexchange.

References

  1. Badashian, A.S., Esteki, A., Gholipour, A., Hindle, A., Stroulia, E.: Involvement, contribution and influence in GitHub and stack overflow. In: CSSE (2014)

    Google Scholar 

  2. Bajaj, K., Pattabiraman, K., Mesbah, A.: Mining questions asked by web developers. In: MSR (2014)

    Google Scholar 

  3. Barua, A., Thomas, S.W., Hassan, A.E.: What are developers talking about? an analysis of topics and trends in stack overflow. Empir. Softw. Eng. 19(3), 619–651 (2014)

    Article  Google Scholar 

  4. Bissyandé, T.F., Lo, D., Jiang, L., Réveillere, L., Klein, J., Traon, Y.L.: Got issues? who cares about it? a large scale investigation of issue trackers from GitHub. In: ISSRE (2013)

    Google Scholar 

  5. Gousios, G.: The GHTorrent dataset and tool suite. In: MSR (2013)

    Google Scholar 

  6. Jiang, J., Lo, D., He, J., Xia, X., Kochhar, P.S., Zhang, L.: Why and how developers fork what from whom in GitHub. Empir. Softw. Eng. 22(1), 547–578 (2017)

    Article  Google Scholar 

  7. Kochhar, P.S., Lo, D.: Revisiting assert use in GitHub projects. In: EASE (2017)

    Google Scholar 

  8. Rahman, M.M., Roy, C.K.: An insight into the pull requests of GitHub. In: MSR (2014)

    Google Scholar 

  9. Ray, B., Posnett, D., Filkov, V., Devanbu, P.: A large scale study of programming languages and code quality in GitHub. In: FSE (2014)

    Google Scholar 

  10. Rosen, C., Shihab, E.: What are mobile developers asking about? a large scale study using stack overflow. Empir. Softw. Eng. 21(3), 1192–1223 (2015)

    Article  Google Scholar 

  11. Sheoran, J., Blincoe, K., Kalliamvakou, E., Damian, D., Ell, J.: Understanding “watchers” on GitHub. In: MSR (2014)

    Google Scholar 

  12. Thung, F., Bissyandé, T.F., Lo, D., Jiang, L.: Network structure of social coding in GitHub. In: CSMR (2013)

    Google Scholar 

  13. Vasilescu, B., Filkov, V., Serebrenik, A.: StackOverflow and GitHub: associations between software development and crowdsourced knowledge. In: SocialCom (2013)

    Google Scholar 

  14. Vasilescu, B., Yu, Y., Wang, H., Devanbu, P., Filkov, V.: Quality and productivity outcomes relating to continuous integration in GitHub. In: FSE (2015)

    Google Scholar 

  15. Wang, S., Lo, D., Jiang, L.: An empirical study on developer interactions in StackOverflow. In: SAC (2013)

    Google Scholar 

  16. Yang, X.-L., Lo, D., Xia, X., Wan, Z.-Y., Sun, J.-L.: What security questions do developers ask? a large-scale study of stack overflow posts. J. Comput. Sci. Technol. 31(5), 910–924 (2016)

    Article  Google Scholar 

  17. Zou, J., Xu, L., Guo, W., Yan, M., Yang, D., Zhang, X.: Which non-functional requirements do developers focus on? an empirical study on stack overflow using topic analysis. In: MSR (2015)

    Google Scholar 

Download references

Acknowledgments

This research is supported by the National Research Foundation, Prime Minister’s Office, Singapore under its International Research Centres in Singapore Funding Initiative.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roy Ka-Wei Lee .

Editor information

Editors and Affiliations

Appendix 1: Example for Cross-Platform Similarity Score Calculation

Appendix 1: Example for Cross-Platform Similarity Score Calculation

Figure 4 shows an example for the calculation of cross-platform similarity score \(Sim^{SO\text {-}GH}(d)\). Consider developer d who has participated activities in GitHub and Stack Overflow. d has forked 2 repositories; Repository A which description contains the tag set {Java, Android}, and Repository B which description contains the tag set {Java}, and watched Repository C which description contains the tag set {C#}. d also favorited 2 Stack Overflow questions; Question D which are tagged with {Android}, and Question F which are tagged with {iOS}, and answered Question E which are tagged with {Java}. We can estimate d’s interests in GitHub (i.e. \(I^{GH}(d)\)) as {Java, Android, C#} and d’s interests in Stack Overflow (i.e., \(I^{SO}(d)\)) as {Android, iOS}. The common interests of d (i.e., CI(d)) would be {Java, Android}. Therefore, \(Shared^{R}(d)\) would include repositories A and B, while \(Shared^{Q}(d)\) would include questions D and E. Thus, \(Sim^{SO\text {-}GH}(d) = \frac{|2| + |2)|}{|3| + |3|}\).

Fig. 4.
figure 4

Example of cross-platform similarity score calculation

1.1 Appendix 2: Example for Co-Participation Similarity Score Calculation

Figure 5 shows an example for the calculation of co-participation similarity score for watch activity \(Sim^W(d,co^W(d))\) for developer d. Let us consider two developers d and \(d'\) and assume that there are no other developers. Developer d watched repositories A and B. Developer \(d'\) co-watched B with d. Thus, \(co^W(d)\) is {d’}. In addition to B, developer \(d'\) also watched repositories C and D. \(Shared^W(d,d')\) would then include B and C as both of the repositories share common interests with the repositories that d watched. \(Sim^W(d) = \bigg [\sum _{d'\in Co^W(d)} \frac{|2|}{|3|}\bigg ] / |1| = 0.67\).

Fig. 5.
figure 5

Example of co-participation similarity score calculation for watch activity

It is important to note that the co-participation similarity scores only consider the similarity in interests between pairs of developers who have co-participated in at least one common repository or question with each other but the developers may have participated in many other repositories and questions different from each other. For example, developers d and \(d'\) only watched one common repository but they had watched many other repositories which were different from each other. Also, when computing the co-participation similarity measure between developers who participated in a particular activity, we only consider the interests of the developers in that target activity. For instance, when computing \(Sim^W(d)\), we consider how similar are the interests between developers based only on the watch activities, i.e., we do not consider repositories forked by the developers or questions answered and favorited by the developers.

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Lee, R.KW., Lo, D. (2017). GitHub and Stack Overflow: Analyzing Developer Interests Across Multiple Social Collaborative Platforms. In: Ciampaglia, G., Mashhadi, A., Yasseri, T. (eds) Social Informatics. SocInfo 2017. Lecture Notes in Computer Science(), vol 10540. Springer, Cham. https://doi.org/10.1007/978-3-319-67256-4_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67256-4_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67255-7

  • Online ISBN: 978-3-319-67256-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics