Abstract
Predicting what type of work was performed by developers in OSS has attracted several researchers. Previous studies have shown that fixing bugs are usually done by modifying very few files. In this study, we try to predict the trends and characteristics of how developers commit source code to version control systems (SVN, CVS, etc.). We want to investigate if number of files that are committed together is an indication of type of activity. In other words, is number of files in a commit related to the type of activity? Interestingly, we found that a strong correlation exits between type of activity and number of files involved in a commit. To make sure our study is not biased for a specific project, we have analyzed ten projects. Parsing messages in commits and processing the results, we found in coherent with previous studies bug fixing involves few files in a commit and commits that affect several files are usually related to adding functionality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kagdi H, Collard ML, Maletic JI (2007) A survey and taxonomy of approaches for mining software repositories in the context of software evolution. J Softw Maint Evolut Res Pr 19(2):77–131 (March/April)
Santiago D, Liliana T (2008) Database specification deliverable ID: D3.1, Version: 2.0, Oct 10
Arafat O, Riehle D (2009) The commit size distribution of open source software. HICSS. pp 1–8
Purushothaman R, Perry DE (2005) Toward understanding the rhetoric of small source code changes. IEEE Trans Software Eng 31(6):511–526
Hindle A, Godfrey MW, Holt RC (2007) Release pattern discovery via partitioning: methodology and case study. ACM Special Interest Group on Software Engineering, Washington, pp 19–27
Alali A, Kagdi H, Maletic J (2008) What’s a typical commit? A characterization of open source software repositories. ICPC. IEEE, Amsterdam, pp 182–191
Scacchi W (2007) Free/open source software development: recent research results and emerging opportunities. ESEC/SIGSOFT FSE (Companion), pp 459–468
Hindle A, Germán DM, Holt RC (2008) What do large commits tell us: a taxonomical study of large commits. MSR. ACM Press, New York, pp 99–108
Romain Robbes (2007) Mining a change-based software repository. MSR. p 15
Ohloh,Inc.Ohloh API. See http://www.ohloh.net/api
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media B.V.
About this paper
Cite this paper
Marzban, M., Khoshmanesh, Z., Sami, A. (2012). Cohesion Between Size of Commit and Type of Commit. In: J. (Jong Hyuk) Park, J., Chao, HC., S. Obaidat, M., Kim, J. (eds) Computer Science and Convergence. Lecture Notes in Electrical Engineering, vol 114. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2792-2_22
Download citation
DOI: https://doi.org/10.1007/978-94-007-2792-2_22
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2791-5
Online ISBN: 978-94-007-2792-2
eBook Packages: EngineeringEngineering (R0)