Skip to main content

Selecting Open Source Projects for Traceability Case Studies

  • Conference paper
  • First Online:
Requirements Engineering: Foundation for Software Quality (REFSQ 2019)

Abstract

[Context & Motivation] Once research questions and initial theories have shaped, empirical research typically requires to select cases to study subsumed ideas. Issue trackers of todays open source systems (OSS) are a gold mine for empirical research, not least to study trace links among the included issue artifacts. [Question / problem] The huge amount of available OSS projects complicates the process of finding suitable cases to support the research goals. Further, simply picking a large number of projects on a random basis does not imply generalizability. Therefore the selection process should be carefully designed. [Principle ideas / results] In this paper we propose a method to choose OSS projects to study trace links found in issue tracking systems. Builds upon purposive sampling and cluster analysis, relevant project characteristics are identified whereas irrelevant information is filtered. Every step of the method is demonstrated on a live example. [Contributions] The proposed strategy selects an information-rich, representative and diverse sample of OSS to perform a traceability case study. Our work may be used as practical guide for other researchers to perform project selection tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.apache.org.

  2. 2.

    https://jira.apache.org/jira/browse/PIG-4059.

  3. 3.

    https://projects.apache.org/projects.html.

References

  1. Apache Annual Report FY2018. https://s.apache.org/FY2018AnnualReport (2018). Accessed 29 Sept 2018

  2. Apache Incubation Process. https://incubator.apache.org/policy/process.html (2018). Accessed 29 Sept 2018

  3. Commenting on an Issue. https://confluence.atlassian.com/jira064/commenting-on-an-issue-720416302.html (2018). Accessed 29 Sept 2018

  4. https://www.atlassian.com/customers (2019). Accessed 03 Jan 2019

  5. Comparison of issue-tracking systems. https://en.wikipedia.org/wiki/Comparison_of_issue-tracking_systems (2019). Accessed 03 Jan 2019

  6. Antoniol, G., Canfora, G., Casazza, G., De Lucia, A., Merlo, E.: Recovering traceability links between code and documentation. IEEE Trans. Softw. Eng. 28(10) 970–983 (2002)

    Google Scholar 

  7. Asuncion, H.U., Asuncion, A.U., Taylor, R.N.: Software traceability with topic modeling. In: Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering, ICSE 2010, Cape Town, South Africa, 1–8 May 2010. vol. 1, ACM (2010)

    Google Scholar 

  8. Curtis, S., Gesler, W., Smith, G., Washburn, S.: Approaches to sampling and case selection in qualitative research: examples in the geography of health. Soc. Sci. Med. 50(7—-8), 1001–1014 (2000)

    Article  Google Scholar 

  9. De Lucia, A., Fasano, F., Oliveto, R., Tortora, G.: 20th IEEE International Conference on Software Maintenance (ICSM) (2004)

    Google Scholar 

  10. Dekhtyar, A., Hayes, J.H., Sundaram, S.K., Holbrook, E.A., Dekhtyar, O.: Technique integration for requirements assessment. In: 15th IEEE International Requirements Engineering Conference, RE 2007, 15–19th October 2007, New Delhi, India. IEEE Computer Society (2007)

    Google Scholar 

  11. Flyvbjerg, B.: Five misunderstandings about case-study research. Qual. Inq. 12(2), 219–245 (2006)

    Article  Google Scholar 

  12. Foucault, M., Palyart, M., Falleri, J., Blanc, X.: Computing contextual metric thresholds, ACM (2014)

    Google Scholar 

  13. Ishak, N.M., Bakar, A.Y.A.: Developing sampling frame for case study: challenges and conditions. World J. Educ. 4(3), 29–35 (2014)

    Google Scholar 

  14. ISO/IEC 14143/1: Information technology, software measurement, functional size measurement, Part 1: definition of concepts. Standard, International Organization for Standardization, Geneva (2007)

    Google Scholar 

  15. Issue management tools - popularity ranking (2017). https://project-management.zone/ranking/category/issue

  16. Jira Issue Tracking System (2018). https://www.atlassian.com/software/jira

  17. Miles, M.B., Huberman, A.M., Huberman, M.A., Huberman, M.: Qualitative Data Analysis: An Expanded Sourceboo. Sage, Thousand Oaks (1994)

    MATH  Google Scholar 

  18. Nagappan, M., Zimmermann, T., Bird, C.: Diversity in software engineering research, ACM (2013)

    Google Scholar 

  19. Palinkas, L.A., Horwitz, S.M., Green, C.A., Wisdom, J.P., Duan, N., Hoagwood, K.: Purposeful sampling for qualitative data collection and analysis in mixed method implementation research. Adm. Policy Ment. Health Ment. Health Serv. Res. 42(5), 533–544 (2015)

    Article  Google Scholar 

  20. Patton, M.Q.: Qualitative Evaluation and Research Methods. Sage Publications, Thousand Oaks (1990)

    Google Scholar 

  21. Rath, M., Rempel, P., Mäder, P.: The IlmSeven dataset. In: 25th IEEE International Requirements Engineering Conference, RE 2017, Lisbon, Portugal, 4–8 September 2017. pp. 516–519. IEEE Computer Society (2017)

    Google Scholar 

  22. Rempel, P., Mäder, P., Kuschke, T.: Towards feature-aware retrieval of refinement traces. In: 7th International Workshop on Traceability in Emerging Forms of Software Engineering, TEFSE 2013, 19 May 2013, San Francisco, CA, USA. IEEE Computer Society (2013)

    Google Scholar 

  23. Runeson, P., Höst, M., Rainer, A., Regnell, B.: Case Study Research in Software Engineering - Guidelines and Examples. Wiley, Hoboken (2012)

    Book  Google Scholar 

  24. Suri, H.: Purposeful sampling in qualitative research synthesis. Qual. Res. J. 11(2), 63–75 (2011)

    Article  Google Scholar 

  25. Teddlie, C., Yu, F.: Mixed methods sampling: a typology with examples. J. Mixed Methods Res. 1(1), 77–100 (2007)

    Google Scholar 

  26. Tomova, M.T., Rath, M., Mäder, P.: Use of trace link types in issue tracking systems. In: Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, ICSE 2018, Gothenburg, Sweden, 27 May–03 June 2018. ACM (2018)

    Google Scholar 

  27. Tryfos, P.: Methods for Business Analysis and Forecasting: Text and Cases. Wiley, Hoboken (1998)

    Google Scholar 

  28. Van Ryzin, G.G.: Cluster analysis as a basis for purposive sampling of projects in case study evaluations. Eval. Pract. 16(2), 109–119 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michael Rath .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rath, M., Tomova, M.T., Mäder, P. (2019). Selecting Open Source Projects for Traceability Case Studies. In: Knauss, E., Goedicke, M. (eds) Requirements Engineering: Foundation for Software Quality. REFSQ 2019. Lecture Notes in Computer Science(), vol 11412. Springer, Cham. https://doi.org/10.1007/978-3-030-15538-4_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-15538-4_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-15537-7

  • Online ISBN: 978-3-030-15538-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics