Abstract
An integral part of data science is executing efficient computations on large datasets. Such computations are driven by computer programs, and as problems increase in size and complexity, the accompanying software solutions tend to become larger and more intricate, too. Such software is built by organizations structured around teams. Data science is also a team effort, so effective communication and collaboration is extremely important in both software engineering and data science. Software developed by one team must be comprehensible to other teams to foster use, reuse, and evolution. This is where software maintainability, as a pertinent quality attribute, kicks in. This chapter presents important lessons from the realm of software engineering in the context of data science. The aim is to educate data science practitioners how to craft evolvable programs and increase productivity.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
You may also want to read the blog at https://people.neilon.software . This article depicts in a funny way various stereotypes in relation to roles. Managing people is tedious, and social life in an organization is as important as technical expertise.
- 2.
We still don’t understand the underlying algorithm. That is another level of reverse engineering. Apparently, we may control the depth of our quest.
- 3.
For example, the Lucas sequence starts differently, although it has the same recurrent formula as the Fibonacci sequence: L(n) = L(n - 1) + L(n - 2), L(0) = 2, L(1) = 1.
- 4.
Walt Scacchi, “Winning and Losing in Large-Scale Software Development: A Multi-Decade Perspective,” Computer 51, no. 10 (October 2018): 58–65.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2019 Ervin Varga
About this chapter
Cite this chapter
Varga, E. (2019). Software Engineering. In: Practical Data Science with Python 3. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-4859-1_3
Download citation
DOI: https://doi.org/10.1007/978-1-4842-4859-1_3
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-4858-4
Online ISBN: 978-1-4842-4859-1
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)