Abstract
The amount of data that’s generated increases every day. Technology advances have facilitated the storage of huge amounts of data. This data deluge has forced users to adopt to the distributed system. Distributed systems look for distributed programming, which require extra care for fault tolerance and efficient algorithms. Distributed systems always look for two things—reliability on the system and availability of all the components.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2019 Raju Kumar Mishra and Sundar Rajan Raman
About this chapter
Cite this chapter
Mishra, R.K., Raman, S.R. (2019). Introduction to PySpark SQL. In: PySpark SQL Recipes. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-4335-0_1
Download citation
DOI: https://doi.org/10.1007/978-1-4842-4335-0_1
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-4334-3
Online ISBN: 978-1-4842-4335-0
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)