Spark SQL, DataFrames, and Datasets

Chellappan, Subhashini; Ganesan, Dharanitharan

doi:10.1007/978-1-4842-3652-9_4

Subhashini Chellappan³ &
Dharanitharan Ganesan⁴

1451 Accesses

Abstract

In the previous chapter on Spark Core, you learned about the RDD transformations and actions as the fundamentals and building blocks of Apache Spark. In this chapter, you will learn about the concepts of Spark SQL, DataFrames, and Datasets. As a heads up, the Spark SQL DataFrames and Datasets APIs are useful to process structured file data without the use of core RDD transformations and actions. This allows programmers and developers to analyze the structured data much faster than they would by applying the transformations on RDDs created.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 44.99; Price excludes VAT (USA)

Softcover Book: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Bangalore, India
Subhashini Chellappan
Krishnagiri, Tamil Nadu, India
Dharanitharan Ganesan

Authors

Subhashini Chellappan
View author publications
You can also search for this author in PubMed Google Scholar
Dharanitharan Ganesan
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Chellappan, S., Ganesan, D. (2018). Spark SQL, DataFrames, and Datasets. In: Practical Apache Spark. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3652-9_4

Download citation

DOI: https://doi.org/10.1007/978-1-4842-3652-9_4
Published: 13 December 2018
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-3651-2
Online ISBN: 978-1-4842-3652-9
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)

Publish with us

Policies and ethics