Abstract
We already introduced the data.table package (Dowle, Srinivasan, Short, and Lianoglou, 2015). The data.table package is the heart of this chapter, covering the basics of accessing, editing, and manipulating data under the broad term data management. Although not glamorous, data management is a critical first step to data visualization or analysis. Furthermore, the majority of time on a particular analysis project often comes from data management. For example, running a linear model in R takes one line of code, once the data is clean and in the expected format. Data management is challenging because raw data comes in all types, shapes, and formats, and missing data is common. In addition, you may also have to combine or merge separate data sources. In this chapter, we go beyond the basic use of data.table to more-complex data management tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2016 Matt Wiley and Joshua F. Wiley
About this chapter
Cite this chapter
Wiley, M., Wiley, J.F. (2016). Data Munging with data.table. In: Advanced R. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-2077-1_8
Download citation
DOI: https://doi.org/10.1007/978-1-4842-2077-1_8
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-2076-4
Online ISBN: 978-1-4842-2077-1
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)