Abstract
Building Machine Learning systems and pipelines take significant effort, which is evident from the knowledge you gained in the previous chapters. In the first chapter, we presented some high-level architecture for building Machine Learning pipelines. The path from data to insights and information is not an easy and direct one. It is tough and also iterative in nature involving data scientists and analysts to reiterate through several steps multiple times to get to the perfect model and derive correct insights. The limitation of Machine Learning algorithms is the fact that they can only understand numerical values as inputs. This is because, at the heart of any algorithm, we usually have multiple mathematical equations, constraints, optimizations and computations. Hence it is almost impossible for us to feed raw data into any algorithm and expect results. This is where features and attributes are extremely helpful in building models on top of our data.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsAuthor information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Dipanjan Sarkar, Raghav Bali and Tushar Sharma
About this chapter
Cite this chapter
Sarkar, D., Bali, R., Sharma, T. (2018). Feature Engineering and Selection. In: Practical Machine Learning with Python. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3207-1_4
Download citation
DOI: https://doi.org/10.1007/978-1-4842-3207-1_4
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-3206-4
Online ISBN: 978-1-4842-3207-1
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)