Abstract
We’ve already seen most of the core building blocks that make up the modern web: HTTP, HTML, and CSS. However, we’re not completely finished with HTTP yet. So far, we’ve only been using one of HTTP’s request “verbs” or “methods”: “GET”. This chapter will introduce you to the other methods HTTP provides, starting with the “POST” method that is commonly used to submit web forms. Next, this chapter explores HTTP request and reply headers in more depth, and shows how you can deal with cookies using the requests library. The chapter closes with a discussion on other common forms of content other than HTML formatted pages you will frequently encounter on the web, and how to deal with them in your web scraping projects.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Seppe vanden Broucke and Bart Baesens
About this chapter
Cite this chapter
vanden Broucke, S., Baesens, B. (2018). Delving Deeper in HTTP. In: Practical Web Scraping for Data Science. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3582-9_4
Download citation
DOI: https://doi.org/10.1007/978-1-4842-3582-9_4
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-3581-2
Online ISBN: 978-1-4842-3582-9
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)