Delving Deeper in HTTP

Practical Web Scraping for Data Science

Abstract

We’ve already seen most of the core building blocks that make up the modern web: HTTP, HTML, and CSS. However, we’re not completely finished with HTTP yet. So far, we’ve only been using one of HTTP’s request “verbs” or “methods”: “GET”. This chapter introduces the other methods HTTP provides, starting with the “POST” method, which is commonly used to submit web forms. Next, it explores HTTP request and reply headers in more depth and shows how to deal with cookies using the requests library. The chapter closes with a discussion of common forms of content other than HTML-formatted pages that you will frequently encounter on the web, and of how to deal with them in your web scraping projects.

Copyright information

© 2018 Seppe vanden Broucke and Bart Baesens

About this chapter

Cite this chapter

vanden Broucke, S., Baesens, B. (2018). Delving Deeper in HTTP. In: Practical Web Scraping for Data Science. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3582-9_4
