If you have not yet signed up for the Zyte Developers Community newsletter, you can sign up here.
In this issue:
- Web scraping for a COVID-19 vaccine
- Scraping Reddit with Scrapy
- Easiest way to extract data from Wikipedia
- Dateparser – Python parser for human-readable dates
- A sip of BeautifulSoup
Web scraping for a COVID-19 vaccine
Miguel Magalhães was having a hard time finding COVID-19 vaccine slots in France, so he built a simple Python script that keeps checking the server for the next available slot until it finds a vacancy.
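The check-until-found idea can be sketched roughly as below. This is not Miguel's script: `wait_for_slot`, its parameters, and the canned responses are hypothetical names made up for illustration, and the `check` callable stands in for whatever HTTP request the real booking site would need.

```python
import time
from typing import Callable, Optional


def wait_for_slot(
    check: Callable[[], Optional[str]],
    interval_seconds: float = 60.0,
    max_attempts: int = 10,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call check() repeatedly until it returns a slot or attempts run out."""
    for attempt in range(max_attempts):
        slot = check()
        if slot is not None:
            return slot
        # Pause between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    return None


# Hypothetical stand-in for a real request: no slot on the first two checks.
responses = iter([None, None, "2021-06-01 09:30"])
found = wait_for_slot(lambda: next(responses), interval_seconds=0, max_attempts=5)
print(found)
```

Capping the number of attempts and waiting between them keeps a script like this polite towards the server it is checking.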
Scraping Reddit with Scrapy
This one is a bit long but an interesting read. Brendan Ferris uses Scrapy to extract multiple fields, such as username, title, and content link, from posts on old.reddit.com.
Easiest way to extract data from Wikipedia
In this article, Elliot discusses the Wikipedia module for Python, which lets you pull data from Wikipedia with very little code.
Dateparser – Python parser for human-readable dates
Dateparser was developed to make date extraction from HTML pages easier. Initially used by web scraping developers, it was quickly adopted by the wider community and has since been used in applications such as command-line tools and chatbots. This blog post covers everything you need to know about date parsing.
A sip of BeautifulSoup
This article is for those among us who are just starting to get acquainted with web scraping. Runjot Kaur’s introduction to BeautifulSoup is very easy to understand.
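If you have never seen BeautifulSoup in action, the sketch below gives a rough idea. It is not taken from the article: the HTML snippet is invented, and it assumes the `beautifulsoup4` package is installed (`pip install beautifulsoup4`).

```python
from bs4 import BeautifulSoup

# A tiny, made-up page so the example needs no network access.
html = """
<html>
  <body>
    <h1>Newsletter</h1>
    <a href="/issue-1">Issue 1</a>
    <a href="/issue-2">Issue 2</a>
  </body>
</html>
"""

# Parse with Python's built-in parser, so no extra parser library is needed.
soup = BeautifulSoup(html, "html.parser")

# Grab the heading text and every link's label and target.
title = soup.find("h1").get_text(strip=True)
links = [(a.get_text(strip=True), a["href"]) for a in soup.find_all("a", href=True)]

print(title)
print(links)
```

On a real page you would fetch the HTML first and pass the response text to `BeautifulSoup` in the same way.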