Cleaning & Post-Processing Scrapy Data - Python Scrapy Beginners Series (Part 2)

Описание к видео Cleaning & Post-Processing Scrapy Data - Python Scrapy Beginners Series (Part 2)

Here we look at how to correct our scraped data when we run into issues such as:
- duplicate data
- missing data

We also look at how to use Scrapy item loaders and Scrapy Pipelines to process our data before we save it.

Useful links:
The github repo where you can download the code:
https://github.com/ScrapeOps/python-s...

The article which corresponds to this video:
https://thepythonscrapyplaybook.com/s...


00:00 - Intro
01:17 - Using Scrapy Items to better control our data
05:00 - Using Scrapy Item Loaders to remove a symbol from our scraped prices
14:15 - Using Scrapy Item Pipelines, converting pounds to dollars and removing duplicates
24:35 - What's coming in part 3

Комментарии

Информация по комментариям в разработке