You need to implement a web scraping script that extracts product data from an e-commerce website books.toscrape.com and stores the extracted data in a database.
- Scrape product details (title, price, availability, quantity in stock) from multiple pages of a given website.
- Use Asyncio to efficiently fetch data from multiple pages in parallel.
- Store the extracted data in a database (your choice) deployed as a separate Docker container.
- All other adjustments (like tests, infrastructure setup options, etc.) would be a plus for the additional discussion during the tech interview, but not required.
- Very good as an additional first look to try ScrapingAnt's API for the same task (still, free plan would allow only 1 concurrent request, so keep it in mind if using this approach)
- A public git repository containing the project code
- A short README explaining how to set up and run the project.
1-2 hours for a working draft without perfection