@kami4ka
Last active February 19, 2025 09:17
ScrapingAnt Backend Test Task

ScrapingAnt's Backend Test Task

Task Description

You need to implement a web scraping script that extracts product data from the e-commerce demo website books.toscrape.com and stores the extracted data in a database.

The script should:

  • Scrape product details (title, price, availability, quantity in stock) from multiple pages of a given website.
  • Use asyncio to fetch data from multiple pages concurrently.
  • Store the extracted data in a database (your choice) deployed as a separate Docker container.
  • Any other additions (such as tests or infrastructure setup options) are not required, but would be a plus and a topic for further discussion during the tech interview.
  • Trying ScrapingAnt's API for the same task is a good optional first look at the product. Keep in mind that the free plan allows only 1 concurrent request if you use this approach.

Deliverables

  • A public git repository containing the project code
  • A short README explaining how to set up and run the project.

Time Estimate

1-2 hours for a working draft; it does not need to be perfect.
