Last active
December 10, 2015 07:28
-
-
Save keroxil/4400976 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| from scrapy.spider import BaseSpider | |
| class DmozSpider(BaseSpider): | |
| name = "dmoz" | |
| start_urls = [ | |
| "http://www.dmoz.org/Computers/Programming/Languages/Python/Books/", | |
| "http://www.dmoz.org/Computers/Programming/Languages/Python/Resources/" | |
| ] | |
| def parse(self, response): | |
| # Do something useful here with the response, | |
| # e.g. extract structured data from the page | |
| pass |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment