Skip to content

Instantly share code, notes, and snippets.

@mGalarnyk
Created September 26, 2017 03:48
Show Gist options
  • Save mGalarnyk/8014160aa48f4bdc44cfe14bda0faad7 to your computer and use it in GitHub Desktop.
Save mGalarnyk/8014160aa48f4bdc44cfe14bda0faad7 to your computer and use it in GitHub Desktop.
How many additional start_urls we want for the blog post: and youtube tutorial: https://youtu.be/O_j3OTXw2_E
for href in response.xpath("//h2[contains(@class, 'title headline-font')]/a[contains(@class, 'campaign-link')]//@href"):
# add the scheme, eg http://
url = "https:" + href.extract()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment