@toniher
Last active February 4, 2018 09:36
Simple Python script that fetches the plain-text intro extract of a Spanish Wikipedia (es.wikipedia.org) page
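import json
import sys
import urllib.parse

import requests

# Page title comes from the command-line arguments, joined with spaces
pagename = ' '.join(sys.argv[1:])

# Ask the es.wikipedia.org API for the plain-text intro extract, following redirects
wikiurl = "https://es.wikipedia.org/w/api.php?action=query&format=json&titles=" + urllib.parse.quote_plus(pagename) + "&prop=extracts&exintro&explaintext&redirects=true"
r = requests.get(wikiurl)
data = json.loads(r.content)

# Walk the response defensively: any of these keys may be missing
if data:
    if 'query' in data:
        if 'pages' in data['query']:
            for key in data['query']['pages']:
                page = data['query']['pages'][key]
                if 'extract' in page:
                    extract = page['extract']
                    # Python 3 strings are Unicode, so no explicit encode is needed
                    print(extract)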
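
The nested key checks in the script could also be pulled into a small pure function, so the parsing can be tried without a network call. This is only a sketch: `get_extracts` and the `sample` response are hypothetical names and data, not part of the original gist, and the sample just mimics the `query` / `pages` / `extract` layout the script already relies on.

```python
def get_extracts(data):
    """Return every 'extract' string found in a parsed API response."""
    # Missing 'query' or 'pages' keys fall back to empty dicts
    pages = (data or {}).get("query", {}).get("pages", {})
    # Pages without an 'extract' field (e.g. missing titles) are skipped
    return [page["extract"] for page in pages.values() if "extract" in page]


# Hypothetical response, shaped like the JSON the script reads
sample = {
    "query": {
        "pages": {
            "1": {"title": "Ejemplo", "extract": "Texto de ejemplo."},
            "-1": {"title": "No existe", "missing": ""},
        }
    }
}

print(get_extracts(sample))
```

With a helper like this, the main script would reduce to fetching the JSON and printing each item of the returned list.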