Skip to content

Instantly share code, notes, and snippets.

@BohuTANG
Created June 11, 2012 06:34
Show Gist options
  • Save BohuTANG/2908745 to your computer and use it in GitHub Desktop.
Save BohuTANG/2908745 to your computer and use it in GitHub Desktop.
arxiv.org fetcher
import re
import urllib
url = "http://arxiv.org/list/cs.DB/recent"
s=urllib.urlopen(url).read()
ss = s.replace("\n","")
titles=re.findall('<span class="descriptor">Title:</span>(.+?)</div>', ss, re.I)
i = 0
for t in titles:
print '(%d) %s' %(i, t)
i = i + 1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment