Skip to content

Instantly share code, notes, and snippets.

@brendon-codes
Created March 27, 2012 04:14
Show Gist options
  • Save brendon-codes/2212479 to your computer and use it in GitHub Desktop.
Save brendon-codes/2212479 to your computer and use it in GitHub Desktop.
Daring Fireball URL RegEx Matcher for Python
import re
def url_matcher():
"""
See: http://daringfireball.net/2010/07/improved_regex_for_matching_urls
"""
return re.compile(
ur"""(?i)\b((?:https?://|www\d{0,3}[.]|[a-z0-9.\-]+[.][a-z]{2,4}/)"""
ur"""(?:[^\s()<>]+|\(([^\s()<>]+|(\([^\s()<>]+\)))*\))+"""
ur"""(?:\(([^\s()<>]+|(\([^\s()<>]+\)))*\)|[^\s`!()"""
ur"""\[\]{};:'".,<>?\u00AB\u00BB\u201C\u201D\u2018\u2019]))"""
)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment