Skip to content

Instantly share code, notes, and snippets.

@kangwonlee
Last active March 6, 2017 01:08
Show Gist options
  • Save kangwonlee/eb1f086628f0555718b6b00a7a02cfc5 to your computer and use it in GitHub Desktop.
Save kangwonlee/eb1f086628f0555718b6b00a7a02cfc5 to your computer and use it in GitHub Desktop.
CV (Curriculum Vitae) conference paper entry parser in regular expression
"""
This is an attempt to automate extracting some fields from an entry of Curriculum Vitae on a conference publication.
For more about regular expression, please refer to https://docs.python.org/library/re.html.
"""
import re
def get_parser_cv_conf_paper():
return re.compile(
r'(?P<paper_number>\(\d+\))[.,]?\s*(?P<authors>.+?)\s*[,:;]?\s*["“](?P<paper_title>.+?)["“”],?\s*(?P<conference_info>.+)')
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment