Etienne Posthumus epoz

@epoz
epoz / STCN raw parser
Last active December 27, 2015 19:39
Read an STCN (http://picarta.pica.nl/xslt/DB=3.11/) raw data dump, parse it, and write it out as a columnar tab-separated-values file that opens more easily in Excel
'''
Read in a STCN data dump file, and convert it to a CSV file (delimited with tabs)
The data looks something like this:
SET: S0 [10000] TTL: 5 PPN: 339722142 PAG: 1 .
Ingevoerd: 1996:31-01-12 Gewijzigd: 1996:07-02-12 09:12:25 Status: 1996:31-01-12
0500 Aav
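The preview above is cut off, so the gist's actual parsing logic is not visible. As a minimal sketch of the idea, the following assumes (based only on the three sample lines shown) that each record starts with a `SET:` line carrying a `PPN:` identifier, and that field lines consist of a four-digit tag, a space, and a value. The names `parse_records` and `write_tsv` are hypothetical and not taken from the gist.

```python
import csv
import re

# Assumption: field lines look like "0500 Aav" (four-digit tag, space, value).
FIELD_RE = re.compile(r"^(\d{4}) (.*)$")
# Assumption: the record header line contains "PPN: <identifier>".
PPN_RE = re.compile(r"PPN: (\S+)")


def parse_records(lines):
    """Group raw dump lines into one dict per record, keyed by field tag."""
    records = []
    current = None
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("SET:"):
            # A "SET:" line opens a new record.
            current = {}
            records.append(current)
            match = PPN_RE.search(line)
            if match:
                current["PPN"] = match.group(1)
            continue
        match = FIELD_RE.match(line)
        if match and current is not None:
            tag, value = match.groups()
            # Repeated tags are joined so that each tag maps to one column.
            if tag in current:
                current[tag] += "; " + value
            else:
                current[tag] = value
    return records


def write_tsv(records, out):
    """Write records as tab-separated columns, one column per tag seen."""
    columns = sorted({key for rec in records for key in rec})
    writer = csv.DictWriter(
        out, fieldnames=columns, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    for rec in records:
        writer.writerow(rec)  # missing tags are written as empty cells
```

Lines that match neither pattern, such as the `Ingevoerd:` line, are skipped in this sketch; a real converter would probably want to keep those dates as well.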
epoz / gimmesrc.py
Created October 1, 2012 19:02
Retrieves the full source of a title from Wikisource
#!/usr/bin/env python
# Example: python gimmesrc.py De_Cive > txt
import sys, urllib, urllib2
URL = 'http://en.wikisource.org/w/index.php?action=raw&title='
if __name__ == '__main__':
    title = sys.argv[1]
    title_parts = []
epoz / gist:3760964
Created September 21, 2012 11:26
Markdown Watcher and auto regenerator
#!/usr/bin/env python
'''
Markdown Watcher and auto regenerator
While sitting in an aeroplane, I found myself editing a bunch of Markdown
files and needing to regenerate the HTML and preview in a browser.
It was tedious re-typing the 'markdown' command every time, so I made
this little script to watch the *.markdown files and create the corresponding
.html flavour if the modification date of the markdown file is newer or the
html does not exist yet.
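The rest of the script is not shown in this preview. The staleness rule described above (rebuild when the HTML file is missing or older than its Markdown source) could be sketched roughly as follows. The function names `needs_rebuild` and `stale_markdown_files` are hypothetical, and this is written for Python 3 rather than being the gist's own code.

```python
import glob
import os


def needs_rebuild(md_path, html_path):
    """Return True if the HTML file is missing or older than its source."""
    if not os.path.exists(html_path):
        return True
    return os.path.getmtime(md_path) > os.path.getmtime(html_path)


def stale_markdown_files(directory="."):
    """List (markdown, html) path pairs whose HTML needs regenerating."""
    pairs = []
    pattern = os.path.join(directory, "*.markdown")
    for md_path in sorted(glob.glob(pattern)):
        # foo.markdown is paired with foo.html in the same directory.
        html_path = os.path.splitext(md_path)[0] + ".html"
        if needs_rebuild(md_path, html_path):
            pairs.append((md_path, html_path))
    return pairs
```

A watcher would presumably call `stale_markdown_files` in a loop with a short sleep and run the `markdown` command for each returned pair; that part is left out here because the command's invocation is not visible in the preview.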
epoz / convert.py
Created August 1, 2012 08:29
Converting BNE Bibliography ntriples to BibJSON
import sys
import json
import ntriples
from datetime import datetime
import httplib
ES_URL = "localhost:9200"
ES_PATH = "/bibserver/"
field_mapping = {
epoz / Default (OSX).sublime-keymap
Created July 19, 2012 12:48
Sublime Text 2 command to get latest Iconclass Clipboard
[
{ "keys": ["ctrl+i"], "command": "icclipboard" }
]
epoz / ntriple2bibjson.py
Created June 14, 2012 08:23
Converting DNB Bibliography ntriples to BibJSON
import sys
import json
import ntriples
import time
from datetime import datetime
import httplib
ES_URL = "localhost:9200"
ES_PATH = "/bibserver/"
epoz / BL_RDF_2json.py
Created January 18, 2012 16:56
Streaming RDF/XML to JSON converter for the BL catalog data using Python and iterparse
This gist has been replaced by: https://gist.github.com/1731588
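The converter itself is not shown here. As a rough illustration of the streaming approach the description names, the sketch below uses `xml.etree.ElementTree.iterparse` to emit one dict per record element and clears each element afterwards so that memory use stays flat. The function name `stream_records` and the `record_tag` parameter are hypothetical, and the flat child-tag-to-text mapping is an assumption, not the BL catalog's actual structure.

```python
import json
import xml.etree.ElementTree as ET


def stream_records(source, record_tag):
    """Yield one dict per record element without loading the whole file."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == record_tag:
            # Assumption: each record is a flat list of child elements.
            yield {child.tag: child.text for child in elem}
            # Drop the element's children once the record has been read.
            elem.clear()


def records_to_json_lines(source, record_tag):
    """Serialize each streamed record as one JSON document per line."""
    return [json.dumps(rec, sort_keys=True)
            for rec in stream_records(source, record_tag)]
```

In real RDF/XML the tags are namespace-qualified, so `record_tag` would need the `{namespace}localname` form that ElementTree uses.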