Skip to content

Instantly share code, notes, and snippets.

View ross-spencer's full-sized avatar
💭
🖖

Ross Spencer ross-spencer

💭
🖖
View GitHub Profile
@ross-spencer
ross-spencer / LDP-Queries
Created April 27, 2015 20:16
Building queries on Linked Data PRONOM
The-fr.org (based on Linked Data PRONOM):
Types in PRONOM:
http://the-fr.org/public/sparql/endpoint.php?query=SELECT+%3Fo+WHERE+%7B%0D%0A++%3Fs+%3Chttp%3A%2F%2Fthe-fr.org%2Fprop%2Fformat-registry%2FformatType%3E+%3Fo.%0D%0A%7D%0D%0ALIMIT+100&output=&jsonp=&key=&show_inline=1
Example of Word Processed Type:
http://the-fr.org/public/sparql/endpoint.php?query=SELECT+%3Fs+WHERE+%7B%0D%0A++%3Fs+%3Chttp%3A%2F%2Fthe-fr.org%2Fprop%2Fformat-registry%2FformatType%3E+%3Chttp%3A%2F%2Fthe-fr.org%2Fdef%2Fformat-registry%2FWordprocessedText%3E+.%0D%0A%7D%0D%0ALIMIT+100&output=&jsonp=&key=&show_inline=1
# emoji-data.txt
# Date: 2016-06-02, 09:26:10 GMT
# © 2016 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use, see http://www.unicode.org/terms_of_use.html
#
# Emoji Data for UTR #51
# Version: 3.0
#
# For documentation and usage, see http://www.unicode.org/reports/tr51
@ross-spencer
ross-spencer / Mozilla-Last-Modified-Thread.txt
Last active October 2, 2015 01:07
Maintaining last-modified dates on delivery of Archival materials from a repository
https://bugzilla.mozilla.org/show_bug.cgi?id=178506
Bug 178506 - save original last-modified date on file downloads
Resolved: WontFix
Reported: 2002-11-05 12:09 PST
--
@ross-spencer
ross-spencer / DocumentSummaryInformation
Last active October 15, 2015 22:33
corrupt-msoleps-stream
FE FF 00 00 01 00 02 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 01 00 00 00 05 D5 CD D5 9C 2E 1B 10 93 97 08 00 2B 2C F9 AE 30 00 00 00 10 00 00 00 01 00 00 00 01 00 00 00 10 00 00 00 02 00 00 00 E9 FD 00 00
Using Fuzzy Hashing to Create New Relationships Between Digital Archives
Fuzzy hases enable us to spot digital content that shares a high percentage of similar byte sequences.
There may be similarities between two word documents for example, but the items may not be the same, i.e.
a single byte-change will render comparision using cryptographic-hash useless.
A fuzzy hash offers the ability to spot similar items, n.b. not identical, thus enabling us to infer
relationshops between material - a capability we have very few options for at present.
Committing a fuzzy hash of an object to a digital repository can enable comparison at any point in time
@ross-spencer
ross-spencer / custom-props-access.go
Created October 20, 2015 02:47
Access OLE2 properties using RL's Golang code
//Code from Richard Lehane @richardlehane
package main
import (
"fmt"
"os"
"github.com/richardlehane/msoleps"
)
@ross-spencer
ross-spencer / PKZIP APPNOTE 6.3.4.
Created November 1, 2015 00:50
Unmodified copy of PKZIP Speciffication, source date: 1 November 2015: https://pkware.cachefly.net/webdocs/casestudies/APPNOTE.TXT
File: APPNOTE.TXT - .ZIP File Format Specification
Version: 6.3.4
Status: Final - replaces version 6.3.3
Revised: October 1, 2014
Copyright (c) 1989 - 2014 PKWARE Inc., All Rights Reserved.
1.0 Introduction
---------------
1.1 Purpose
@ross-spencer
ross-spencer / SSDEEP Spamsum Comparison - Overview
Last active November 10, 2015 22:20
SSDEEP and Spamsum algorithm comparison
Concept: http://dfrws.org/2006/proceedings/12-Kornblum.pdf
Comparison of two tools. The goal, utilize SPAMSUM in Golang to achieve SSDEEP like capability.
SSDEEP: http://ssdeep.sourceforge.net/
Spamsum: https://github.com/michielbuddingh/spamsum
Test Corpus: https://github.com/ross-spencer/opf-format-corpus
@ross-spencer
ross-spencer / file-generator.go
Created November 12, 2015 21:00
Richard's example to make random files Golang
package main
import "io/ioutil"
func main() {
bigBuff := make([]byte, 750000000)
ioutil.WriteFile("bigfile.test", bigBuff, 0666)
}
@ross-spencer
ross-spencer / rosetta-json.json
Last active November 25, 2015 07:37
Prototype JSON layout for Rosetta SIP ingest
{
"title": "Rosetta IE Mets JSON Example",
"author": "ross-spencer",
"description": "Minimum feature JSON for server/UI Mets Ingest",
"validator": "http://jsonlint.com/",
"standard": "http://ex-libris.com/spencer/ie-mets-json-schema/",
"rosetta.sip.item": [
{
"dc.fields": [
{