Ross Spencer ross-spencer

ross-spencer / Multiple-match-siegfried.csv
Last active March 15, 2016 04:32
OLE2 file crossed with DXF to *simulate* multiple-identification of a digital object in the DROID format identification tool.
filename filesize modified errors md5 namespace id format version mime basis warning namespace id format mime basis warning namespace id format mime basis warning
c:\working\xslt\tirara-delivery-xsl.xslt 9955 2016-01-11T10:36:27+13:00 0cb60c860093ed9bb09340f3bcf2abbd pronom fmt/96 Hypertext Markup Language text/html byte match at [[[473 5]] [[9910 7]]] (signature 1/2) extension mismatch tika application/xml application/xml byte match at 0, 5 (signature 1/5) filename mismatch freedesktop.org application/xslt+xml XSLT stylesheet application/xslt+xml extension match xslt; byte match at 0, 5 (signature 2/3); byte match at 39, 15 (signature 1/3)
c:\working\xslt\tirara-delivery-xsl.xslt 9955 2016-01-11T10:36:27+13:00 0cb60c860093ed9bb09340f3bcf2abbd pronom fmt/101 Extensible Markup Language 1.0 application/xml, text/xml byte match at 0, 19 extension mismatch tika text/html text/html byte match at 473, 5 (signature 7/16) filename mismatch
ross-spencer / ooxml-skeleton-file-working.nul
Created February 21, 2014 22:27
Example byte sequence for a working OOXML skeleton-test-suite file. Based on PRONOM sig: http://www.nationalarchives.gov.uk/pronom/fmt/189, signature file v72 and before.
50 4B 03 04 00 00 00 00 50 4B 01 02 00 00 00 00 50 4B 05 06 00 00 00 00 00 00 00 00 00 00 5B 43 6F 6E 74 65 6E 74 5F 54 79 70 65 73 5D 2E 78 6D 6C 20 A2
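As a rough illustration, the hex sequence above can be turned into a binary file with a few lines of Python. The function name and output filename are assumptions made for this sketch and do not come from the gist.

```python
import os
import tempfile

# Hex sequence copied from the gist above.
SKELETON_HEX = (
    "50 4B 03 04 00 00 00 00 50 4B 01 02 00 00 00 00 50 4B 05 06 "
    "00 00 00 00 00 00 00 00 00 00 5B 43 6F 6E 74 65 6E 74 5F 54 "
    "79 70 65 73 5D 2E 78 6D 6C 20 A2"
)


def write_skeleton(path: str) -> bytes:
    """Decode the hex string and write the raw bytes to path (hypothetical helper)."""
    data = bytes.fromhex(SKELETON_HEX)  # fromhex skips the spaces between byte pairs
    with open(path, "wb") as handle:
        handle.write(data)
    return data


# Hypothetical output location, chosen only for this example.
out_path = os.path.join(tempfile.gettempdir(), "ooxml-skeleton-example.bin")
written = write_skeleton(out_path)
print(written[:4])                        # ZIP local file header magic
print(b"[Content_Types].xml" in written)  # the embedded OOXML part name
```

The resulting file could then be passed to DROID or a similar tool to see whether the signature matches; that step is not shown here.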
ross-spencer / msoft-invalid-filename-regex
Created May 28, 2014 04:27
Draft regular expressions for detecting invalid Microsoft file names [Python]
# Based on: http://msdn.microsoft.com/en-us/library/aa365247(VS.85).aspx
# Regex strings. (?i) makes each match case-insensitive, as reserved names are on Windows.
# CON | PRN | AUX | NUL
msdn_bad_names_one = r"(?i)^(con|prn|aux|nul)($|\.$|\.[0-9a-zA-Z]{1,5}$)"  # bad name + optional extension
# COM1-COM9 | LPT1-LPT9
msdn_bad_names_two = r"(?i)^(com|lpt)[1-9]($|\.[0-9a-zA-Z]{0,5}$)"  # bad name + optional extension
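A minimal sketch of how patterns like these might be applied. The two patterns below are tightened variants written for this example (literal dots escaped, a single digit 1-9, case-insensitive matching), and `is_reserved_name` is a hypothetical helper. Neither is claimed to be the gist author's final version.

```python
import re

# Assumed patterns for this sketch: dots are escaped and matching ignores case.
BAD_NAMES_ONE = re.compile(r"^(con|prn|aux|nul)($|\.$|\.[0-9a-zA-Z]{1,5}$)", re.IGNORECASE)
BAD_NAMES_TWO = re.compile(r"^(com|lpt)[1-9]($|\.[0-9a-zA-Z]{0,5}$)", re.IGNORECASE)


def is_reserved_name(filename: str) -> bool:
    """Return True if a bare file name (no directory part) matches either pattern."""
    return bool(BAD_NAMES_ONE.match(filename) or BAD_NAMES_TWO.match(filename))


for name in ["CON", "nul.txt", "COM1", "lpt9.log", "console.txt", "COM10", "computer1.txt"]:
    print(name, is_reserved_name(name))
```

Anchoring both ends of each pattern is a deliberate choice: without it, ordinary names such as `console.txt` or `computer1.txt` would be flagged.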
ross-spencer / wave-validation.txt
Created August 22, 2014 03:12
2GB PCM WAVE Validation in JHOVE2: Windows XP Environment
15:03:25,117 [main] WARN TypeConverterDelegate : PropertyEditor [com.sun.beans.editors.EnumEditor] found through deprecated global PropertyEditorManager fallback - consider using a more isolated form of registration, e.g. on the BeanWrapper/BeanFactory!
First line repeated x 1394 times.
FileSource:
StartingOffset (byte): 0
EndingOffset (byte): 2225135359
Size (byte): 2225135360
FileSystemProperties:
Path: C:\Working\jhove2-2.1.0\rsPM_R22847462_A.wav
LastModified: 2014-08-22T14:49:46+12:00
ross-spencer / gist:a289b486851ef7fc6f51
Created August 22, 2014 04:08
FMT/648 Signature File v77. EOF Sequence difficulty for Skeleton Test Suite
<?xml version="1.0" encoding="utf-8"?>
<PRONOM-Report xmlns="http://pronom.nationalarchives.gov.uk">
<report_format_detail>
<FileFormat>
<FormatID>1447</FormatID>
<FormatName>Media View Pro</FormatName>
<FormatVersion>
</FormatVersion>
<FormatAliases>
</FormatAliases>
ross-spencer / example-jhove-jpg-output-not-well-formed
Created November 10, 2014 04:57
Example JHOVE output for a JPG file with no application marker segments
JhoveView (Rel. 1.11, 2013-09-29)
Date: 2014-11-10 17:51:43 NZDT
RepresentationInformation: C:\Documents and Settings\spencero\Desktop\testjpg\simple.JPG
ReportingModule: JPEG-hul, Rel. 1.2 (2007-02-13)
LastModified: 2014-11-10 17:05:30 NZDT
Size: 613
Format: JPEG
Status: Well-Formed, but not valid
ErrorMessage: File does not begin with SPIFF, Exif or JFIF segment
Offset: 73
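The error message above concerns the segment that follows the JPEG start-of-image marker. The sketch below shows one way to inspect that segment in Python. It is not JHOVE's own logic. The function name and sample byte strings are assumptions, and it relies on the usual marker assignments (APP0 for JFIF, APP1 for Exif, APP8 for SPIFF).

```python
from typing import Optional

# Marker byte and identifier for the three segment types named in the JHOVE message.
KNOWN_SEGMENTS = {
    "JFIF": (0xE0, b"JFIF\x00"),
    "Exif": (0xE1, b"Exif\x00\x00"),
    "SPIFF": (0xE8, b"SPIFF\x00"),
}


def leading_segment(data: bytes) -> Optional[str]:
    """Name the JFIF, Exif or SPIFF segment directly after SOI, or return None."""
    if data[:2] != b"\xff\xd8":
        raise ValueError("not a JPEG: missing SOI marker (FF D8)")
    if len(data) < 6 or data[2] != 0xFF:
        return None
    marker, payload = data[3], data[6:]  # bytes 4-5 hold the segment length
    for name, (app_marker, identifier) in KNOWN_SEGMENTS.items():
        if marker == app_marker and payload.startswith(identifier):
            return name
    return None


# Synthetic samples, made up for this example.
jfif_like = b"\xff\xd8\xff\xe0\x00\x10JFIF\x00\x01\x01"
bare = b"\xff\xd8\xff\xdb\x00\x43\x00"  # SOI followed directly by a quantization table

print(leading_segment(jfif_like))
print(leading_segment(bare))
```

A file like `bare` has no application marker segment after SOI, which is the situation the gist's title describes.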
ross-spencer / smart-phone-emoticons
Created November 27, 2014 11:20
Text-based names for smartphone (iPhone) emoticons, generated via https://github.com/cooperhewitt/go-ucd
😄 SMILING FACE WITH OPEN MOUTH AND SMILING EYES
😃 SMILING FACE WITH OPEN MOUTH
😀 GRINNING FACE
😊 SMILING FACE WITH SMILING EYES
☺ WHITE SMILING FACE
😉 WINKING FACE
😍 SMILING FACE WITH HEART-SHAPED EYES
😘 FACE THROWING A KISS
😚 KISSING FACE WITH CLOSED EYES
😗 KISSING FACE
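The list above was generated with go-ucd. A comparable lookup can be sketched in Python using the standard `unicodedata` module instead. This is a swapped-in technique, not the tool the gist used, and the `describe` helper is hypothetical.

```python
import unicodedata


def describe(text: str) -> list:
    """Pair each character with its Unicode name, or '<unnamed>' if it has none."""
    return [(ch, unicodedata.name(ch, "<unnamed>")) for ch in text]


for ch, name in describe("😄😃😀😊☺😉"):
    # Print the code point rather than the glyph so output survives non-UTF-8 consoles.
    print(f"U+{ord(ch):04X} {name}")
```

The names returned depend on the Unicode database bundled with the Python interpreter in use.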
ross-spencer / ICA's Access Principles
Created December 11, 2014 00:43
ICA's Access Principles, from #aranzasa conference, Christchurch 2014
1. The public has the right of access to archives of public bodies. Both public and private entities should open their archives to the greatest extent possible.
2. Institutions holding archives make known the existence of the archives, including the existence of closed materials, and disclose the existence of restrictions that affect access to the archives.
3. Institutions holding archives adopt a pro-active approach to access.
4. Institutions holding archives ensure that restrictions on access are clear and of stated duration, are based on pertinent legislation, acknowledge the right of privacy and respect the rights of owners of private materials.
5. Archives are made available on equal and fair terms.
ross-spencer / question-everything-ibn-al-haytham
Last active August 29, 2015 14:13
Ibn al-Haytham (Alhazen) Science. Question Everything... via @cosmosontv
Ibn Al-Haytham was the first person ever to set down the rules of science.
He created an error-correcting mechanism. A systematic and relentless way
to sift out misconceptions in our way of thinking.
‘Finding truth is difficult. And the road to it is rough. As seekers of the
truth you will be wise to withhold judgement and not simply put your trust in
the writings of the ancients. You must question and critically examine those
writings from every side. You must admit only to argument and experiment, and
not to the sayings of any person. For every human being is vulnerable to all
kinds of imperfection. As seekers of the truth we must also suspect and question
ross-spencer / rs-useful-apps
Created January 22, 2015 22:14
Useful applications for Digital Preservation Analysis
Tortoise Diff 1.6.7: http://sourceforge.net/projects/tortoisesvn/files/Tools/1.6.7/TortoiseDiff-1.6.7.zip/download
GNU Nano: http://www.nano-editor.org/