Ross Spencer ross-spencer

@ross-spencer
ross-spencer / dataverse.feature
Last active April 25, 2018 09:30
Dataverse Feature in Archivematica
Feature: Create an Archival Information Package (AIP) from a dataset from a
Dataverse repository
Alma is a Dataverse and Archivematica user
And they want to preserve their datasets outside of Dataverse
On their own choice of digital file storage
So that their data can persist independently of any repository system
Background:
Given the Storage Service is configured to connect to a Dataverse
@ross-spencer
ross-spencer / sitemap.json
Last active April 23, 2018 09:18
Archivematica Sitemap
{
"archivematicaSiteMap": {
"title": "Archivematica Sitemap Reference",
"description": "An auto-generated visual index for the Archivematica digital preservation system.",
"version": "1.7",
"baseURL": "http://127.0.0.1:62080",
"transferUUID": "403fdf3d-3b57-47c7-bda8-97fd03539cd2",
"aipUUID": "dd63166f-fb11-465e-857e-2f1347940e18",
"microserviceExamples": [
{"{ms_0}": "afe2c331-8fdb-4e34-8e85-7060688be822"},
@ross-spencer
ross-spencer / exe-analysis.md
Last active April 17, 2018 22:51
Fido Analysis

Exes in PRONOM

Signature in Sig File

    <FileFormat ID="1704"
        MIMEType="application/vnd.microsoft.portable-executable"
        Name="Windows Portable Executable" PUID="fmt/899" Version="32 bit">
        <InternalSignatureID>1249</InternalSignatureID>
        <Extension>dll</Extension>
@ross-spencer
ross-spencer / result.txt
Created April 17, 2018 11:58
Python Regex Bug?
None
<_sre.SRE_Match object; span=(0, 519), match='\x00\x00\x00\x00\x00\x00\x00\x01\x00\x00\x00\x00\>
None
@ross-spencer
ross-spencer / cronfile
Created April 16, 2018 08:51
Crontab example for Reingest in Archivematica
*/5 * * * * /home/ross-spencer/git/artefactual/automation-tools/run-reingest.sh 1> /dev/null 2> /home/ross-spencer/git/artefactual/automation-tools/cron.err
@ross-spencer
ross-spencer / metswriter.py
Created April 5, 2018 12:02
METS Reader/Writer Demo
#!/usr/bin/env python
# -*- coding: utf-8 -*-
import argparse
from datetime import datetime
import lxml
import logging
import metsrw
from metsrw.plugins import premisrw
import os
@ross-spencer
ross-spencer / info.txt
Created March 25, 2018 00:19
Difficulties updating FPR admin in Docker Install
ross-spencer@artefactual:~/git/artefactual-labs/am/src/archivematica:$ git branch
dev/am-labs-issue-38-rename-defaults
dev/fix-mcpclient-tests
dev/give-up-on-fpr-server
dev/issue-882-install-mediaconch-docker
dev/issue-898-clamav-fails-with-diacritics
dev/issue-966-mediaconch-verbosity-issue
* qa/1.x
stable/1.7.x
ross-spencer@artefactual:~/git/artefactual-labs/am/src/archivematica:$ git log

recover_aip should be described in the docs and on the wiki API page. It takes the path to an AIP and the UUID of an AIP, and replaces whatever is in storage for that AIP with the content found at the path provided.

The use case is storing copies of AIPs outside of the Storage Service (Arkivum was the original use case). If the AIP inside the Storage Service fails a fixity check, you can retrieve your backup from tape and use recover_aip to replace it.
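The fixity-check-then-replace idea described above can be sketched in a few lines of Python. This is only an illustration of the workflow on local files, not the Storage Service implementation or its API: the function names `sha256_of` and `recover_if_corrupt`, the use of SHA-256, and the file paths are all assumptions made for the example.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def recover_if_corrupt(stored: Path, backup: Path, expected_sha256: str) -> bool:
    """Replace `stored` with `backup` when `stored` fails its fixity check.

    Hypothetical helper: returns True if the stored copy was replaced,
    False if it already matched the expected checksum.
    """
    if stored.exists() and sha256_of(stored) == expected_sha256:
        return False  # stored copy is intact, nothing to recover
    if sha256_of(backup) != expected_sha256:
        # Never overwrite storage with a backup that is itself damaged.
        raise ValueError("backup copy also fails the fixity check")
    shutil.copyfile(backup, stored)
    return True
```

Checking the backup before copying is a design choice in this sketch: a recovery that overwrites storage with a second bad copy would hide the original failure.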

@ross-spencer
ross-spencer / readme.md
Last active February 23, 2018 15:16
Create a complex directory layout

Creating a complex directory layout for QA

Unfortunately, I haven't finished the changes to the script below, so it has not been submitted for code review. That said, I have set it up so that it creates a structure suitable for testing. Using the default instructions below, we can create a tree of:

  • 225 directories, 1001 files
  1. To QA issue: https://github.com/artefactual/archivematica/issues/821#issue-278984149
  2. Stand up a VM with the latest AM 1.7 branch
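For readers without the script to hand, a nested test layout of this general kind can be generated with a short Python sketch. This is not the script referred to above, and it does not reproduce the 225 directories and 1001 files quoted; `build_tree`, `count_tree`, and the `depth`, `breadth`, and `files_per_dir` parameters are hypothetical names chosen for illustration.

```python
import os
from pathlib import Path


def build_tree(root: Path, depth: int, breadth: int, files_per_dir: int) -> None:
    """Create `root`, put `files_per_dir` small files in it, then recurse
    into `breadth` subdirectories until `depth` levels have been created."""
    root.mkdir(parents=True, exist_ok=True)
    for index in range(files_per_dir):
        (root / f"file_{index}.txt").write_text(f"sample {index}\n")
    if depth == 0:
        return
    for index in range(breadth):
        build_tree(root / f"dir_{index}", depth - 1, breadth, files_per_dir)


def count_tree(root: Path) -> tuple[int, int]:
    """Return (directories, files) found below `root`, excluding `root` itself."""
    directories = files = 0
    for _, dirnames, filenames in os.walk(root):
        directories += len(dirnames)
        files += len(filenames)
    return directories, files
```

Calling `build_tree(Path("qa-transfer"), depth=2, breadth=3, files_per_dir=2)` and then `count_tree(Path("qa-transfer"))` gives a quick check that the layout on disk matches what was intended before it is used as a transfer.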
@ross-spencer
ross-spencer / am-clamscan-clamd-compare.csv
Last active January 23, 2018 21:14
AM Clam Clamd Compare 36 files ~89MB total (2.5MB average file size)
tool,exec,avg_script_execution_time,max_script_execution_time,min_script_execution_time,total_script_executions,total_script_exec_time,max_script_queue_wait
clamd,archivematicaClamscan_v0.0,00:00:01.865767,00:00:03.844791,00:00:01.449927,55,00:01:42.617198,00:00:14.849126
clamscan,archivematicaClamscan_v0.0,00:01:17.157061,00:03:06.288086,00:00:36.125767,39,00:50:09.125396,00:08:54.776462
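To make the gap between the two rows easier to read, the average execution times can be converted to durations and divided. The sketch below copies the two `avg_script_execution_time` values from the table; `parse_duration` and `speedup` are helper names introduced here for illustration. On these figures the average clamscan run takes roughly 41 times as long as the average clamd run.

```python
from datetime import timedelta


def parse_duration(text: str) -> timedelta:
    """Parse an HH:MM:SS.ffffff string into a timedelta."""
    hours, minutes, seconds = text.split(":")
    return timedelta(hours=int(hours), minutes=int(minutes), seconds=float(seconds))


def speedup(slow: str, fast: str) -> float:
    """Return how many times longer `slow` is than `fast`."""
    return parse_duration(slow) / parse_duration(fast)


# Average script execution times, taken from the table above.
AVG_TIMES = {
    "clamd": "00:00:01.865767",
    "clamscan": "00:01:17.157061",
}

ratio = speedup(AVG_TIMES["clamscan"], AVG_TIMES["clamd"])
print(f"clamscan average / clamd average = {ratio:.1f}x")
```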