There are a number of steps to follow to make a new pull-request against the Archivematica project. This walk-through should help guide folk through that.
Requirements
- A Linux-based operating system.
- A GitHub account.
- Git installed.
[{ | |
"file": "objects/transcription/alto-text-0001.xml", | |
"identifiers": [{ | |
"identifier": "file:///ARCH00152.dig354/transcription transcript/ARCH00152_355_0000.xml", | |
"identiferType": "URL" | |
}, | |
{ | |
"identifier": "http://hdl.handle.net/10622/3BF316F5-E00B-4148-B1ED-43EA61EFA263", | |
"identiferType": "HANDLE" | |
} |
{ | |
"authority": "10.5072/FK2", | |
"id": 1589, | |
"identifier": "QAWS8O", | |
"latestVersion": { | |
"UNF": "UNF:6:doAry72PFwD1Edcrhsj/Qw==", | |
"createTime": "2019-01-16T19:15:52Z", | |
"files": [ | |
{ | |
"dataFile": { |
Artefactual has its own Git repository server on Gitolite. The implications of this is that the work a user completes on GitHub will be overwritten by any mirroring process onto the GitHub servers.
In short, branches, pull-requests, etc. subsequently merged via GitHub will never make it onto the Artefactual servers. They push, but never pull.
git remote set-url origin [email protected]:archivematica.git
git checkout -b dev/issue-1-my-new-branch
select MicroServiceChainLinks.pk, MicroServiceChainLinks.microserviceGroup,
TasksConfigs.description, StandardTasksConfigs.execute,
StandardTasksConfigs.arguments
from MicroServiceChainLinks
inner join TasksConfigs
on TasksConfigs.pk = MicroServiceChainLinks.currentTask
inner join StandardTasksConfigs
#!/bin/bash | |
# Script to run the various coreutils checksum utilities suite. Tools include | |
# sha1sum, sha256sum etc. Easily extensible to the other algorithms in the | |
# same suite, for example, b2sum for Blake2 hash comparison. | |
# | |
# The script can be run standalone outside of Archivematica using a transfer | |
# style layout e.g. | |
# | |
# transfer/ |
#!/usr/bin/env python | |
# -*- coding: utf-8 -*- | |
"""Script to connect to an Elasticsearch instance to basically, perform a | |
search for ANY phrase across the index. | |
Bzsed on: https://gist.github.com/ross-spencer/895b5a346729075dd98f76cd5314728c | |
""" | |
from __future__ import print_function |
#!/usr/bin/env python | |
# -*- coding: utf-8 -*- | |
"""Script to generate a sample set of files with a random distribution of | |
dates. Right now, this is very likely to be a uniform distribution so numpy | |
needs to be explored some more. | |
""" | |
import argparse | |
import atexit | |
import datetime | |
import logging |
#!/usr/bin/env python | |
# -*- coding: utf-8 -*- | |
from __future__ import print_function | |
import sys | |
import xml.etree.ElementTree as et | |
try: | |
tree = et.parse(sys.argv[1]) | |
except IndexError: |
Finding PST (Email Archives in the UKWA) using moonshine
ross-spencer@artefactual:~/Desktop/Artefactual/moonshine$ ./moonshine-linux64 -ffb 2142444E -stat
2018/08/14 16:17:27 Searching Shine
2018/08/14 16:17:27 Created URL: https://www.webarchive.org.uk/shine/search?page=1&query=content_ffb:"2142444e"&sort=crawl_date&order=asc
2018/08/14 16:17:27 Pinging URL: https://www.webarchive.org.uk/shine/search?page=1&query=content_ffb:"2142444e"&sort=crawl_date&order=asc
2018/08/14 16:17:29 121 files discovered
2018/08/14 16:17:29 13 pages available