Skip to content

Instantly share code, notes, and snippets.

@joeltg
Last active April 15, 2019 05:10
Show Gist options
  • Save joeltg/f066945ee780bfee769a26cea753f255 to your computer and use it in GitHub Desktop.
Save joeltg/f066945ee780bfee769a26cea753f255 to your computer and use it in GitHub Desktop.
Namespace prefix mappings for the RDF properties produced by Apache Tika, for use as a JSON-LD context
{
"dc": "http://purl.org/dc/elements/1.1/",
"dcterms": "http://purl.org/dc/terms/",
"Iptc4xmpCore": "http://iptc.org/std/Iptc4xmpCore/1.0/xmlns/",
"Iptc4xmpExt": "http://iptc.org/std/Iptc4xmpExt/2008-02-29/",
"plus": "http://ns.useplus.org/ldf/xmp/1.0/",
"cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties/",
"extended-properties": "http://schemas.openxmlformats.org/officeDocument/2006/extended-properties/",
"photoshop": "http://ns.adobe.com/photoshop/1.0/",
"xmp": "http://ns.adobe.com/xap/1.0/",
"xmpidq": "http://ns.adobe.com/xmp/identifier/qual/1.0/",
"xmpMM": "http://ns.adobe.com/xap/1.0/mm/",
"xmpRights": "http://ns.adobe.com/xap/1.0/rights/",
"xmpTPg": "http://ns.adobe.com/xap/1.0/t/pg/",
"xmpG": "http://ns.adobe.com/xap/1.0/g/",
"stDim": "http://ns.adobe.com/xap/1.0/sType/Dimensions#",
"stFnt": "http://ns.adobe.com/xap/1.0/sType/Font#",
"xmpGImg": "http://ns.adobe.com/xap/1.0/g/img/",
"stEvt": "http://ns.adobe.com/xap/1.0/sType/ResourceEvent#",
"stRef": "http://ns.adobe.com/xap/1.0/sType/ResourceRef#",
"stVer": "http://ns.adobe.com/xap/1.0/sType/Version#",
"stJob": "http://ns.adobe.com/xap/1.0/sType/Job#",
"xmpDM": "http://ns.adobe.com/xmp/1.0/DynamicMedia/",
"pdf": "http://ns.adobe.com/pdf/1.3/",
"crs": "http://ns.adobe.com/camera-raw-settings/1.0/",
"exifEX": "http://cipa.jp/exif/1.0/",
"exif": "http://ns.adobe.com/exif/1.0/",
"tiff": "http://ns.adobe.com/tiff/1.0/",
"odf": "http://docs.oasis-open.org/ns/office/1.2/meta/odf#",
"meta": "urn:oasis:names:tc:opendocument:xmlns:meta:1.0#"
}
@joeltg
Copy link
Author

joeltg commented Jan 8, 2019

Most properties are listed at https://tika.apache.org/1.2/api/constant-values.html. However Tika will produce values for properties with namespaces not listed here (e.g. xmpTPg:NPages for the number pages in a PDF); these additional properties are included from the Adobe XMP Specification parts 1 and 2. Section 3.4 of Part 2 indicates that PDFs can also include properties from the Exif and TIFF namespaces, but the links to both the Exif spec and the mappings mentioned are broken. I think that this PDF contains the same information and have included the three namespaces (exifEX, exif, and tiff) that it describes.

@joeltg
Copy link
Author

joeltg commented Jan 8, 2019

Furthermore, in section 1.2.2.3 of Part 2, the URI for stFnt is erroneously listed as http:ns.adobe.com/xap/1.0/sType/Font# (a particularly insidious typo since http:ns.adobe... is a valid URI). I've corrected it to http://ns.adobe.com/xap/1.0/sType/Font#, consistent with every other Adobe namespace.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment