Skip to content

Instantly share code, notes, and snippets.

@atomotic
Created July 11, 2017 14:27
Show Gist options
  • Save atomotic/5d9cbfb020817f7f776682fe591f41ad to your computer and use it in GitHub Desktop.
Save atomotic/5d9cbfb020817f7f776682fe591f41ad to your computer and use it in GitHub Desktop.
➜ tika -m -j http://www.anvur.org/rapporto-2016/files/Area01/VQR2011-2014_Area01_Tabelle.pdf | jq .
ERROR OpenType Layout tables used in font ABCDEE+Cambria,Bold are not implemented in PDFBox and will be ignored
ERROR OpenType Layout tables used in font Times New Roman are not implemented in PDFBox and will be ignored
ERROR OpenType Layout tables used in font Times New Roman,BoldItalic are not implemented in PDFBox and will be ignored
ERROR OpenType Layout tables used in font Times New Roman,Italic are not implemented in PDFBox and will be ignored
ERROR OpenType Layout tables used in font ABCDEE+Calibri,BoldItalic are not implemented in PDFBox and will be ignored
{
"Author": "Andrea Gordiani",
"Content-Length": "9651618",
"Content-Type": "application/pdf",
"Creation-Date": "2017-02-21T13:23:47Z",
"Last-Modified": "2017-02-21T13:23:47Z",
"Last-Save-Date": "2017-02-21T13:23:47Z",
"X-Parsed-By": [
"org.apache.tika.parser.DefaultParser",
"org.apache.tika.parser.pdf.PDFParser"
],
"access_permission:assemble_document": "true",
"access_permission:can_modify": "true",
"access_permission:can_print": "true",
"access_permission:can_print_degraded": "true",
"access_permission:extract_content": "true",
"access_permission:extract_for_accessibility": "true",
"access_permission:fill_in_form": "true",
"access_permission:modify_annotations": "true",
"created": "Tue Feb 21 14:23:47 CET 2017",
"creator": "Andrea Gordiani",
"date": "2017-02-21T13:23:47Z",
"dc:creator": "Andrea Gordiani",
"dc:format": "application/pdf; version=1.5",
"dcterms:created": "2017-02-21T13:23:47Z",
"dcterms:modified": "2017-02-21T13:23:47Z",
"meta:author": "Andrea Gordiani",
"meta:creation-date": "2017-02-21T13:23:47Z",
"meta:save-date": "2017-02-21T13:23:47Z",
"modified": "2017-02-21T13:23:47Z",
"pdf:PDFVersion": "1.5",
"pdf:docinfo:created": "2017-02-21T13:23:47Z",
"pdf:docinfo:creator": "Andrea Gordiani",
"pdf:docinfo:creator_tool": "Microsoft® Office Excel® 2007",
"pdf:docinfo:modified": "2017-02-21T13:23:47Z",
"pdf:docinfo:producer": "Microsoft® Office Excel® 2007",
"pdf:encrypted": "false",
"producer": "Microsoft® Office Excel® 2007",
"resourceName": "VQR2011-2014_Area01_Tabelle.pdf",
"xmp:CreatorTool": "Microsoft® Office Excel® 2007",
"xmpTPg:NPages": "414"
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment