- Fork this Gist
- Edit your version to share your team's activity
- There's no third step
Forked from gregelin/1_pdfliberation_hackathon_activity.md
Last active
January 3, 2016 16:39
Who is working together?
| Name | Email | Twitter | Organization |
|---|---|---|---|
| Seamus Kraft | [email protected] | @seamuskraft | OpenGov Foundation |
| Ross Tsiomenko | [email protected] | n/a | OpenGov Foundation |
Which challenge are you working on?
- Amnesty International Annual Reports – Torture Incident Database
- Comprehensive Annual Financial Reports
- Federal Communications Commission Daily Releases
- House of Representatives Financial Disclosures (OpenSecrets.org)
- IRS Form 990 – Not-for-Profit Organization Reports
- New York City Council and Community Board Documents
- New York City Economic Development Commission Monthly Snapshot
- New York City Environmental Impact Statements
- US Foreign Aid Reports (USAID)
- Other: Documenting What Everyone Else is Working On
How would you categorize the PDFs?
| PDF URL | Document Title |
|---|---|
| http://www.domain.org/docs/docurl.pdf | Report of Economic Data 2012 |
- Disclosure (filing, forms, report, ...)
- Legislative doc (laws, analysis, ...)
- Financial (statements, reports)
- Government statistical data
- Non-Government statistical data
- Press (press releases, statements, ...)
- Government reports
- Non-Government reports
- Directory
- Other:
- 1 page
- 2 to 9 pages
- 10+ pages
- 100+ pages
- Collection includes PDFs made from scanned documents
- PDFs include hand-written text
- Human authored
- Machine generated
- God only knows
- Simple table of data
- Complex table of data
- Multiple tables of data from document
- Table data greater than one page in length
- Highly structured form data
- Loosely structured form data
- Has human-written text
- Structured text of a report (e.g., headings, subheadings, ...)
- Other:
- CSV
- JSON
- text version (e.g., markdown)