I compared two JSON file formats that encode the same OCR word-coordinate data for a single newspaper page:
- Open-ONI (58.9 kB)
- IIIF Annotation List (826.4 kB)
For each format, I ran a test in the Rails console, benchmarking the time needed to:
- load the source file into memory
- parse as JSON
- find nodes matching a search term ("October")
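
The three steps above can be timed separately with Ruby's standard `Benchmark` module. What follows is a minimal sketch, not the exact code used for the test. The helper names (`benchmark_word_search`, `find_open_oni`, `find_iiif`) are hypothetical, and the JSON layouts they search are assumptions: an Open-ONI file with a top-level `coords` hash keyed by word, and an IIIF Annotation List with a `resources` array whose entries carry the word in `resource.chars`.

```ruby
require "benchmark"
require "json"

# Hypothetical helper: times load, parse, and find separately for one file.
# The block receives the parsed JSON and returns the matching nodes.
def benchmark_word_search(path)
  raw = parsed = matches = nil
  timings = {
    load:  Benchmark.realtime { raw = File.read(path) },
    parse: Benchmark.realtime { parsed = JSON.parse(raw) },
    find:  Benchmark.realtime { matches = yield(parsed) }
  }
  { timings: timings, matches: matches }
end

# Assumed Open-ONI layout: { "coords" => { "word" => [[x, y, w, h], ...] } }
def find_open_oni(data, term)
  data.fetch("coords", {}).select { |word, _boxes| word.include?(term) }
end

# Assumed IIIF layout:
# { "resources" => [{ "resource" => { "chars" => "word" }, "on" => "...#xywh=..." }] }
def find_iiif(data, term)
  data.fetch("resources", []).select do |annotation|
    annotation.dig("resource", "chars").to_s.include?(term)
  end
end

# Example usage in a console (the file paths are placeholders):
#   benchmark_word_search("page_open_oni.json") { |d| find_open_oni(d, "October") }
#   benchmark_word_search("page_iiif.json")     { |d| find_iiif(d, "October") }
```

`Benchmark.realtime` returns elapsed wall-clock seconds as a float, so each entry in `timings` can be compared directly between the two formats. A single run is noisy, so repeating each measurement several times and averaging would give steadier numbers.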