This is the corpus repository for https://archiscribe.jbaiter.de.
The goal is to have as much diverse OCR ground truth for 19th Century German prints as possible.
Currently the corpus contains 123 from 3 published across 3 years. Detailed statistics are available below.
Decade | # lines |
---|---|
1860 | 48 |
1880 | 50 |
1890 | 25 |
Total | 123 |
Year | # lines |
---|---|
1868 | 48 |
1881 | 50 |
1894 | 25 |
Total | 123 |
Title | Date | Archive.org | IIIF |
---|---|---|---|
Natur und Gemüth Ein Feld und Waldblüthenstrauß aus Tagen die nicht mehr sind, Gewunden von Friedrich Aulenbach | 1868 | bub_gb_HF46AAAAcAAJ | Manifest / Mirador |
Geschichte der Deutschen bis zur höchsten Machtentfaltung des Römisch ... | 1881 | geschichtederde00bessgoog | Manifest / Mirador |
Die forstlichen Verhaltnisse Preussens | 1894 | dieforstlichenv02hagegoog | Manifest / Mirador |