Table 'archivseiten' contains the information regarding the text recognition of a page, i.e. we find the extracted text as well as the OCR definition belonging to the page. Note that the archiving process leaves table 'archivseiten' untouched.
The following fields are relevant for us:
- Seite: reference to document and page of table 'archive' (document*1000+page)
- Ausschliessen: do not treat page with OCR
- Erfasst: page already treated with OCR
- Text: memo field for various information
- Indexiert: shows whether an index has been done for this document (obsolete)
- OCR: definition (0-x) making the desired language strings available
- ScreenQuality: reduction factor for screen copy: 0-50
2024-02-28 (c) by Archivista GmbH, CH-8118 Pfaffhausen