Tool Support for the Automatic Extraction of Table Data from Historical Journals

Swinemünder Bade-Anzeiger (Source: www.digitale-bibliothek-mv.de)

Qualitative and quantitative data analyses require a structured database in all specialist disciplines. Textual data, such as that found in newspapers, is often provided with additional tabular data in order to communicate information in a structured manner. At first glance, such tables appear structured, but in most cases they are to be regarded as semi-structured or unstructured, as it is often not possible to access individual elements of the data set in a targeted manner.

The aim of this project is to investigate the extent to which existing solutions for table extraction can be applied to historical journals. The aim is to develop a toolchain that allows tables with personal data to be extracted and processed in a reproducible manner using the historical journal "Swinemünder Badeanzeiger" in order to be used as a database for subsequent analysis.


back to the overview