Structure

A config file has the following sections:

Header

The Header section is the main part of the config file. It helps the Parser locate the relevant table in the PDF document. It also tells the Parser what kind of data sits in each cell and how to identify row breaks.

The headers are arranged in the same order as they appear in the table (and across multiple rows if the table has multi-layer headers).

Each header is delimited by a ‘Header Start’ and a ‘Header End’ in the config file. Between these two markers there are two main parts:

Elements

Each element in a header corresponds to a cell in the header of the table we are trying to extract. Each element will have

To clarify, the word ‘Element’ does not actually appear in the config file; the elements start immediately below ‘Header Start’.
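The start/end layout described above can be illustrated with a small Python sketch. It is not the Parser's own code, and it rests on assumptions: that ‘Header Start’ and ‘Header End’ each appear alone on a line as literal text, and that every non-empty line between them is one element. The function name header_blocks and the placeholder element lines are hypothetical; the real element syntax is not shown in this section.

```python
def header_blocks(lines):
    """Collect the lines found between each 'Header Start' / 'Header End' pair.

    Assumption: the markers appear alone on a line, exactly as written here.
    Returns one list of element lines per header.
    """
    blocks = []
    current = None  # None means we are currently outside any header
    for raw in lines:
        line = raw.strip()
        if line == "Header Start":
            current = []  # elements begin immediately below this marker
        elif line == "Header End":
            if current is not None:
                blocks.append(current)
            current = None
        elif current is not None and line:
            current.append(line)  # treat each non-empty line as one element
    return blocks


# Hypothetical config text; the element lines are placeholders only.
sample = """\
Header Start
<first element>
<second element>
Header End
""".splitlines()

print(header_blocks(sample))
```

Note that nothing in the sketch looks for the word ‘Element’: the elements are simply whatever sits between the two markers.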

Example of a header
