PDFDataNet Configurator
Section 1
The tabs are used to move between the various sections of the program.
NOTE: To enlarge a screenshot photo – please click on the image.
1.1 Document Class
A document class describes the type of documents you wish to process. In this section you configure the data sets you want to collect for each doument class. Typical document classes are Sales Orders, Supplier Invoices.
1.2 Templates
In the templates tab you create and templates using a sample PDF.
1.3 Data Zones
In the Data Zones section you configure how the data is going to be extracted from a Template.
1.4 Data Viewer
Data Viewer allows you to easily see the extracted data, load other examples of a PDF and generate test XML or CSV output files.
1.5 Transformations
The Transformations section allows you to maintain lookup tables for, for example, converting client product codes on a sales order into your own product codes.
1.5 Data Exporter
In data exporter you configure the output XML or CSV file.
CLICK HERE to find out more about the Data Exporter section.
Section 2
Intro text about this section to go here.
NOTE: To enlarge a screenshot photo – please click on the image.
2.1 Class Name
The class name describes the type of document you are processing.
2.2 Description
Enter a description of the class
2.3 PDF Output Path
Having extracted the data from the PDF this settings defines the location to move the PDF. Note that the path can include extracted variable names.
Section 3
Tables are used to extract the line details from the PDF.
NOTE: To enlarge a screenshot photo – please click on the image.
3.1 Table Name
Enter a meaningful table name to describe the line details you are extracting. For example, Products, TAX Analysis.
Section 4
In this section we define the data set of the data items we want to extract from each PDF.
NOTE: To enlarge a screenshot photo – please click on the image.
4.1 Field Name
Enter a meaningful name for the field (data item)
4.2 Prompt
This can be a shortened version of the field name that will be used in the user interface screens.
4.3 Variable Name
The variable name will be used by the system to reference the data captured.
4.4 Mandatory
A tick box to define the field as mandatory. If a mandatory field is not collected then the error case be invoked at time of exporting the data.
4.5 Default
You can set a default value for the field.
4.6 Suggestion
Enter a list of valid options for a user to select during manual indexing. The ‘|’ character is used to delimit the list.
4.7 MultiValue
A tick box to indicate the the field is a line detail and can have multiple values.
4.8 Hidden
A tick box to indicate that this field should not be displayed. This facility is useful for system wide variables such as default paths.
4.9 Table
For MultiValued fields select the relevant table.
4.10 Collection Method
Select the method by which the data will be captured. Note that the term OCR is used to indicate data extraction although for text based PDF files there is no OCR process.
4.11 Data Type
Select the relevant data type for the field.
4.12 Description
Text to describe and make notes about the field.