Any templates for the selected template group are displayed in the Associate Templates for Template Group pane. Each template consists of a PDF file with annotations.
You can annotate the text to be extracted in a sample or template PDF file and provide rule information in the annotation comment. This data is used to create rules.
The application pool user must have access to the template file location in order to manage PDF templates.
- In the Template Group pane, select a template group.
- Click Create Template in the Associate Templates for Template Group pane.
- In the Create Template window, type a name for the template.
- Click Browse to select the PDF template file.
- Click Create. The rules defined in the template are automatically created and configured in the Associate Rules for Template pane.
- To extract the tags that match the tag patterns defined in the Tag Discovery Patterns module, select the Match Tag Patterns check box in the Associate Templates for Template Group pane.
- In the Template Group pane, you can set any one template group as the Default Template Group. The default template group is used to extract content from PDF files using Apache PDFBox. You must ensure that templates and rules are configured for the default template group so that the content discovery task can extract data from the PDF files.
  However, you can also process the PDF files by using the auto-selected Default Template Group defined with the Match Tag Patterns condition delivered with the software.
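The way annotations turn into rules when you click Create can be pictured with a small sketch. This is only an illustrative model, not the product's implementation: the `Annotation` and `Rule` classes, the `create_rules` function, and the `key=value; key=value` comment format are all assumptions made for the example. The keywords that the software actually accepts are the ones listed in the PDF Template Keywords window.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Annotation:
    """Hypothetical stand-in for one annotation in a template PDF."""
    highlighted_text: str  # the text marked for extraction
    comment: str           # rule information typed into the annotation comment


@dataclass
class Rule:
    """Hypothetical rule derived from a single annotation."""
    sample_text: str
    settings: Dict[str, str]


def parse_comment(comment: str) -> Dict[str, str]:
    """Parse an assumed 'key=value; key=value' comment into a settings dict."""
    settings: Dict[str, str] = {}
    for part in comment.split(";"):
        if "=" in part:
            key, value = part.split("=", 1)
            settings[key.strip()] = value.strip()
    return settings


def create_rules(annotations: List[Annotation]) -> List[Rule]:
    """Create one rule per annotation (an assumed one-to-one mapping)."""
    return [Rule(a.highlighted_text, parse_comment(a.comment)) for a in annotations]
```

In this model, an annotation with an empty comment still yields a rule, just with no settings; whether the software behaves the same way is not stated here.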
Update a template file
- To update a template file, click Edit Template. Click Browse in the Editing <Template Name> Template window to select the modified PDF template file, and click Update.
- Click PDF Template Keywords in the Associate Templates for Template Group pane to open the PDF Template Keywords window with the list of keywords for creating the template document.
- The rules for any edited template are completely regenerated in the Associate Rules for Template pane.
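One way to read "completely regenerated" is that the rule set from the previous template file is replaced as a whole, not merged with the new one. The sketch below illustrates that reading only. The `Template` class, the `rules_from_pdf` and `update_template` functions, and the dictionary that stands in for reading annotation comments from a PDF file are hypothetical and do not reflect the software's internals.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Template:
    """Hypothetical template record: a name, a PDF file, and its rules."""
    name: str
    pdf_path: str
    rules: List[str] = field(default_factory=list)


def rules_from_pdf(pdf_path: str, annotations: Dict[str, List[str]]) -> List[str]:
    """Stand-in for reading annotation comments from a PDF file.

    The `annotations` dict maps a file path to its comments so that the
    example runs without a PDF library.
    """
    return [f"rule:{comment}" for comment in annotations.get(pdf_path, [])]


def update_template(template: Template, new_pdf_path: str,
                    annotations: Dict[str, List[str]]) -> None:
    """Point the template at the modified file and rebuild its rules."""
    template.pdf_path = new_pdf_path
    # Assign a fresh list: the old rules are discarded, not merged.
    template.rules = rules_from_pdf(new_pdf_path, annotations)
```

Under this assumption, any rule that came from an annotation that is no longer in the modified file disappears after the update.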