After creating the PDF template group for the sample PDF template file, you can apply the rules from the template group to extract content from any PDF file in the PDF Reader Pre-Processor module.
-
In the Data Capture Pre-Processor, click PDF Reader Pre-Processor module.
-
Click Browse Folder to select a base directory. For example, you can select any PDF PreProcessor Data sample file from the sample data located on Smart Community. For more information, see Find sample data on Smart Community.
-
Select a template group from the Template Group list.
-
Enter a file pattern to process files matching the specified file pattern.
-
On the Progress tab, select the check box beside the File Name column, and then click Preview to preview the output.
-
Click Start Processing.
-
After the processing of a file is complete, in the Progress tab, click next to the file name in the File Name column to view the content file generated for the selected file.
-
On the Outputs tab, click on the file name hyperlink to view the log file.
-
If the status of the file denotes Processed, select the file and click Delete Pre-Processed Files to delete the files that are already created for the PDF file. This enables the Start Processing button.
-
You can select Match Tag Patterns to extract the tags that match the tag patterns defined in the Tag Discovery Patterns module.
-
You can select the Generate Document Index File and Generate File Index File check boxes if you want to generate document index and file index files respectively. After processing is complete, the document index and file index files display in the Outputs tab.