Process the PDF files using the PDF Reader Pre-Processor module - SmartPlant Foundation - IM Update 46 - Help - Hexagon

SmartPlant Foundation Help

After you create the PDF template group for the sample PDF template file, you can use the PDF Reader Pre-Processor module to apply the rules from that template group and extract content from any PDF file.

  1. In the Data Capture Pre-Processor, click the PDF Reader Pre-Processor module.

  2. Click Browse Folder to select a base directory. For example, you can select any PDF PreProcessor Data sample file from the sample data located on Smart Community. For more information, see Find sample data on Smart Community.

  3. Select a template group from the Template Group list.

  4. Enter a file pattern. Only files whose names match the pattern are processed.

  5. On the Progress tab, select the check box beside the File Name column, and then click Preview to preview the output.

  6. Click Start Processing.

  7. After a file is processed, on the Progress tab, click the icon next to the file name in the File Name column to view the content file generated for that file.

  8. On the Outputs tab, click the file name hyperlink to view the log file.
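
The file pattern in step 4 acts as a filter on file names. The help text does not document the pattern syntax, so the following is only a minimal sketch that assumes shell-style wildcards (`*`, `?`) and case-insensitive matching. The function names `matches_pattern` and `select_files` are hypothetical and are not part of the product.

```python
from fnmatch import fnmatchcase


def matches_pattern(file_name, pattern):
    """Return True if file_name matches the wildcard pattern.

    Assumption: matching ignores case; the product may behave differently.
    """
    return fnmatchcase(file_name.lower(), pattern.lower())


def select_files(file_names, pattern):
    """Keep only the names that match the pattern, preserving input order."""
    return [name for name in file_names if matches_pattern(name, pattern)]


if __name__ == "__main__":
    candidates = ["P-101.pdf", "P-102.PDF", "notes.txt"]
    print(select_files(candidates, "P-*.pdf"))
```

Under these assumptions, a pattern such as `P-*.pdf` selects the two PDF files and skips `notes.txt`.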

How can I extract assets from a PDF file and create a relationship with the document?

You can use the Data Capture PDF Reader Pre-Processor module to extract assets from a PDF file and then create a relationship with the document of the corresponding file.

  1. To create a relationship with the document, you must first create a DNS item with all the DNS properties using the Data Capture Document Naming System module. For more information, see Create a DNS item.

  2. Using the Data Capture PDF Reader Pre-Processor, create a template for the file and configure a rule by setting the Target Object to Titleblock and the Related Attribute to Asset. For more information, see Create and manage PDF reader pre-processor templates.

  3. Process the file using the Data Capture Pre-Processor module to create the content file. For more information, see Process the PDF files using the PDF Reader Pre-Processor module.

  4. Run the Content Discovery Task using the pre-processed content file to extract assets. For more information, see

    After content extraction, the relationship between the document and an asset is created, provided that an asset with a similar name is present in the product database.
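
The condition in step 4 can be illustrated with a small sketch. The help text does not define what "similar name" means, so this example assumes a case-insensitive match after trimming whitespace. The function `relate_assets` and the sample names are hypothetical and do not reflect the product's actual matching logic.

```python
def relate_assets(document_name, extracted_assets, database_assets):
    """Pair the document with each extracted asset that exists in the database.

    Assumption: names are compared case-insensitively after trimming spaces.
    Returns a list of (document_name, database_asset_name) tuples.
    """
    # Index database assets by their normalized name.
    known = {name.strip().upper(): name for name in database_assets}
    relations = []
    for asset in extracted_assets:
        key = asset.strip().upper()
        if key in known:
            relations.append((document_name, known[key]))
    return relations


if __name__ == "__main__":
    extracted = ["p-101a", "V-2001", "E-300"]
    in_database = ["P-101A", "V-2001"]
    print(relate_assets("DOC-001", extracted, in_database))
```

In this sketch, `E-300` produces no relationship because no asset with that name exists in the sample database list.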

  • If the file status is Processed, select the file and click Delete Pre-Processed Files to delete the files already generated for that PDF file. Deleting these files enables the Start Processing button again.

  • You can select Match Tag Patterns to extract the tags that match the tag patterns defined in the Tag Discovery Patterns module.

  • You can select the Generate Document Index File and Generate File Index File check boxes to generate the document index file and the file index file, respectively. After processing is complete, both index files appear on the Outputs tab.
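
The Match Tag Patterns option extracts only text that matches a defined tag pattern. As a rough sketch of that idea, the example below uses a regular expression to pull tags out of extracted text. The pattern shown is a hypothetical example; the real patterns are defined in the Tag Discovery Patterns module, and their syntax is not described here.

```python
import re

# Hypothetical tag pattern: 1-3 uppercase letters, a hyphen, 3-4 digits,
# and an optional uppercase suffix (for example, P-101A or V-2001).
TAG_PATTERNS = [
    re.compile(r"\b[A-Z]{1,3}-\d{3,4}[A-Z]?\b"),
]


def extract_tags(text, patterns=TAG_PATTERNS):
    """Return the unique matches found in text, taken pattern by pattern."""
    tags = []
    for pattern in patterns:
        for match in pattern.finditer(text):
            tag = match.group(0)
            if tag not in tags:  # skip tags that were already collected
                tags.append(tag)
    return tags


if __name__ == "__main__":
    sample = "Pump P-101A feeds vessel V-2001; see P-101A datasheet."
    print(extract_tags(sample))
```

Text that matches no pattern, such as the word "Pump" in the sample, is ignored, and repeated tags are reported once.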