Process the files using PDF Reader Pre-Processor module - SmartPlant Foundation - IM Update 46 - Help - Hexagon

SmartPlant Foundation Help

Language
English
Product
SmartPlant Foundation
Search by Category
Help
SmartPlant Foundation / SDx Version
10
SmartPlant Markup Plus Version
10.0 (2019)
Smart Review Version
2020 (15.0)
  1. In the Data Capture Pre-Processor module, click PDF Reader Pre-Processor module.

  2. Click Browse Folder to select a base directory.

  3. Select a template group from the Template Group list.

  4. Enter a file pattern to process files matching the specified file pattern.

  5. Type a domain name in the Domain Name box. This domain name will be taken into consideration during content discovery and the tags extracted from this file will be related to this domain.

    • It is recommended you restrict the number of characters in the domain name so that it does not exceed 10 characters.

  6. Select the Generate Document Index File and Generate File Index File check boxes if you want to generate document index and file index files respectively. After processing is complete, the document index and file index files display in the Outputs tab.

  7. Select the Output Raw Data option to generate the raw data file. This file can be used to see additional properties and view all the information in text format.

  8. On the Progress tab, select the check box beside the File Name column, and then click Preview to preview the output.

  9. Click Start Processing.

    • After the processing of a file is complete, in the Progress tab, click next to the file name in the File Name column to view the content file generated for the selected file.

    • If any tags are not extracted in the content file after processing the data, we recommend you to view the raw data file by selecting the Output Raw Data option to check for the missing tags in the input file.

    • If the status of the file denotes Processed, select the file and click Delete Pre-Processed Files to delete the files that are already created for the 3D file. This enables the Start Processing button.

    • On the Outputs tab, click on the file name hyperlink to view the log file.

    • The delimiters are considered based on the delimiters defined in the Central Settings module. For more information on this, see Central Settings.

You can now search and preview the list of Documents to be OCRed

You can review a list of documents that do not have searchable text so that you can process them with Optical Character Recognition software prior to processing.

  • Click Preview the Documents to be OCRed OCRed to generate a .txt file in the Outputs tab with a list of PDF files that need to be processed using the OCR software.

This option is available only if you specify the valid folder path in the Base Directory box while processing the PDF files.