Create a new content discovery task - SmartPlant Foundation - IM Update 48 - Help

Create a new content discovery task - SmartPlant Foundation - IM Update 48 - Help - Hexagon

SmartPlant Foundation Help

Language

English

Product

SmartPlant Foundation

Search by Category

Help

SmartPlant Foundation / SDx Version

SmartPlant Markup Plus Version

10.0 (2019)

Smart Review Version

2020 (15.0)

This functionality was modified in an update. For more information, see Create a new content discovery task (modified in an update).

When and how is the SPFHotSpotter.ini file updated?

The SPFHotSpotter.ini file is updated whenever any new tag discovery patterns are created or existing tag patterns are updated. The content discovery task uses the updated SPFHotspotter.ini file to extract tags.
After the tag discovery pattern are updated, in order to update the SPFHotSpotter.ini file, Data Capture initially checks for the SPFHotSpotter.ini file attached to the SmartConverter Control object. The SPFHotSpotter.ini file and the SmartConverter Control object must belong to same configuration type on a SmartPlant Foundation application server.
If no SmartConverter Control object is related to the same configuration item, the SmartConverter Control object related to the parent configuration item is used. If no SmartConverter Control object is related to the parent configuration item, the default SmartConverter Control object related to the ConfigurationTop is used. If no SmartConverter Control object is related to the ConfigurationTop, then the SPFHotspotter.ini file available at [drive]:\Program Files (x86)\SmartPlant\Foundation\SPFSmartConverter is updated.

What is the purpose of a default template group?

When you use Data Capture Content Discovery Task in the Desktop Client or Extract Content in the Web Client to extract content from multiple documents, the software automatically considers the templates and rules defined for the default template group. The default template group is considered only when a PDF file or a drawing file is attached to the document. To successfully extract content, ensure that the templates and rules are configured for the template group. However, if you have not chosen any template group as default, the software automatically considers a template group DefaultDrawingTemplateGroup for extracting the content. This default template group is provided with the software.

In the Web Client, to extract content from a single document, the software automatically considers the templates and rules defined for the auto selected default template group. The default template group is considered only when a PDF file or a drawing file is attached to the document. However, you have an option to select and apply any other template group instead of the default template group. For more information, see Extract data from a document.

For the auto selected default template group, the Match Tag Patterns option is pre-selected.

Before preparing to work with large amount of data, based on the size of the data it is recommended to configure the LicenseTimeoutSeconds property under Site Settings node in SmartPlant Foundation Server Manager. This setting will prevent the license token to timeout, thereby allowing the session to retain.

On the Content Discovery Task page, click Create Content Discovery Tasks .
Select the Document Criteria filter and Document Reader filter to process the documents that match the selected criteria.
Select one or more file types for the selected document reader filter.
Type search text in the Document Name Pattern box to process the documents that match the selected criteria.

If you want to schedule the content discovery task to process at a later date, click Tasks Start Date .
Click OK to view the list of document that will be processed. You can filter the documents for processing in this window.

By default, the tags extracted by the content discovery task are associated with an Unknown tag classification and an Unclassified security code.
FDW tags are created without applying the ENS definition.
To extract tags from the drawing and pdf files, the software applies the templates and rules from the template group which is set as default. For more information, see Manage drawing reader pre-processor templates and template groups and Manage PDF reader pre-processor templates.
After the content discovery task processes the documents those have a reader as a base reader, the reader gets changes to the Image or Document reader. If you have to process these documents by the content discovery task, you must specify the reader as Image or Document without specifying the actual file type.
When a content discovery task fails, large file sets are re-processed in smaller and smaller batches to find the problem. For example, documents are re-processed in batches of 100 then 10, drawings in batches of 20 then 2. For each batch, a child content discovery task is created under the master content discovery task.
Data Capture creates the relationship object SPFNCDTFailedCDT between a master content discovery task and a child content discovery task.
To check the status of a content discovery task for failed documents in the Desktop Client:
1. Click Find > Data Capture Items > Content Discovery Tasks.
2. Right-click a content discovery task, and click Show CDT for Failed Docs in the shortcut menu.
  
  To check the status of a master content discovery task, right-click a content discovery task, and click Show Root CDT in the shortcut menu.
You can select a content discovery task and click Rerun Content Discovery Task to rerun a content discovery task and process all the documents attached to it.
You can select a content discovery task and click Rerun Content Discovery Task for selected documents to process the selected document.
If the Content File and the GraphicsMap file are available in \\PreProcessedAlternateRenditions\PrepProcessedContentFiles folder and the \\PreProcessedContentFiles folder, then the content discovery task looks for the Content Files in \\PreProcessedAlternateRenditions\PrepProcessedContentFiles folder.
While extracting content from documents of drawing files and 3D models, a drawing representation object is created for each tag based on the Graphic OID property value in GraphicsMapFile.xml. The GraphicsMapFile.xml has the information for graphical navigation such as the corresponding InterfaceDefs, as well as all tag UIDs and Graphic OIDs for the document. The drawing representation object is related to the respective document and tag.
The drawing representation objects are specifically used for graphical navigation in the Web Client.
You cannot process the transferred documents and FDW documents using content discovery task.

Process .sha files

In the Central Data Capture Settings module, map the .sha file type to Image Reader in the File Type page.
In the Data Capture Pre-Processor Utilities module, process the .sha file using the Drawing Reader Pre-Processor, and generate the content file.
In the Data Capture Task Manager module, process the content file with Content Discovery Task.