The Resolve Duplicates feature is not available in the Data Capture Quality Control of Desktop Client from Update 37.
The file system may contain several copies of the same file. The files might have different file extensions, or they might be located in different folders. More than one file with the same name could also exist. After the Document Discovery Task runs, a document might be related to more than one file.
You must identify a master file to resolve any duplication. Once the master file is identified, you can keep the other duplicate files for reference or delete them from the database. You can assign the following statuses to a file:
Status |
Details |
---|---|
Unknown |
The file status is not set. This is the default status for the files when duplicates are found. |
Master |
The file is the latest and is selected for content and tag extraction. |
Candidate |
This file is the latest, but it is excluded from content and tag extraction. |
Historical |
This file is an older version that needs to be stored in the database for future reference. |
Ignore |
This file is an older version that is not needed for reference, and it can be stored at a physical location outside the application. |