Confidence Thresholds
The Parashift validation steps Separation, Classification and Extraction have individual Thresholds which dictate which values are taken or extracted.
Confidence Thresholds
Confidence thresholds are defined for each validation step. Generally, these thresholds are set to optimize the dark processing of documents and workflow steps. They can be configured for the workflow steps Separation, Classification, and Extraction.
Separation
The thresholds for separation can be configured in the Upload Profiles and adjusted at any time.
The Separation Threshold is only required if you want to process documents using Intelligent Page Separation.
Learn more: Upload Profiles and Intelligent Page Separation
- Lower Threshold
The lower threshold defines the confidence level from which a separation is displayed, but still requires manual confirmation by the validator in the Separation Validation step. - Upper Threshold
The upper threshold defines the confidence level from which separations are displayed and automatically marked as correct. These separations do not require manual validation.
If all separations in a document reach this threshold, the document will be automatically moved to next processing step.

Classification
Classification groups similar documents under the same Document Type and assigns the type that best matches the uploaded document.
- The classification threshold defines the confidence level required for the system to automatically assign a document type without manual validation.

Extraction
For Extraction, thresholds are managed on a field-by-field basis. Each field has individual threshold settings that can be adjusted in the Document Type Configuration.
Two different thresholds can be configured for extraction:
-
Prediction Thresholds define how confident the system is that the correct value has been selected from the document.
-
Recognition Thresholds define how confident the OCR is that the extracted token has been read correctly.
Learn more: Prediction vs Recognition
Extraction Confidence Threshold:
- Lower Threshold:
The lower threshold defines the confidence level at which a predicted value is displayed but still requires manual verification by a user. - Upper Threshold:
The upper threshold defines the confidence level at which a predicted value is displayed and automatically accepted as correct.
If all fields in a document reach their upper threshold, the document will be dark-processed without any user input.

Recognition Confidence Threshold:
