Symptom

User reports one or more of the following:

  • Confidence score is consistently low.
  • Expected fields are missing.
  • Extracted values are incorrect.
  • Extraction times out before completion.

Diagnosis steps

  1. Confirm document type, page count, and scan quality.
  2. Verify upload completed without partial transfer.
  3. Check extraction job status and processing duration.
  4. Determine whether the issue affects one field group or all sections.
  5. Check whether similar documents processed in the same period are also affected.

Root cause

  • Low-quality scan or unclear text in source document.
  • Incomplete upload resulting in missing pages.
  • Complex or unusual clause formatting.
  • Temporary delay caused by limited processing capacity.

Resolution

Low confidence scores

  1. Ask the user to review low-confidence fields first and check each value against its provenance in the source document.
  2. If scan quality is poor, request a clearer file and rerun extraction.
  3. Validate improvement by comparing confidence distribution before and after rerun.
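
One way to make the before/after comparison in step 3 concrete is sketched below. It assumes the per-field confidence scores from each run are available as floats between 0 and 1; the function names and the 0.6 "low confidence" threshold are illustrative assumptions, not product settings.

```python
from statistics import mean, median


def summarize_confidence(scores, low_threshold=0.6):
    """Summarize per-field confidence scores (assumed floats in [0, 1]).

    low_threshold is a hypothetical cut-off for "low confidence".
    """
    if not scores:
        raise ValueError("no confidence scores supplied")
    low = sum(1 for s in scores if s < low_threshold)
    return {
        "mean": mean(scores),
        "median": median(scores),
        "low_share": low / len(scores),  # fraction of fields below the threshold
    }


def rerun_improved(before, after, low_threshold=0.6):
    """True if the rerun raised the median and did not add low-confidence fields."""
    b = summarize_confidence(before, low_threshold)
    a = summarize_confidence(after, low_threshold)
    return a["median"] > b["median"] and a["low_share"] <= b["low_share"]


if __name__ == "__main__":
    before = [0.4, 0.5, 0.7, 0.55]
    after = [0.8, 0.9, 0.75, 0.58]
    print(summarize_confidence(before))
    print(summarize_confidence(after))
    print("improved:", rerun_improved(before, after))
```

Comparing the median and the share of low-confidence fields, rather than the mean alone, keeps a single very high or very low field from hiding the overall trend.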

Missing fields

  1. Confirm the field exists in source document.
  2. Rerun extraction after verifying file completeness.
  3. If still missing, capture clause excerpt and escalate for model review.

Wrong values

  1. Guide user to correct values in Extraction Review and mark verified.
  2. Check if issue pattern repeats across similar fields.
  3. Record repeated patterns for engineering triage.
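
Steps 2 and 3 can be supported by a simple tally of corrections. The sketch below assumes each correction is logged as a record with a field name and a short error label; that record shape and the "error_type" labels are hypothetical, chosen only for illustration.

```python
from collections import Counter


def repeated_patterns(corrections, min_count=2):
    """Return (field, error_type) pairs that occur at least min_count times.

    Each correction is assumed to be a dict with "field" and "error_type" keys.
    """
    counts = Counter((c["field"], c["error_type"]) for c in corrections)
    return {pattern: n for pattern, n in counts.items() if n >= min_count}


if __name__ == "__main__":
    corrections = [
        {"field": "effective_date", "error_type": "wrong_format"},
        {"field": "effective_date", "error_type": "wrong_format"},
        {"field": "party_name", "error_type": "truncated"},
    ]
    print(repeated_patterns(corrections))
```

Only patterns that repeat are returned, so a one-off correction does not end up in the triage record.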

Timeout during extraction

  1. Retry extraction once.
  2. If timeout repeats, split document and process in logical sections.
  3. If still failing, escalate with job ID and timing details.
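
For step 2, the page ranges for each logical section can be worked out before splitting the file. The sketch below assumes the first page of each section is known (for example, from the table of contents); the function name is hypothetical and the code only computes ranges, it does not split the document itself.

```python
def section_ranges(section_starts, page_count):
    """Turn section start pages into inclusive (first_page, last_page) ranges.

    Pages are 1-indexed. The first section is assumed to start on page 1.
    """
    starts = sorted(set(section_starts))
    if not starts or starts[0] != 1:
        raise ValueError("the first section must start on page 1")
    if starts[-1] > page_count:
        raise ValueError("a section starts beyond the last page")
    # Each section ends one page before the next section starts.
    ends = [s - 1 for s in starts[1:]] + [page_count]
    return list(zip(starts, ends))


if __name__ == "__main__":
    for first, last in section_ranges([1, 8, 20], page_count=25):
        print(f"pages {first}-{last}")
```

Splitting on section boundaries rather than fixed page counts keeps each clause whole within a single part.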

Escalation

Escalate when:

  • Missing or wrong key fields persist after rerun.
  • Timeout repeats twice on the same document.
  • Multiple users report the same extraction degradation.
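
The three criteria above can be expressed as a small check, sketched here under stated assumptions: the parameter names are hypothetical, "repeats twice" is read as two further timeouts after the first one, and "multiple users" is read as two or more.

```python
def escalation_reasons(key_fields_wrong_after_rerun, repeat_timeouts, reporting_users):
    """Return the list of escalation criteria that are met (empty if none).

    repeat_timeouts counts timeouts after the first one on the same document.
    """
    reasons = []
    if key_fields_wrong_after_rerun:
        reasons.append("missing or wrong key fields persist after rerun")
    if repeat_timeouts >= 2:
        reasons.append("timeout repeated twice on the same document")
    if reporting_users >= 2:
        reasons.append("multiple users report the same degradation")
    return reasons


if __name__ == "__main__":
    reasons = escalation_reasons(False, repeat_timeouts=2, reporting_users=1)
    print("escalate" if reasons else "do not escalate", reasons)
```

Returning the matched reasons, rather than a bare yes or no, gives the escalation note a ready-made justification.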

Escalate to:

  • L2 Support for impact assessment.
  • Engineering for model or processing pipeline investigation.

Include:

  • Workspace ID, document ID, extraction job ID.
  • Exact field names and expected values.
  • Sample screenshots with provenance view.
  • Timestamps in UTC, plus the user's local timezone.
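
A minimal sketch of how these details might be gathered into one record follows. The class and field names are hypothetical and do not reflect any ticketing schema; the main point is that the timestamp is converted to UTC before it is recorded.

```python
from dataclasses import dataclass, asdict
from datetime import datetime, timedelta, timezone


@dataclass
class EscalationReport:
    workspace_id: str
    document_id: str
    job_id: str
    expected_values: dict  # exact field name -> expected value
    screenshots: list      # file names of provenance-view screenshots
    observed_at: datetime  # must be timezone-aware

    def to_dict(self):
        """Return a plain dict with the timestamp normalized to UTC (ISO 8601)."""
        if self.observed_at.tzinfo is None:
            raise ValueError("observed_at must include a timezone")
        data = asdict(self)
        data["observed_at"] = self.observed_at.astimezone(timezone.utc).isoformat()
        return data


if __name__ == "__main__":
    report = EscalationReport(
        workspace_id="ws-example",
        document_id="doc-example",
        job_id="job-example",
        expected_values={"effective_date": "2024-01-05"},
        screenshots=["provenance-1.png"],
        observed_at=datetime(2024, 1, 5, 12, 30, tzinfo=timezone(timedelta(hours=2))),
    )
    print(report.to_dict())
```

Rejecting timestamps without a timezone avoids guessing whether a reported time was local or UTC.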

Related issues