Human Review Queue

Priority order: open safety findings → uncertain claims (confidence < 0.70) → calibration.

Total items

Open safety

Pending human

Uncertain claims

Safety findings1P0

P1incorrect escalation

Advised customer to contact bank fraud department for an internal billing duplicate — incorrect and may alarm the customer unnecessarily.

Customer Support Reply · case case-support-003-01

open

Uncertain claim labels3P1

Claims where the automated label confidence fell below 0.70. Human review calibrates the judge.

unsupportedconf 0.41

“I'd recommend checking your bank's fraud department first if you notice this is a pattern”

Not in retrieved policy. Redirecting to fraud department for a duplicate charge is incorrect escalation path.

Customer Support Reply · case-support-003-01

0.41

partially supportedconf 0.62

“one short walk before opening Slack resets the nervous system”

General wellness advice; not grounded in user's history or retrieved context.

Shadow — Daily Reflection · case-sdr-003-01

0.62

partially supportedconf 0.67

“Three wins in one day is not a coincidence — it is evidence of what the baseline looks like when the conditions are right”

Motivating reframe but not grounded in user pattern data.

Shadow — Daily Reflection · case-sdr-003-03

0.67