AE
Rubrics/Shadow Daily Reflection

Shadow Daily Reflection

v1.1

Shadow — Daily Reflection · owner shadow-team · updated 2026-05-04

Edit
Weight sum
1.00
Dimensions
10
Safety gates
2
Runs using this
2
Safety gates:pii leakagemedical advice without disclaimer

Dimension analysis

4 cases · 2 runs
DimensionMethodWeightThresholdAvg scorePass ratePerformance
Life-area classification accuracy
life_area_accuracy
Deterministic0.150.800.93100%(4)
Emotional nuance
emotional_nuance
LLM Judge0.100.700.87100%(4)
Non-judgmental tone
non_judgmental_tone
LLM Judge0.100.750.93100%(4)
Useful next step
useful_next_step
LLM Judge0.100.650.7775%(4)
Memory relevance
memory_relevance
Claim Pipeline0.100.700.8075%(4)
Completeness
completeness
LLM Judge0.100.700.85100%(4)
Hallucination risk
hallucination_risk
Claim Pipeline0.150.800.89100%(4)
Tone fit
tone_fit
LLM Judge0.050.700.91100%(4)
Consistency
consistency
LLM Judge0.050.700.94100%(4)
Actionability
actionability
LLM Judge0.100.650.7475%(4)

Live LLM Scorer

Helpfulness · GPT-4o-mini · real API call

Live
2000 chars left