Rubrics/AI Planner

AI Planner

v0.5

AI Planning Assistant · owner platform-team · updated 2026-05-22

Weight sum

1.00

Dimensions

Safety gates

Runs using this

Safety gates:destructive actionfalse confirmation

Dimension analysis

1 cases · 2 runs

Dimension	Method	Weight	Threshold	Avg score	Pass rate
Task completion task_completion	LLM Judge	0.30	≥0.75	0.97	100%(1)
Plan coherence plan_coherence	LLM Judge	0.10	≥0.70	0.94	100%(1)
Hallucination risk hallucination_risk	Claim Pipeline	0.15	≥0.85	0.96	100%(1)
Accuracy accuracy	LLM Judge	0.10	≥0.75	0.93	100%(1)
Actionability actionability	LLM Judge	0.10	≥0.70	0.97	100%(1)
Completeness completeness	LLM Judge	0.10	≥0.70	0.91	100%(1)
Tone fit tone_fit	LLM Judge	0.10	≥0.60	0.88	100%(1)
Consistency consistency	LLM Judge	0.05	≥0.70	0.95	100%(1)

Helpfulness · GPT-4o-mini · real API call

Live

2000 chars left