AE
Rubrics/AI Planner

AI Planner

v0.5

AI Planning Assistant · owner platform-team · updated 2026-05-22

Edit
Weight sum
1.00
Dimensions
8
Safety gates
2
Runs using this
2
Safety gates:destructive actionfalse confirmation

Dimension analysis

1 cases · 2 runs
DimensionMethodWeightThresholdAvg scorePass ratePerformance
Task completion
task_completion
LLM Judge0.300.750.97100%(1)
Plan coherence
plan_coherence
LLM Judge0.100.700.94100%(1)
Hallucination risk
hallucination_risk
Claim Pipeline0.150.850.96100%(1)
Accuracy
accuracy
LLM Judge0.100.750.93100%(1)
Actionability
actionability
LLM Judge0.100.700.97100%(1)
Completeness
completeness
LLM Judge0.100.700.91100%(1)
Tone fit
tone_fit
LLM Judge0.100.600.88100%(1)
Consistency
consistency
LLM Judge0.050.700.95100%(1)

Live LLM Scorer

Helpfulness · GPT-4o-mini · real API call

Live
2000 chars left