AE
Projects/AI Planning Assistant

AI Planning Assistant

Paused

Multi-step task decomposition + report

Open reportsEdit settings

Project Settings

Edit settings →
Identity
NameAI Planning Assistant
DescriptionMulti-step task decomposition + report
Ownerplatform-team
StatusPaused
Evaluation Defaults
Default modelclaude-opus-4-6
Active rubricplanner-v0.4
Judge model
Metadata
Tags
Notes

Project Intelligence

Computed from stored runs, cases, rubrics, and safety findings.

Evaluation Coverage

Total cases1
Passing1
Failing0
Unscored0
Regression cases0
Safety cases0

Recent Quality

Total runs2
Last run score0.94
Previous score0.77
Score delta+0.20
Regression statusClean

Human Review

Open items0
P0 safety0
P1 low-confidence0
Reviewed cases0

Safety State

Open blockers0
PII findings0
False confirmations0
Policy findings0

Active Rubric

NameAI Planner
Versionv0.5
Dimensions8
Weights normalizedYes
Safety gateEnabled
Min threshold≥ 0.60

Eval Run History

WhenVariableScorePass rateVerdictFlags
May 23, 04:20 PMtask-decomposition-prompt-v50.9495.0%Ship-ready
May 16, 10:00 AMtask-decomposition-prompt-v40.7775.0%Acceptable