AE

Wiki

Practical evaluation knowledge base — the opinion layer of the tool.

New to AI evaluation?

Start with the 10-minute guide

Understand projects, rubrics, cases, runs, safety gates, and reports before diving into individual articles.

Interactive · 10 cases · ~8 min

Outputs, Please — practice mode

AI Inspection Booth №7. Label claims, catch ghost numbers, citation drift, PII leaks, prompt injection. Each case maps to one wiki article.

Learning Paths

AI Engineers

Rubrics, judge behavior, groundedness, claim evidence

34 min total

Reviewers

Human review, safety findings, overrides

Trust & Safety

Safety gates, false confirmations, PII, unresolved blockers

Getting Started

Core Concepts

Workflows

Advanced

10 articles · 17 primary sources · Read time 97 min total · Source files in projects/ai-evaluation-tool/wiki/ · Source cards in wiki/sources/source-cards.md