This mini-Canon is the short path for Eval Labs evaluators. It explains the evaluator workflow without requiring a first-time evaluator to read the full main Canon.
New here? Start with 00 - Welcome to Eval Labs.

Before anything else

Eval Labs has passed the AI-reviewed platform readiness gate. That means the platform lifecycle has been proven: runs can be created, Lucia responses can be captured, reviews can be generated and persisted, and runs can be finalized. It does not mean Lucia has human quality approval. Human review remains the actual quality layer for Lucia's behavior. Your job is to help judge whether Lucia is useful, trustworthy, clear, and calm for real human/operator use.
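For readers who think in code, the lifecycle named above can be pictured as an ordered set of stages, with human review tracked separately. This is only an illustrative sketch: `RunStage`, `Run`, `advance`, and `human_reviewed` are hypothetical names, not part of the Eval Labs platform.

```python
from dataclasses import dataclass
from enum import IntEnum


class RunStage(IntEnum):
    """Hypothetical stage names mirroring the lifecycle described above."""
    CREATED = 1
    RESPONSE_CAPTURED = 2
    REVIEW_PERSISTED = 3
    FINALIZED = 4


@dataclass
class Run:
    stage: RunStage = RunStage.CREATED
    # Assumption for illustration: human review is a separate flag,
    # so reaching FINALIZED never implies human approval.
    human_reviewed: bool = False

    def advance(self) -> RunStage:
        """Move the run to the next lifecycle stage, in order."""
        if self.stage is RunStage.FINALIZED:
            raise ValueError("run is already finalized")
        self.stage = RunStage(self.stage + 1)
        return self.stage
```

In this sketch a run can reach `FINALIZED` while `human_reviewed` is still `False`, which is the distinction the readiness gate draws.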

Your first path

Evaluator access is broader than tester access, but it is still scoped. Use:
  • Custom Prompt Test
  • Auto-generated Prompt Test
  • Guest Facing Agent Verification Check
  • Verification Results
  • Controlled Batch Runner when assigned
  • your own run, review, and history routes
  • finalization for your own assigned runs
Do not use owner/admin surfaces unless an owner or admin explicitly changes your access later. That includes Team Review, Global Analysis, Registry Diagnostics, Behavioral Observatory, owner/admin tools, and shared platform-wide evidence.

Tester is a separate, narrower onboarding role. Testers use only Custom Prompt Test and Auto-generated Prompt Test.
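The access rules above can be summarized as a small deny-by-default check. This is a minimal sketch, not how Eval Labs enforces access: the surface names come from this page, while `Role`, `can_use`, and the `assigned` parameter are hypothetical. It covers named surfaces only, not the "your own run, review, and history routes" or finalization rules.

```python
from enum import Enum


class Role(Enum):
    TESTER = "tester"
    EVALUATOR = "evaluator"


# Surface names are taken from this page; the grouping is an assumption.
TESTER_SURFACES = frozenset({
    "Custom Prompt Test",
    "Auto-generated Prompt Test",
})
EVALUATOR_SURFACES = TESTER_SURFACES | frozenset({
    "Guest Facing Agent Verification Check",
    "Verification Results",
})
# Available to evaluators only when an owner or admin assigns it.
ASSIGNMENT_GATED = frozenset({"Controlled Batch Runner"})


def can_use(role: Role, surface: str, assigned: frozenset = frozenset()) -> bool:
    """Return True only for surfaces the role is explicitly allowed to use."""
    if role is Role.TESTER:
        return surface in TESTER_SURFACES
    if surface in EVALUATOR_SURFACES:
        return True
    if surface in ASSIGNMENT_GATED:
        return surface in assigned
    # Anything else (Team Review, Global Analysis, ...) is denied by default.
    return False
```

For example, `can_use(Role.EVALUATOR, "Controlled Batch Runner")` is false until the surface appears in `assigned`, and owner/admin surfaces such as Team Review are never allowed because they are not listed at all.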

This section does not launch onboarding

This section is onboarding material only. It does not mean that Lucia is human-approved, that your account has been cleared for every Eval Labs surface, or that active workspace polish is finished. Your access and assignment still come from an owner or admin.

Reading order

  1. What Eval Labs Is
  2. Your Role and Access
  3. AI-Reviewed vs Human Review
  4. Running Your First Custom Eval
  5. Reviewing Lucia
  6. Good Feedback Examples
  7. What Not To Do
  8. First Assignment Checklist
  9. Getting Help

First assignment rule

If you are uncertain, pause and ask an owner/admin. Do not guess your way through role access, routing, scoring meaning, or escalation.