This page explains the safe first-run workflow for approved reviewers. It does not replace the employee onboarding gate.
Before you begin
Make sure you know which testing path you are using:First custom smoke test
Use this prompt:- Lucia responds with current time
- run completes
- Review Queue opens
- no transport failure
- export contains
runSource: custom
First real review test
Choose a small behavior family. Example:What to do in the Review Queue
For each item:- Read the prompt.
- Read Lucia’s response.
- Review any suggested selections.
- Score each dimension honestly.
- Choose Keep talking, Verdict, and Priority.
- Answer the Quick Review questions.
- Add Human Guidance Evaluation scores when useful.
- Write notes when something feels off.
- Save the review.
- Finalize Run
- Back to Launcher
Export after reviewing
Export after review when you need to share evidence with product or engineering. Do not export only the generated responses if the goal is human review analysis. Generated-only exports are useful for debugging, but reviewed exports are stronger evidence. Reviewed exports preserve the structured review, suggested review, Employee Review, Human Guidance Evaluation, adjudication metadata, lifecycle state, tester identity, and dirty/completion state.Finalize Evaluation


Not part of first tester workflow
Tester users should not use:- Guest Facing Agent Verification Check
- Controlled Batch Runner
- Run History/global analytics
- Team Review
- Global Analysis
- Single Run Analysis
- owner/admin Home dashboard

