Skip to main content
Use this checklist before and during your first assigned Eval Labs workflow.

Before starting

  • I know who assigned the work.
  • I know the behavior family being tested.
  • I know which surface I should use.
  • I know not to use owner/admin surfaces.
  • I understand that platform readiness is not human Lucia-quality approval.
  • If I am a tester, I am using only Custom Prompt Test or Auto-generated Prompt Test.

While running

  • I entered only the assigned prompts.
  • I did not add unrelated cases mid-run.
  • I waited for Lucia responses to complete.
  • I opened only my own assigned/scoped run and Review Queue routes.
  • I paused if the UI took me somewhere unexpected.

While reviewing

  • I read the prompt and response before scoring.
  • I treated suggested selections as suggestions, not truth.
  • I scored honestly.
  • I wrote short notes only when useful.
  • I flagged uncertainty or risk for senior review.
  • I saved every item.

Before finalizing

  • Every item has been reviewed.
  • Any uncertainty has been marked or escalated.
  • I am finalizing my own assigned/scoped run.
  • I am not using Team Review, Global Analysis, or owner/admin tools.

Done

After finalization, follow your owner/admin’s instructions for what to share next.