Eval Labs now separates fast employee judgment from senior adjudication so Lucia can learn from structured signal without letting non-expert reviewer language become training truth.
The three-layer review model
Eval Labs review is not one flat annotation step.
It is a layered judgment system:
Layer 1: Employee ReviewLayer 2: Escalation RoutingLayer 3: Senior Adjudication
Each layer has a different job.
Behavioral Observatory position
Behavioral Observatory adds a saved behavioral label layer beside the Review Queue workflow.
It does not erase the three-layer review model.
Use this distinction:
Review Queue = score and review the prompt/response item.
Behavioral Observatory = save structured behavioral labels for the conversation.
Registry Diagnostics = inspect derived classification suggestions.
Behavioral Observatory labels can become useful behavioral evidence, but they should not be confused with senior adjudication or Gold Standard approval.
Layer 1 — Employee Review
Employee Review captures fast human reaction.
It is designed for reviewers who may not know AI, prompting, ontologies, or model training.
Employees answer guided questions such as:
Did Lucia understand what was needed?
Did Lucia give the right next move?
Did Lucia make the situation feel calmer?
Did anything feel risky, confusing, or wrong?
Should a senior reviewer look at this?
Could this teach Lucia something reusable?
This layer should be:
- simple
- fast
- structured
- low-friction
- non-technical
- psychologically clear
Employee reviewers should not be asked to invent labels, taxonomies, intent categories, action classes, or training language.
Layer 2 — Escalation Routing
Escalation Routing converts guided review answers into workflow state.
Examples:
seniorReview = true → reviewState: needs_adjudication
reusableLearning = true → canonCandidate: true
riskOrConfusion = slightly_off → reviewState: needs_review
riskOrConfusion = definitely_wrong → reviewState: needs_review
The important doctrine:
The employee reports what they experienced.
The system routes the case.
The senior reviewer interprets meaning.
Layer 3 — Senior Adjudication
Adjudication is the senior-review layer where canonical meaning is assigned.
Adjudication may include:
- final human labels
- final intent interpretation
- follow-through decision
- final action type
- emotional read
- owner pressure level
- reason for the final call
- reusable canon/training signal
This layer exists to prevent ontology drift.
Employees should not train Lucia directly with improvised language.
Why this architecture matters
Without this separation, Eval Labs risks collecting inconsistent reviewer opinions as if they were stable training truth.
That creates:
ontology drift
label noise
inconsistent training signal
reviewer fatigue
low-confidence exports
With this separation, Eval Labs captures simple human judgment while preserving a high-quality senior interpretation layer.
Canon rule
Non-expert reviewers provide signal. Senior adjudication provides meaning.
This is the core protection layer for scalable human evaluation.