A strong prompt suite tests one behavior family from multiple angles.
The rule
One suite should have one primary purpose. Good suite purpose:The 1–10 prompt range
The custom launcher allows up to 10 prompts. Use the limit deliberately. Recommended sizes:| Purpose | Prompt Count |
|---|---|
| Quick smoke | 1 |
| Narrow bug check | 3–5 |
| Behavior family refinement | 5–8 |
| Pre/post regression comparison | 8–10 |
Prompt variation types
A strong suite uses variants:Direct phrase
Near-neighbor phrase
Indirect emotional signal
Trust-state signal
Operational hybrid
What to avoid
Avoid prompts that are too broad to interpret:Suite notes
When creating a suite, write down:- purpose
- expected mode
- known failure pattern
- desired behavior
- version number

