AI Skill
verify-criteria
Last updated: 2026-05-17
Use when judging acceptance criteria against a delivered artifact. Triggers — 'check criteria', 'verify acceptance'.
Quick Install
npx skills add verify-criteria
When to use
Call this skill whenever a task came with an explicit acceptanceCriteria: [...]
list from the user and an artifact exists that might satisfy it.
Examples:
- Coding task: "has booking button", "mobile-friendly", "shows hours"
- Marketing task: "under 125 chars", "mentions the discount"
- Design task: "works in monochrome", "readable at 16px"
- Research task: "cites at least 3 sources", "covers competitor pricing"
What you do
For each criterion, judge strictly:
- passed: true when you can directly verify it from the artifact
- passed: false when you can directly verify it fails
- passed: null when you cannot tell from available information
Never mark passed: true without direct evidence.
"Probably fine" is null, not true. "Likely broken" is null, not false.
This is the only way the output earns trust.
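The tri-state rule above can be sketched in Python. This is only an illustration: judge and its two boolean parameters are hypothetical names, not part of the skill, and treating conflicting evidence as null is an assumption made here.

```python
from typing import Optional


def judge(evidence_for: bool, evidence_against: bool) -> Optional[bool]:
    """Map direct evidence to a tri-state verdict (hypothetical helper)."""
    if evidence_for and not evidence_against:
        return True   # directly verified from the artifact
    if evidence_against and not evidence_for:
        return False  # directly verified to fail
    # No direct evidence either way. Conflicting evidence also lands
    # here, which is an assumption of this sketch.
    return None


print(judge(True, False))   # True
print(judge(False, True))   # False
print(judge(False, False))  # None
```

A hunch such as "probably fine" sets neither flag, so it falls through to None rather than True.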
Use whatever tools make sense to actually check:
- Artifact is a URL → Bash curl -I, Bash agent-browser navigate ... snapshot
- Artifact is a text blob → just read and reason
- Artifact is a file on disk → Read tool
- Criterion mentions specific behavior (e.g. "form submits") → actually exercise it via agent-browser or curl
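One way to picture the first three routing cases in the list above is a small classifier. This is a rough sketch under assumptions: artifact_kind is a hypothetical name, and the rule "http(s) scheme means URL, existing path means file, anything else is text" is a simplification chosen for the example.

```python
from pathlib import Path
from urllib.parse import urlparse


def artifact_kind(artifact: str) -> str:
    """Classify an artifact as 'url', 'file', or 'text' (hypothetical helper)."""
    parsed = urlparse(artifact)
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url"    # check with curl -I or a browser snapshot
    try:
        if Path(artifact).is_file():
            return "file"   # check by reading the file
    except (OSError, ValueError):
        # Long or odd strings are not valid paths; treat them as text.
        pass
    return "text"       # just read and reason


print(artifact_kind("https://example.com/"))  # url
print(artifact_kind("Save 20% this week"))    # text
```

Criteria about behavior still need the artifact to be exercised; classification only picks the starting tool.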
Output format
Return JSON exactly like this in your final message (no other text after):
{
  "checks": [
    {"name": "<criterion verbatim>", "passed": true, "note": "<1-sentence evidence>"},
    {"name": "<criterion verbatim>", "passed": false, "note": "<1-sentence evidence>"},
    {"name": "<criterion verbatim>", "passed": null, "note": "<why you can't tell>"}
  ]
}
Keep each note to one short sentence with concrete evidence ("found
at /" or "no h1 tag in returned HTML").
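For illustration, the output shape above could be produced with Python's standard json module. The helper names make_check and render_report are hypothetical, and the sample criteria and notes are made-up values.

```python
import json
from typing import Optional


def make_check(name: str, passed: Optional[bool], note: str) -> dict:
    """Build one entry; name should be the criterion verbatim."""
    # isinstance guards against truthy stand-ins such as 1 or "yes".
    if passed is not None and not isinstance(passed, bool):
        raise ValueError("passed must be True, False, or None")
    return {"name": name, "passed": passed, "note": note}


def render_report(checks: list) -> str:
    """Serialize the checks; Python None becomes JSON null."""
    return json.dumps({"checks": checks}, indent=2)


report = render_report([
    make_check("shows hours", True, "hours table present in returned HTML"),
    make_check("mobile-friendly", None, "no viewport check was run"),
])
print(report)
```

Rejecting non-boolean values up front keeps the three allowed states (true, false, null) distinct in the serialized result.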
What NOT to do
- Don't reinterpret the criteria loosely. "Shows hours" means hours are actually rendered — not that the word "hours" appears somewhere.
- Don't skip criteria you find vague. Mark them passed: null and say "criterion too vague to verify automatically" in the note.
- Don't pass/fail criteria you weren't given. Only judge what's asked.
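The last two rules (judge every criterion given, and nothing else) can be checked mechanically. A minimal sketch, assuming the report uses the checks shape from the output format; coverage_problems is a hypothetical helper and the sample data is invented.

```python
def coverage_problems(criteria: list, checks: list) -> dict:
    """Compare the criteria given with the criteria judged, by verbatim name."""
    given = set(criteria)
    judged = {check["name"] for check in checks}
    return {
        "skipped": sorted(given - judged),  # given but never judged
        "extra": sorted(judged - given),    # judged but never given
    }


criteria = ["has booking button", "shows hours"]
checks = [
    {"name": "has booking button", "passed": True, "note": "button found in HTML"},
    {"name": "loads fast", "passed": None, "note": "not measured"},
]
print(coverage_problems(criteria, checks))
# {'skipped': ['shows hours'], 'extra': ['loads fast']}
```

Both lists should be empty for a well-formed report. Matching is by exact string, so a paraphrased criterion name shows up as one skipped and one extra entry.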