CtrlK
BlogDocsLog inGet started
Tessl Logo

uinaf/verify

Verify your own completed code changes using the repo's existing infrastructure and an independent evaluator context. Use after implementing a change when you need to run unit or integration tests, check build or lint gates, prove the real surface works with evidence, and challenge the changed code for clarity, deduplication, and maintainability. If the repo is not verifiable yet, hand off to `agent-readiness`; if you are reviewing someone else's code, use `review`.

97

1.02x
Quality

98%

Does it follow best practices?

Impact

94%

1.02x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

evidence-rules.mdreferences/

Evidence Rules

Verification without proof is just flattering yourself.

What Counts as Evidence

Prefer reproducible proof:

  • commands or runtime surfaces exercised, recorded exactly in the work log when needed for reproduction
  • screenshots tied to a named flow
  • HTTP requests and responses
  • representative CLI output for failures or meaningful status lines
  • structured logs, traces, or health checks
  • concrete error messages, codes, and recovery hints from an exercised failure path
  • file references with line numbers for static findings

Rules

  1. Screenshots are evidence, not verdicts

    • A nice screenshot does not prove the flow works end to end
  2. Swallow boring success output

    • Keep success terse; surface failures and anomalies
  3. Tie every finding to impact

    • say what breaks, who it affects, or why the risk matters
  4. Name the exact surface exercised

    • page, endpoint, command, state transition, config path
  5. For error-handling findings, capture the surfaced failure

    • include the actual error text, code/classification, and whether the message helps the user recover
  6. Flag unverified claims honestly

    • if you could not hit the real surface, say unverified and why
  7. Do not flood context with giant logs

    • quote only the relevant lines or summarize with exact pointers
  8. Keep final evidence human-readable

    • in the final footer, summarize passing checks by intent and result; include full commands only for failures, reproduction, or when asked

Minimum Output Shape

For each meaningful finding:

  • severity
  • summary
  • evidence
  • impact
  • suggested fix or handoff

For the overall verification:

  • verdict: ready for review / needs more work / blocked (verify never issues ship it — that's review's call)
  • change verified
  • surfaces exercised
  • evidence summary: what passed or failed, not a raw command list unless reproduction requires it

SKILL.md

tile.json