CtrlK
BlogDocsLog inGet started
Tessl Logo

uinaf/verify

Verify your own completed code changes using the repo's existing infrastructure and an independent evaluator context. Use after implementing a change when you need to run unit or integration tests, check build or lint gates, prove the real surface works with evidence, and challenge the changed code for clarity, deduplication, and maintainability. If the repo is not verifiable yet, hand off to `agent-readiness`; if you are reviewing someone else's code, use `review`.

97

1.02x
Quality

100%

Does it follow best practices?

Impact

89%

1.02x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

evidence-rules.mdreferences/

Evidence Rules

Verification without proof is just flattering yourself.

What Counts as Evidence

Prefer reproducible proof:

  • exact commands run
  • screenshots tied to a named flow
  • HTTP requests and responses
  • CLI output
  • structured logs, traces, or health checks
  • concrete error messages, codes, and recovery hints from an exercised failure path
  • file references with line numbers for static findings

Rules

  1. Screenshots are evidence, not verdicts

    • A nice screenshot does not prove the flow works end to end
  2. Swallow boring success output

    • Keep success terse; surface failures and anomalies
  3. Tie every finding to impact

    • say what breaks, who it affects, or why the risk matters
  4. Name the exact surface exercised

    • page, endpoint, command, state transition, config path
  5. For error-handling findings, capture the surfaced failure

    • include the actual error text, code/classification, and whether the message helps the user recover
  6. Flag unverified claims honestly

    • if you could not hit the real surface, say unverified and why
  7. Do not flood context with giant logs

    • quote only the relevant lines or summarize with exact pointers

Minimum Output Shape

For each meaningful finding:

  • severity
  • summary
  • evidence
  • impact
  • suggested fix or handoff

For the overall verification:

  • verdict: ship it / needs review / blocked
  • change verified
  • surfaces exercised
  • evidence summary

SKILL.md

tile.json