CtrlK
BlogDocsLog inGet started
Tessl Logo

fact-check

Verifies claims in backlog items, skill documentation, or plugin content against primary sources using web lookups. Spawns parallel verification agents that must use WebFetch/WebSearch/gh — training data recall is explicitly rejected as evidence. Produces VERIFIED/REFUTED/INCONCLUSIVE verdicts with citations. Use when items are marked UNVERIFIED or when verifying tool API claims, CLI flags, or documented software behavior.

67

Quality

81%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

SKILL.md
Quality
Evals
Security

Quality

Content

62%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured skill with clear workflow sequencing and good use of templates for verdicts and reports. Its main weaknesses are moderate verbosity (mermaid diagrams, some redundant framing) and incomplete actionability — the core mechanism for spawning parallel verification agents is described conceptually rather than with concrete, executable instructions. The evidence rules and CoVe protocol are valuable additions that demonstrate thoughtful design.

Suggestions

Replace mermaid diagrams with concise numbered lists or brief text descriptions — Claude processes text more efficiently than visual flowcharts

Add concrete instructions for how to spawn '@fact-checker' agents (e.g., the specific tool call or subagent mechanism to use), rather than describing it abstractly

Tighten the claim classification section by combining it with the verification agent prompt template, eliminating the redundant enumeration of the same fields

DimensionReasoningScore

Conciseness

The skill is reasonably efficient but includes some content that could be tightened — the mermaid diagrams add visual bulk without much value for Claude (who processes text), and some sections like 'When NOT to Use' and the evidence rules preamble add moderate padding. The claim classification and wave execution sections could be more compact.

2 / 3

Actionability

The skill provides structured templates (verdict format, agent prompt, report format) and specific post-action commands, which is good. However, the core verification process relies on spawning '@fact-checker' agents without explaining how to actually spawn them (what tool/mechanism), and the claim extraction process is described abstractly rather than with concrete executable steps. The CLI commands like 'backlog list --format json' and 'uv run prek run' are specific but their availability/context is assumed without verification.

2 / 3

Workflow Clarity

The multi-step workflow is clearly sequenced: extract claims → classify → spawn agents in waves → collect verdicts → generate report → post-actions. The Chain of Verification (CoVe) requirement adds an explicit validation/feedback loop within each agent's process. The wave execution pattern handles different claim counts. Post-actions include linting and committing with specific commands.

3 / 3

Progressive Disclosure

The skill references four external skills in the References section with relative paths, providing one-level-deep navigation. However, no bundle files are provided to verify these references exist, and the skill itself is fairly long (~150 lines of substantive content) with sections like the full report template and mermaid diagrams that could potentially be split out. The content is well-sectioned with clear headers, but it's borderline monolithic.

2 / 3

Total

9

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly articulates what the skill does (claim verification with parallel agents producing structured verdicts), how it works (web lookups rejecting training data recall), and when to use it (UNVERIFIED items, tool API claims, CLI flags). It is highly specific, distinctive, and complete with explicit trigger guidance.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: verifies claims against primary sources, spawns parallel verification agents, uses WebFetch/WebSearch/gh tools, produces VERIFIED/REFUTED/INCONCLUSIVE verdicts with citations. Very detailed about the mechanism and outputs.

3 / 3

Completeness

Clearly answers both 'what' (verifies claims using web lookups, spawns parallel agents, produces verdicts with citations) and 'when' (explicit 'Use when items are marked UNVERIFIED or when verifying tool API claims, CLI flags, or documented software behavior').

3 / 3

Trigger Term Quality

Includes strong natural trigger terms: 'UNVERIFIED', 'verify', 'claims', 'tool API claims', 'CLI flags', 'documented software behavior', 'backlog items', 'skill documentation', 'plugin content'. These cover the natural ways a user or system would invoke this skill.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive niche — fact-checking/verification of claims against primary sources with specific verdict outputs. The combination of verification methodology (parallel agents, web lookups, rejection of training data) and specific triggers (UNVERIFIED status, API claims, CLI flags) makes it very unlikely to conflict with other skills.

3 / 3

Total

12

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
Jamie-BitFlight/claude_skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.