CtrlK
BlogDocsLog inGet started
Tessl Logo

emerge/challenge-assumptions

Adversarial reviewer personality for architecture discussions. Use when a user requests a design review, architecture review, system design critique, tech stack decision, RFC review, or devil's advocate perspective on trade-offs. Makes Claude challenge assumptions instead of agreeing — questioning scalability assumptions, identifying single points of failure, challenging technology choices, and probing for edge cases rather than validating decisions.

97

1.25x
Quality

100%

Does it follow best practices?

Impact

94%

1.25x

Average score across 5 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a well-crafted skill description that excels across all dimensions. It clearly defines the adversarial reviewer role with specific behaviors, includes an explicit 'Use when' clause with multiple natural trigger terms, and carves out a distinct niche that won't conflict with standard review or documentation skills.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: 'challenge assumptions', 'questioning scalability assumptions', 'identifying single points of failure', 'challenging technology choices', 'probing for edge cases'. These are clear, actionable behaviors.

3 / 3

Completeness

Clearly answers both what (adversarial reviewer that challenges assumptions, identifies failures, questions choices) AND when (explicit 'Use when' clause listing six specific trigger scenarios). The explicit trigger guidance is comprehensive.

3 / 3

Trigger Term Quality

Excellent coverage of natural terms users would say: 'design review', 'architecture review', 'system design critique', 'tech stack decision', 'RFC review', 'devil's advocate', 'trade-offs'. These are terms users naturally use when seeking critical feedback.

3 / 3

Distinctiveness Conflict Risk

Clear niche as an adversarial/critical reviewer personality, distinct from general code review or documentation skills. The focus on challenging rather than validating, plus specific triggers like 'devil's advocate' and 'RFC review', make it unlikely to conflict with other skills.

3 / 3

Total

12

/

12

Passed

Implementation

100%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is an excellent skill that efficiently defines an adversarial reviewer persona with concrete, actionable guidance. The specific phrases and scenarios give Claude exact language to use, while the tension calibration provides nuanced behavioral adjustment across review stages. The skill respects Claude's intelligence while adding genuine value through specific behavioral rules.

DimensionReasoningScore

Conciseness

Every section earns its place with specific, actionable language. No padding or explanation of concepts Claude already knows. The examples are concrete scenarios, not generic descriptions.

3 / 3

Actionability

Provides exact phrases to use ('Why this over [specific alternative]?'), specific scenarios to invoke, and concrete behavioral rules. Claude knows exactly what to say and when.

3 / 3

Workflow Clarity

The 'Tension Calibration' section provides clear stage-based guidance with explicit behavioral adjustments for each review phase. The self-correction rule adds a feedback loop for maintaining adversarial posture.

3 / 3

Progressive Disclosure

For a personality/behavioral skill under 50 lines, the content is well-organized with clear sections (Activate When, Personality Rules, Tension Calibration). No external references needed; structure is appropriate for scope.

3 / 3

Total

12

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Reviewed

Table of Contents