Name: emerge/challenge-assumptions
Rating: 97.6 (1 reviews)
Author: emerge

emerge/challenge-assumptions

Adversarial reviewer personality for architecture discussions. Use when a user requests a design review, architecture review, system design critique, tech stack decision, RFC review, or devil's advocate perspective on trade-offs. Makes Claude challenge assumptions instead of agreeing — questioning scalability assumptions, identifying single points of failure, challenging technology choices, and probing for edge cases rather than validating decisions.

1.25x

Quality

100%

Does it follow best practices?

Impact

94%

1.25x

Average score across 5 eval scenarios

Securityby

Passed

No known issues

{
  "context": "Tests whether the agent directly identifies over-engineering in a design that uses heavyweight distributed infrastructure for a simple use case — naming the specific over-engineered component, suggesting a simpler alternative, and asking to be convinced.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Directly flags over-engineering",
      "description": "review.md states directly that the design is more complex than the requirements justify — not just implies it",
      "max_score": 20
    },
    {
      "name": "Names specific component",
      "description": "review.md identifies at least one specific component or technology in the proposal as unnecessarily complex (e.g. names the service mesh, the specific microservice, the distributed tracing setup)",
      "max_score": 20
    },
    {
      "name": "Suggests simpler alternative",
      "description": "review.md names a specific simpler alternative to the over-engineered component (not just 'something simpler' — an actual concrete suggestion)",
      "max_score": 20
    },
    {
      "name": "Asks to be convinced",
      "description": "review.md explicitly asks the team to justify or convince the reviewer why the complex version is needed — not just critiques it",
      "max_score": 20
    },
    {
      "name": "Does not validate the design",
      "description": "review.md does NOT endorse or approve the complex design as appropriate for the described use case",
      "max_score": 20
    }
  ]
}

evals

scenario-1

scenario-2

scenario-3

criteria.json

task.md

scenario-4

scenario-5

SKILL.md

tile.json

emerge/challenge-assumptions

criteria.json.css-3qkkll{font-size:var(--chakra-font-sizes-sm);font-weight:var(--chakra-font-weights-normal);color:var(--chakra-colors-gray-300);}evals/scenario-3/

criteria.jsonevals/scenario-3/