CtrlK
BlogDocsLog inGet started
Tessl Logo

feature-flagging

Operational discipline for feature flags as production infrastructure. Flag types, naming, targeting rules, rollout strategy, lifecycle, governance, stale flag management, and the technical debt patterns that bite teams who weren't deliberate about it.

36

Quality

33%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/feature-flagging/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a comprehensive knowledge document about feature flag operational discipline, but it reads more like a blog post or engineering handbook chapter than a concise skill file for Claude. Its main weakness is extreme verbosity — it explains concepts Claude already understands, provides extensive motivation and rationale, and includes inline content that should live in the referenced files. The actionability is moderate: good naming conventions and process frameworks, but no executable code or commands.

Suggestions

Cut the content by at least 50%: remove the opening philosophy paragraphs, the 'What this skill is for' section, the closing section, and inline explanations of concepts Claude already knows (what caching is, what a kill switch is, what technical debt means).

Move detailed content (targeting rules, performance considerations, governance tiers, rollout strategies) into the referenced files and keep only the summary/decision framework in SKILL.md.

Add concrete, executable examples: a sample flag creation metadata JSON/YAML, a sample PR template for flag removal, a sample audit query, or platform-agnostic SDK code snippets showing correct caching/bulk evaluation patterns.

Provide the referenced bundle files (references/*.md) so the progressive disclosure structure actually functions, or inline the critical checklists if the files won't be provided.

DimensionReasoningScore

Conciseness

The skill is extremely verbose at ~3000+ words. It explains concepts Claude already knows (what feature flags are, what kill switches do, what cache means), provides extensive rationale and motivation paragraphs, and repeats itself across sections. The opening paragraphs, 'What this skill is for' section, and closing section are largely redundant. Many sections could be cut by 50-70% without losing actionable content.

1 / 3

Actionability

The skill provides concrete naming conventions, specific percentage rollout numbers (1→10→25→50→100), permission tiers, and a 14-point checklist framework. However, it contains zero executable code or commands, no concrete platform-agnostic CLI examples, and the guidance remains at the level of organizational process description rather than copy-paste-ready instructions. The 'common failures' section is useful but still descriptive rather than prescriptive with specific remediation steps.

2 / 3

Workflow Clarity

The flag lifecycle (birth→adolescence→launch→maturity→death) and the stale flag cleanup playbook are clearly sequenced. The rollout strategy includes monitoring checkpoints ('watch for one peak hour'). However, validation steps are mostly implicit rather than explicit, and the lifecycle phases lack concrete entry/exit criteria beyond descriptions. The cleanup workflow in the stale flag section is the strongest example but delegates the full checklist to a reference file that isn't provided.

2 / 3

Progressive Disclosure

The skill references seven well-organized reference files with clear signaling and one-level-deep navigation, which is good structure. However, no bundle files are actually provided, so the references are broken. Additionally, the main SKILL.md itself is far too long — much of the inline content (targeting rule details, performance considerations, governance details) should be in the reference files rather than duplicated/expanded in the main body.

2 / 3

Total

7

/

12

Passed

Description

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description identifies a clear domain (feature flags) and enumerates relevant subtopics, but reads more like a table of contents than an actionable skill description. It lacks concrete actions Claude would perform and entirely omits 'when to use' guidance, making it difficult for Claude to reliably select this skill from a large pool. The informal closing phrase about technical debt adds personality but not functional clarity.

Suggestions

Add an explicit 'Use when...' clause with trigger terms like 'feature flag', 'feature toggle', 'rollout strategy', 'flag cleanup', 'canary release', or 'progressive delivery'.

Replace topic listings with concrete actions, e.g., 'Designs feature flag naming conventions, creates rollout plans, audits stale flags, and recommends cleanup strategies.'

Include common synonyms and tool names (e.g., 'feature toggles', 'feature switches', 'LaunchDarkly', 'Unleash') to improve trigger term coverage.

DimensionReasoningScore

Specificity

The description names the domain (feature flags) and lists several relevant subtopics (flag types, naming, targeting rules, rollout strategy, lifecycle, governance, stale flag management, technical debt patterns), but these are topic areas rather than concrete actions Claude would perform. It doesn't specify what Claude actually does with these topics (e.g., 'creates rollout plans', 'audits stale flags').

2 / 3

Completeness

The description covers 'what' at a topical level but completely lacks any 'when should Claude use it' guidance. There is no 'Use when...' clause or equivalent explicit trigger guidance, and per the rubric, a missing 'Use when...' clause should cap completeness at 2, but since the 'what' is also more topical than actionable, this falls to 1.

1 / 3

Trigger Term Quality

Contains relevant keywords like 'feature flags', 'rollout strategy', 'stale flag', 'technical debt', and 'targeting rules' that users might naturally mention. However, it misses common variations like 'feature toggles', 'feature switches', 'LaunchDarkly', 'flag cleanup', or 'canary release', which limits coverage of natural user language.

2 / 3

Distinctiveness Conflict Risk

Feature flags is a reasonably specific niche, but the broad framing as 'operational discipline' and inclusion of generic concepts like 'technical debt' and 'governance' could cause overlap with general DevOps, release management, or technical debt management skills.

2 / 3

Total

7

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
rampstackco/claude-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.