CtrlK
BlogDocsLog inGet started
Tessl Logo

feature-flagging

Operational discipline for feature flags as production infrastructure. Flag types, naming, targeting rules, rollout strategy, lifecycle, governance, stale flag management, and the technical debt patterns that bite teams who weren't deliberate about it.

36

Quality

33%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./skills/feature-flagging/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description identifies a clear domain (feature flags) and enumerates relevant subtopics, but reads more like a course syllabus than an actionable skill description. It lacks concrete actions Claude would perform, omits a 'Use when...' clause entirely, and misses common synonym trigger terms like 'feature toggles' or 'feature switches'. The topical specificity of feature flags provides some natural distinctiveness, but the description needs significant improvement to function well in a multi-skill selection context.

Suggestions

Add an explicit 'Use when...' clause, e.g., 'Use when the user asks about feature flags, feature toggles, rollout strategies, flag cleanup, or managing feature flag technical debt.'

Replace the topic list with concrete actions Claude performs, e.g., 'Designs flag naming conventions, creates rollout plans with percentage-based targeting, audits codebases for stale flags, and recommends flag lifecycle policies.'

Include common synonym trigger terms such as 'feature toggles', 'feature switches', 'kill switches', 'canary releases', and specific tool names like 'LaunchDarkly' or 'Unleash' if applicable.

DimensionReasoningScore

Specificity

The description names the domain (feature flags) and lists several topic areas (flag types, naming, targeting rules, rollout strategy, lifecycle, governance, stale flag management, technical debt patterns), but these read more like a table of contents than concrete actions. It doesn't specify what Claude actually does with these topics (e.g., 'creates rollout plans', 'audits stale flags', 'generates naming conventions').

2 / 3

Completeness

The description covers 'what' (operational discipline for feature flags across several dimensions) but completely lacks a 'Use when...' clause or any explicit trigger guidance for when Claude should select this skill. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and the 'what' itself is more topical than actionable, pushing this to 1.

1 / 3

Trigger Term Quality

Includes relevant terms like 'feature flags', 'rollout strategy', 'stale flag management', and 'technical debt' that users might naturally mention. However, it misses common variations like 'feature toggles', 'feature switches', 'LaunchDarkly', 'flag cleanup', 'canary release', or 'percentage rollout' that users would likely say.

2 / 3

Distinctiveness Conflict Risk

Feature flags is a reasonably specific niche, which helps distinctiveness. However, the mention of 'technical debt patterns' and 'governance' could overlap with broader engineering practices or code quality skills. The lack of explicit triggers increases the risk of incorrect activation or missed activation.

2 / 3

Total

7

/

12

Passed

Implementation

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a comprehensive knowledge document about feature flag operational discipline, but it reads more like a blog post or engineering handbook chapter than a skill file for Claude. It is significantly too verbose, repeating concepts Claude already understands, and lacks executable artifacts (code, commands, config snippets). The progressive disclosure structure is well-intentioned with seven reference files, but the main file inlines too much detail that belongs in those references, and the references themselves don't exist in the bundle.

Suggestions

Cut the content by 50-60%: remove the introductory philosophy paragraphs, the 'what this skill is for' section, the closing section, and the 14-point framework (which restates everything above it). Trust that Claude understands what feature flags are.

Add executable artifacts: include a concrete flag metadata template (YAML/JSON), a sample PR description for flag creation, a sample stale-flag audit query, or a concrete checklist in checkbox format rather than prose.

Move the detailed sections (targeting rules, rollout strategies, performance, testing, governance) into the referenced files and keep only 2-3 sentence summaries with links in the main SKILL.md, making it a true overview.

Add explicit validation gates with concrete thresholds: e.g., 'error rate < baseline + 0.1% for 1 hour before advancing' rather than 'watch error rates and latency'.

DimensionReasoningScore

Conciseness

The skill is extremely verbose at ~3000+ words. It explains concepts Claude already knows (what feature flags are, what kill switches do, basic caching concepts, what AND/OR/NOT means). Many sections repeat information (e.g., the 14-point framework restates everything already covered). The introductory paragraphs, the 'what this skill is for' section, and the closing section are largely padding that could be cut entirely.

1 / 3

Actionability

The skill provides concrete naming conventions, specific percentage rollout numbers (1% → 10% → 25% → 50% → 100%), permission tiers, and a cleanup playbook with steps. However, there is zero executable code, no CLI commands, no platform-specific examples, and no copy-paste-ready artifacts. The guidance is specific but descriptive rather than executable.

2 / 3

Workflow Clarity

The flag lifecycle phases (birth through death) are clearly sequenced, and the stale flag cleanup has numbered steps. However, validation checkpoints are mostly implicit ('monitor at each step') rather than explicit with concrete criteria. The rollout strategy says 'watch error rates' but doesn't specify thresholds or what constitutes a pass/fail gate. The rollback section lacks a concrete decision tree.

2 / 3

Progressive Disclosure

The skill references seven well-organized reference files with clear one-level-deep links, which is good structure. However, no bundle files were provided, so the references are broken. The main SKILL.md itself is a monolithic wall of text that inlines extensive detail that should live in the referenced files (e.g., the full targeting rules section, the full rollout strategies section, the performance section). The overview doesn't stay at overview level.

2 / 3

Total

7

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
rampstackco/claude-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.