CtrlK
BlogDocsLog inGet started
Tessl Logo

github-release-management

Comprehensive GitHub release orchestration with AI swarm coordination for automated versioning, testing, deployment, and rollback management

51

1.62x
Quality

33%

Does it follow best practices?

Impact

78%

1.62x

Average score across 3 eval scenarios

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/github-release-management/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description suffers from buzzword-heavy language ('AI swarm coordination', 'orchestration') that obscures what the skill actually does concretely. It lacks a 'Use when...' clause, making it difficult for Claude to know when to select this skill. The core domain (GitHub releases) is identifiable but the specific capabilities and trigger conditions need to be much more explicit.

Suggestions

Add an explicit 'Use when...' clause with natural trigger terms like 'create a release', 'publish to GitHub', 'bump version', 'changelog', 'rollback release'.

Replace buzzwords like 'AI swarm coordination' and 'orchestration' with concrete actions such as 'creates GitHub releases, generates changelogs, bumps semantic versions, runs pre-release tests, and manages rollbacks'.

Include common file/format references users might mention, such as 'CHANGELOG.md', 'git tags', 'GitHub Actions', or 'release notes'.

DimensionReasoningScore

Specificity

Names the domain (GitHub releases) and lists some actions (versioning, testing, deployment, rollback management), but 'AI swarm coordination' and 'orchestration' are buzzwordy rather than concrete. It doesn't specify what concrete steps are performed (e.g., creating tags, writing changelogs, running CI pipelines).

2 / 3

Completeness

The description addresses 'what' at a high level but completely lacks any 'when' clause or explicit trigger guidance. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and since the 'what' is also vague and buzzword-heavy, this scores a 1.

1 / 3

Trigger Term Quality

Includes some relevant terms like 'GitHub release', 'versioning', 'deployment', and 'rollback', which users might naturally say. However, 'AI swarm coordination' and 'orchestration' are not terms users would typically use when requesting help, and common variations like 'tag', 'changelog', 'publish release', or 'semantic versioning' are missing.

2 / 3

Distinctiveness Conflict Risk

The mention of 'GitHub release' provides some specificity, but terms like 'deployment', 'testing', and 'versioning' are broad enough to overlap with CI/CD skills, deployment skills, or general DevOps skills. The 'AI swarm coordination' phrase is distinctive but not in a useful way since users wouldn't search for it.

2 / 3

Total

7

/

12

Passed

Implementation

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill is extremely bloated and reads more like aspirational product documentation for the claude-flow tool than a focused, actionable skill for Claude. The core useful content (gh CLI commands, GitHub Actions workflows, release checklists) is buried under hundreds of lines of speculative CLI commands, verbose best practices Claude already knows, and enterprise configuration that most users won't need. The skill would benefit enormously from being cut to ~20% of its current size.

Suggestions

Cut the content to under 150 lines by removing sections Claude already knows (release cadence best practices, documentation standards, what semantic versioning is) and moving enterprise/advanced content to separate referenced files.

Clearly distinguish between real, executable commands (gh CLI, npm, git) and aspirational/hypothetical claude-flow commands - or remove the latter if they don't actually exist yet.

Add explicit validation checkpoints with error recovery steps (e.g., 'If npm publish fails, check authentication with `npm whoami` and retry') instead of relying on opaque tool flags like --auto-rollback.

Replace the pseudo-JavaScript 'Single Message' syntax with actual executable code or clearly document what runtime/tool interprets that syntax.

DimensionReasoningScore

Conciseness

Extremely verbose at 700+ lines. Massive amounts of content that Claude already knows (best practices for release cadence, documentation standards, what semantic versioning is). The YAML configs, multiple agent specializations, and enterprise features sections are padded with aspirational CLI commands for a tool (claude-flow) whose actual capabilities are unclear, making much of this speculative filler rather than actionable content.

1 / 3

Actionability

The basic gh CLI commands and GitHub Actions workflows are concrete and mostly executable. However, the majority of the skill relies on `npx claude-flow` commands with numerous flags that appear to be hypothetical/aspirational rather than documented real commands. The JavaScript 'Single Message' blocks use a pseudo-syntax that isn't standard or executable.

2 / 3

Workflow Clarity

The GitHub Actions workflow provides a clear sequential pipeline, and the checklists at the end are useful. However, validation checkpoints are mostly implicit - there's no explicit 'if this fails, do X' feedback loop in most workflows. The hotfix and rollback procedures lack verification steps to confirm the rollback succeeded. The staged deployment config shows thresholds but the actual validation/retry logic is delegated to opaque CLI commands.

2 / 3

Progressive Disclosure

The skill attempts progressive disclosure with labeled levels (1-4), but all content is inlined in a single massive file rather than split into referenced documents. The 'Level 4 - Enterprise Features' section alone is hundreds of lines that should be in a separate file. The related resources section links to external docs but the core content is monolithic.

2 / 3

Total

7

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

skill_md_line_count

SKILL.md is long (1082 lines); consider splitting into references/ and linking

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Repository
ruvnet/agentic-flow
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.