Design multi-stage CI/CD pipelines with approval gates, security checks, and deployment orchestration. Use when architecting deployment workflows, setting up continuous delivery, or implementing GitOps practices.
83
78%
Does it follow best practices?
Impact
91%
1.12x
Average score across 3 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./tests/ext_conformance/artifacts/agents-wshobson/cicd-automation/skills/deployment-pipeline-design/SKILL.md
Quality
Discovery
92%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong description that clearly articulates specific capabilities and includes an explicit 'Use when' clause with natural trigger terms. The main weakness is moderate overlap risk with adjacent DevOps, infrastructure, or security skills due to broad terms like 'deployment workflows' and 'security checks'. Overall, it follows best practices and would perform well in skill selection.
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Lists multiple specific concrete actions: 'multi-stage CI/CD pipelines', 'approval gates', 'security checks', and 'deployment orchestration'. These are distinct, concrete capabilities rather than vague abstractions. | 3 / 3 |
| Completeness | Clearly answers both 'what' (design multi-stage CI/CD pipelines with approval gates, security checks, deployment orchestration) and 'when' (explicit 'Use when' clause covering architecting deployment workflows, setting up continuous delivery, or implementing GitOps practices). | 3 / 3 |
| Trigger Term Quality | Includes strong natural keywords users would say: 'CI/CD pipelines', 'approval gates', 'deployment workflows', 'continuous delivery', 'GitOps'. These cover common variations of how users would describe this need. | 3 / 3 |
| Distinctiveness Conflict Risk | While CI/CD and GitOps are fairly specific, terms like 'deployment workflows' and 'security checks' could overlap with infrastructure, DevOps, or security-focused skills. The niche is reasonably clear but not perfectly distinct from adjacent deployment or DevOps skills. | 2 / 3 |
| Total | | 11 / 12 Passed |
Implementation
64%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill provides strong, actionable code examples across multiple CI/CD platforms and deployment strategies, making it highly practical. However, it's somewhat verbose with explanatory content Claude doesn't need (deployment strategy characteristics, generic best practices list), and the workflow guidance could better integrate validation checkpoints and error recovery loops throughout rather than isolating them in one section. The document would benefit from splitting some sections into referenced files to improve progressive disclosure.
Suggestions
- Trim or remove the 'Characteristics' bullet lists under each deployment strategy and the generic 'Best Practices' numbered list; Claude already knows these concepts and the tokens would be better spent on more specific guidance.
- Integrate explicit validation/verification checkpoints into the main pipeline workflow description rather than only showing them in the isolated rollback section, e.g., add a clear 'if verification fails, do X' feedback loop to the multi-stage pipeline example.
- Move the detailed deployment strategies section and monitoring/metrics section into separate referenced files to reduce the main skill's length and improve progressive disclosure.
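The second suggestion (an explicit "if verification fails, do X" feedback loop) could be sketched roughly as follows. This is only an illustration of the control flow, not the skill's actual pipeline: the `Stage` type, `run_pipeline`, and the log strings are hypothetical names introduced here, and a real pipeline would express the same idea in its CI platform's own configuration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Stage:
    """One pipeline stage (hypothetical structure, for illustration only)."""
    name: str
    deploy: Callable[[], None]    # performs the deployment for this stage
    verify: Callable[[], bool]    # health check: True means the stage is healthy
    rollback: Callable[[], None]  # undoes this stage's deployment


def run_pipeline(stages: List[Stage]) -> List[str]:
    """Run stages in order; on a failed verification, roll back and stop."""
    log: List[str] = []
    for stage in stages:
        stage.deploy()
        log.append(f"deployed:{stage.name}")
        if not stage.verify():
            # Feedback loop: a failed checkpoint triggers rollback of the
            # failing stage and halts promotion to any later stage.
            stage.rollback()
            log.append(f"rolled_back:{stage.name}")
            break
        log.append(f"verified:{stage.name}")
    return log
```

The point of the sketch is that every stage carries its own verification checkpoint and recovery action, so the error path is part of the main workflow rather than a separate section.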
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is fairly comprehensive but includes some unnecessary padding, e.g., the 'Characteristics' bullet lists under each deployment strategy explain concepts Claude already knows (what blue-green or canary deployments are), and the 'Best Practices' section is a generic list of platitudes without actionable detail. The 'When to Use' section is also somewhat redundant. However, the code examples themselves are reasonably lean. | 2 / 3 |
| Actionability | The skill provides fully executable, copy-paste-ready YAML and code examples across multiple CI/CD platforms (GitHub Actions, GitLab CI, Azure Pipelines), Kubernetes manifests, Argo Rollouts specs, and concrete bash commands for rollback. These are specific and directly usable. | 3 / 3 |
| Workflow Clarity | The multi-stage pipeline example shows a clear sequence from build through verification, and the rollback section includes automated health-check-then-rollback logic. However, the overall document presents many patterns without explicit validation checkpoints between stages or clear feedback loops for error recovery in most sections. The automated rollback section is good but is isolated rather than integrated into the main workflow guidance. | 2 / 3 |
| Progressive Disclosure | The skill references external files (approval-gate-template.yml, pipeline-orchestration.md) and related skills, which is good. However, the main document is quite long (~250 lines of content) and includes substantial inline detail (deployment strategies, monitoring, metrics) that could be split into separate reference files. The structure is reasonable but the document is somewhat monolithic. | 2 / 3 |
| Total | | 9 / 12 Passed |
Validation
100%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.