Design multi-stage CI/CD pipelines with approval gates, security checks, and deployment orchestration. Use when architecting deployment workflows, setting up continuous delivery, or implementing GitOps practices.
75
63%
Does it follow best practices?
Impact
91%
1.12xAverage score across 3 eval scenarios
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./tests/ext_conformance/artifacts/agents-wshobson/cicd-automation/skills/deployment-pipeline-design/SKILL.mdQuality
Discovery
92%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong description that clearly articulates specific capabilities and includes an explicit 'Use when' clause with natural trigger terms. The main weakness is a slight risk of overlap with adjacent DevOps or infrastructure skills due to broad terms like 'deployment' and 'security checks', but overall it performs well across all dimensions.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: 'multi-stage CI/CD pipelines', 'approval gates', 'security checks', and 'deployment orchestration'. These are distinct, concrete capabilities rather than vague abstractions. | 3 / 3 |
Completeness | Clearly answers both 'what' (design multi-stage CI/CD pipelines with approval gates, security checks, deployment orchestration) and 'when' (explicit 'Use when' clause covering architecting deployment workflows, setting up continuous delivery, or implementing GitOps practices). | 3 / 3 |
Trigger Term Quality | Includes strong natural keywords users would say: 'CI/CD pipelines', 'approval gates', 'deployment workflows', 'continuous delivery', 'GitOps'. These cover common variations of how users would describe this need. | 3 / 3 |
Distinctiveness Conflict Risk | While CI/CD and GitOps are fairly specific, terms like 'deployment workflows' and 'security checks' could overlap with infrastructure, DevOps, or security-focused skills. The niche is reasonably clear but not as tightly scoped as something like PDF processing. | 2 / 3 |
Total | 11 / 12 Passed |
Implementation
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill reads more like a comprehensive tutorial or reference guide on CI/CD concepts than a targeted skill for Claude. It over-explains well-known deployment patterns (rolling, blue-green, canary, feature flags) and DORA metrics that Claude already understands, resulting in significant token waste. The concrete YAML examples are a strength, but the content would benefit greatly from being trimmed to only project-specific conventions, non-obvious patterns, and critical validation steps.
Suggestions
Drastically reduce content by removing explanations of standard concepts Claude already knows (deployment strategy definitions, DORA metrics, generic best practices list) and focus only on project-specific pipeline conventions and non-obvious patterns.
Add explicit validation checkpoints and feedback loops for failure scenarios—e.g., what happens when integration tests fail after staging deploy, or when canary metrics exceed thresholds, with concrete remediation steps.
Move deployment strategies, monitoring integration, and rollback patterns into separate referenced files to reduce the main skill to a concise overview with clear navigation.
Ensure referenced files (assets/approval-gate-template.yml, references/pipeline-orchestration.md) actually exist in the bundle, or remove the references to avoid dead links.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~300 lines, explaining well-known concepts like deployment strategies (rolling, blue-green, canary) and DORA metrics that Claude already knows. The best practices section is a generic list of platitudes. Much of this content adds no novel knowledge beyond what Claude can already produce. | 1 / 3 |
Actionability | The skill provides concrete YAML and code examples that are mostly executable (GitHub Actions, Kubernetes manifests, Argo Rollouts), but many are incomplete or require significant context (e.g., missing Dockerfiles, missing k8s manifests). The feature flags example uses a specific SDK but is more illustrative than copy-paste ready in a real pipeline context. | 2 / 3 |
Workflow Clarity | The multi-stage pipeline example shows a clear sequence with stages from build through verification, and the rollback section includes automated health-check-then-rollback logic. However, validation checkpoints are inconsistent—the staging deploy has no verification before proceeding, and there's no explicit feedback loop for what to do when integration tests fail or when canary metrics degrade. | 2 / 3 |
Progressive Disclosure | The skill references external files (assets/approval-gate-template.yml, references/pipeline-orchestration.md) and related skills, which is good structure. However, no bundle files are provided, making these references dead links. The main file itself is monolithic—deployment strategies, monitoring, and rollback could each be separate referenced documents rather than inline. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
99da384
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.