CtrlK
BlogDocsLog inGet started
Tessl Logo

quality-gate-size-analysis

Analyze static quality gate on-disk size changes, correlate with Confluence exception records and GitHub PRs by milestone

52

Quality

57%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./.claude/skills/quality-gate-size-analysis/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a solid, highly actionable skill that provides concrete commands, specific metric names, and clear data extraction instructions for a complex cross-system correlation workflow. Its main weaknesses are the lack of validation checkpoints in a multi-step process involving external APIs (which could fail or return unexpected data), and the monolithic structure that could benefit from splitting detailed reference material into supporting files. The domain-specific details in the 'Key Details and Pitfalls' section are excellent and add genuine value.

Suggestions

Add explicit validation checkpoints after each data-fetching step (e.g., 'Verify Datadog returned non-empty results for the time range', 'Confirm at least N exception pages were found, excluding the template')

Add a feedback loop for the correlation step: 'If coverage gaps exceed X%, re-check the time window and search terms before finalizing the report'

Consider extracting the Datadog metric reference (Step 1c) and the report template structure (Step 5) into separate bundle files to improve progressive disclosure

DimensionReasoningScore

Conciseness

The content is reasonably efficient and domain-specific, but includes some redundant explanations (e.g., restating what milestone means, explaining what CQL ancestor does). Some sections could be tightened—the opening sentence repeats the title, and certain instructions are slightly verbose for Claude's level of competence.

2 / 3

Actionability

The skill provides concrete, executable queries (Datadog metric queries, CQL search syntax, exact gh CLI commands with --json flags), specific field names to extract from Confluence pages, and precise output format specifications. The commands are copy-paste ready with clear parameterization.

3 / 3

Workflow Clarity

The five-step workflow is clearly sequenced and logically ordered (query metrics → fetch exceptions → pull PRs → correlate → report). However, there are no explicit validation checkpoints or feedback loops—for instance, no step to verify that Datadog queries returned valid data, no error handling for missing Confluence pages, and no verification of the final report's completeness before writing.

2 / 3

Progressive Disclosure

The content is well-structured with clear headers and sub-steps, but it's a monolithic document (~120 lines) with no references to supporting files. The detailed Datadog query syntax, Confluence extraction logic, and report template could be split into separate reference files. However, given no bundle files exist, the inline approach is somewhat justified.

2 / 3

Total

9

/

12

Passed

Description

50%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is highly specific and distinctive, naming a clear niche of correlating quality gate size changes with Confluence and GitHub data. However, it completely lacks a 'Use when...' clause, making it harder for Claude to know when to select this skill. The trigger terms are domain-specific and may not match how users naturally phrase requests.

Suggestions

Add a 'Use when...' clause specifying trigger scenarios, e.g., 'Use when the user asks about binary size regressions, quality gate failures related to size, or needs to correlate size changes with PRs or Confluence exceptions.'

Include natural language variations users might say, such as 'size regression', 'binary size budget', 'size increase', 'artifact size', or 'build size analysis'.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: analyzing static quality gate on-disk size changes, correlating with Confluence exception records, and correlating with GitHub PRs by milestone. These are concrete, well-defined operations.

3 / 3

Completeness

Describes what the skill does but completely lacks a 'Use when...' clause or any explicit trigger guidance. Per the rubric, a missing 'Use when...' clause caps completeness at 2, and since the 'when' is entirely absent and the 'what' is only moderately clear, this scores low.

1 / 3

Trigger Term Quality

Includes some relevant keywords like 'quality gate', 'on-disk size', 'Confluence', 'GitHub PRs', and 'milestone', but these are fairly specialized terms. Missing common variations or natural phrases a user might say (e.g., 'binary size', 'size regression', 'size budget').

2 / 3

Distinctiveness Conflict Risk

Highly specific niche combining static quality gates, on-disk size changes, Confluence exception records, and GitHub PRs by milestone. This very particular combination is unlikely to conflict with other skills.

3 / 3

Total

9

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
DataDog/datadog-agent
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.