CtrlK
BlogDocsLog inGet started
Tessl Logo

peteski22/python-style

Validate Python code against style and architectural conventions

49

Quality

62%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, highly actionable skill that provides precise, executable guidance for Python code validation. Its main strength is the specificity of rules—concrete examples of violations, exact output schema, and clear severity classification. Its main weakness is moderate verbosity: some sections (scope exclusions, anti-pattern propagation warnings, operating rules) could be tightened, and the architecture rules are dense enough to warrant a separate reference file.

Suggestions

Trim the scope declaration's 'MUST NOT report on' list and the operating rules preamble—Claude doesn't need repeated reminders to not invent rules or apply personal preference.

Consider extracting the detailed architecture rules into a separate ARCHITECTURE_RULES.md file and referencing it from the main skill, improving progressive disclosure and reducing cognitive load.

DimensionReasoningScore

Conciseness

The skill is thorough but includes some redundancy. The scope declaration listing what it does NOT check is useful but slightly verbose. The anti-pattern propagation section and some operating rules restate things Claude would naturally understand. The architecture rules are detailed and earn their tokens, but the overall document could be tightened by ~20%.

2 / 3

Actionability

Highly actionable: provides executable bash commands for input gathering, precise rule definitions with concrete examples (e.g., `str | None` not `Optional[str]`, `list[str]` not `List[str]`), specific architectural boundary rules with named directories, and an exact JSON output schema. The distinction between repository and service behavior includes concrete examples like `get_user_by_id` vs `create_order`.

3 / 3

Workflow Clarity

The workflow is clearly sequenced: get changed files (with fallback order), read files, evaluate rules in listed order, categorize findings by severity, produce structured output. The input section has explicit fallback steps. The severity hierarchy (HARD > SHOULD > WARN) with clear pass/fail criteria (`pass: false` if hard_count > 0 or should_count > 0) serves as a validation checkpoint. The root-cause suppression rule for derivative violations prevents cascading false positives.

3 / 3

Progressive Disclosure

The content is well-organized with clear sections and headers, but it's a monolithic ~200-line document with no references to external files. The architecture rules section alone is quite dense and could benefit from being split into a separate reference file. However, since no bundle files exist, there's nothing to reference, and the content is reasonably navigable via its heading structure.

2 / 3

Total

10

/

12

Passed

Description

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is a single clause that identifies the domain (Python code) and a high-level action (validate against style and architectural conventions) but lacks specificity about what concrete checks are performed. It is missing a 'Use when...' clause entirely, which significantly hurts completeness and trigger quality. Adding explicit trigger terms and concrete action examples would substantially improve skill selection accuracy.

Suggestions

Add a 'Use when...' clause with natural trigger terms like 'lint', 'code review', 'PEP 8', 'style check', 'code conventions', or 'Python formatting'.

List specific concrete actions such as 'checks import ordering, enforces naming conventions, validates type annotations, verifies module structure'.

Clarify what 'architectural conventions' means with examples (e.g., 'layer separation, dependency rules, module organization') to improve distinctiveness from generic linting skills.

DimensionReasoningScore

Specificity

Names the domain (Python code) and a general action (validate against style and architectural conventions), but doesn't list specific concrete actions like checking import order, enforcing naming conventions, verifying type hints, etc.

2 / 3

Completeness

Describes what the skill does (validate Python code against conventions) but completely lacks a 'Use when...' clause or any explicit trigger guidance, which per the rubric should cap completeness at 2, and the 'what' itself is also fairly thin, warranting a 1.

1 / 3

Trigger Term Quality

Includes some relevant keywords like 'Python', 'validate', 'style', and 'conventions', but misses common user terms like 'lint', 'linting', 'code review', 'PEP 8', 'formatting', 'code quality', or 'style guide'.

2 / 3

Distinctiveness Conflict Risk

Somewhat specific to Python validation and conventions, but could overlap with general code review skills, Python formatting skills, or linting tools. The mention of 'architectural conventions' adds some distinction but is vague.

2 / 3

Total

7

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Reviewed

Table of Contents