prompt-engineer

Writes, refactors, and evaluates prompts for LLMs — generating optimized prompt templates, structured output schemas, evaluation rubrics, and test suites. Use when designing prompts for new LLM applications, refactoring existing prompts for better accuracy or token efficiency, implementing chain-of-thought or few-shot learning, creating system prompts with personas and guardrails, building JSON/function-calling schemas, or developing prompt evaluation frameworks to measure and improve model performance.

Quality

75%

Does it follow best practices?

Run evals on this skill

Adds up to 20 points to the overall score

View guide

Securityby

Low

Low-risk findings worth noting

Fix and improve this skill with Tessl

tessl review fix ./skills/prompt-engineer/SKILL.md

Quality

Content

50%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill provides a reasonable overview of prompt engineering with useful concrete examples (zero-shot vs few-shot, before/after optimization) and a well-organized reference table. However, it leans toward describing a general methodology rather than providing Claude with specific, executable guidance it doesn't already know — much of the content (prompt engineering best practices, the iterative workflow) is knowledge Claude already possesses. The lack of bundle files undermines the progressive disclosure structure.

Suggestions

Remove or drastically shorten the constraints lists and workflow description — Claude already knows prompt engineering best practices. Focus on project-specific conventions, tool integrations, or non-obvious patterns.

Add concrete, executable evaluation examples — e.g., a specific Python script or command for measuring prompt accuracy, rather than the abstract '80% accuracy' threshold.

Provide the referenced bundle files (references/prompt-patterns.md, etc.) or remove the reference table if they don't exist, as broken references reduce trust in the skill.

Trim the 'When to Use This Skill' section — it largely restates the skill description and consumes tokens without adding actionable information.

Dimension	Reasoning	Score
Conciseness	The skill is reasonably well-structured but includes some unnecessary content that Claude already knows — e.g., explaining when to use zero-shot vs few-shot, the general workflow of 'understand requirements → design → test → iterate → deploy,' and the Coverage Note section which is largely filler. The constraints lists are somewhat verbose with items that are general best practices rather than skill-specific guidance.	2 / 3
Actionability	The before/after prompt examples are concrete and useful, and the zero-shot vs few-shot comparison is copy-paste ready. However, the core workflow is high-level process guidance rather than executable steps — there are no specific commands, tools, or scripts to run. The validation checkpoint mentions '80% accuracy' but doesn't specify how to measure it. Much of the actionable detail is deferred to reference files.	2 / 3
Workflow Clarity	The 5-step core workflow is clearly sequenced and includes a validation checkpoint at step 3, which is good. However, the checkpoint is vague ('if accuracy < 80%') without specifying measurement methodology, and there's no explicit feedback loop for steps 4-5. The workflow reads more like a general methodology description than a concrete operational procedure with verifiable gates.	2 / 3
Progressive Disclosure	The reference table is well-structured with clear 'Load When' guidance, which is a good progressive disclosure pattern. However, no bundle files were provided, so the six referenced files (references/prompt-patterns.md, etc.) cannot be verified to exist. The main SKILL.md also includes substantial inline content (examples, constraints, output templates) that could arguably be split out, making the overview heavier than ideal.	2 / 3
	Total	8 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that hits all the marks. It provides specific concrete actions, includes a comprehensive 'Use when...' clause with multiple natural trigger scenarios, and uses domain-specific terminology that clearly distinguishes it from other skills. The description is well-structured, uses third person voice correctly, and balances detail with conciseness.

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: writes/refactors/evaluates prompts, generates optimized prompt templates, structured output schemas, evaluation rubrics, and test suites. Very detailed and actionable.	3 / 3
Completeness	Clearly answers both 'what' (writes, refactors, evaluates prompts, generates templates/schemas/rubrics/test suites) and 'when' with an explicit 'Use when...' clause listing six specific trigger scenarios.	3 / 3
Trigger Term Quality	Excellent coverage of natural terms users would say: 'prompts', 'LLM', 'chain-of-thought', 'few-shot learning', 'system prompts', 'JSON', 'function-calling', 'token efficiency', 'evaluation', 'guardrails', 'personas'. These are terms prompt engineers and developers naturally use.	3 / 3
Distinctiveness Conflict Risk	Occupies a clear niche around prompt engineering specifically for LLMs. The specific triggers like 'chain-of-thought', 'few-shot learning', 'system prompts', 'function-calling schemas', and 'prompt evaluation frameworks' are highly distinctive and unlikely to conflict with general coding or writing skills.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: Jeffallan/claude-skills
Path: skills/prompt-engineer/SKILL.md
Commit: e8be415

Reviewed: about 5 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.