Master advanced prompt engineering techniques to maximize LLM performance, reliability, and controllability in production. Use when optimizing prompts, improving LLM outputs, or designing production prompt templates.
Average score across 3 eval scenarios: 1.16x
Passed: no known issues
Quality
Discovery
67%. Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
The description has good structural completeness with an explicit 'Use when' clause, but suffers from moderate vagueness in its capability descriptions—using aspirational language like 'Master advanced techniques' and 'maximize performance' rather than listing concrete actions. The trigger terms cover the basics but miss many natural user phrasings for prompt engineering tasks.
Suggestions
Replace vague phrases like 'maximize LLM performance, reliability, and controllability' with specific concrete actions such as 'design system prompts, structure few-shot examples, implement chain-of-thought reasoning, reduce hallucinations'.
Expand trigger terms in the 'Use when' clause to include natural variations like 'system prompt', 'few-shot examples', 'chain of thought', 'prompt tuning', 'model instructions', or 'reduce hallucinations'.
Remove the imperative 'Master' framing and use third-person declarative voice, e.g., 'Teaches advanced prompt engineering techniques...' or 'Applies advanced prompt engineering techniques...'
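Taken together, the three suggestions above could yield frontmatter along these lines (an illustrative sketch only, not the skill's actual metadata; the `name` field is assumed from the file path):

```yaml
---
name: prompt-engineering-patterns
description: >
  Applies advanced prompt engineering techniques: designs system prompts,
  structures few-shot examples, implements chain-of-thought reasoning, and
  reduces hallucinations. Use when optimizing prompts, writing system
  prompts, tuning few-shot examples, adding chain of thought, or designing
  production prompt templates.
---
```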
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | Names the domain ('prompt engineering') and mentions some actions ('optimizing prompts', 'improving LLM outputs', 'designing production prompt templates'), but these are fairly high-level and not concrete, specific actions like 'chain-of-thought structuring, few-shot example selection, system prompt design'. | 2 / 3 |
| Completeness | Clearly answers both 'what' (master advanced prompt engineering techniques for LLM performance, reliability, controllability) and 'when' (explicit 'Use when optimizing prompts, improving LLM outputs, or designing production prompt templates'). | 3 / 3 |
| Trigger Term Quality | Includes some relevant keywords like 'prompt engineering', 'LLM', 'prompts', 'production prompt templates', but misses many natural variations users might say such as 'system prompt', 'few-shot', 'chain of thought', 'prompt tuning', 'prompt design', 'AI instructions', or 'model instructions'. | 2 / 3 |
| Distinctiveness / Conflict Risk | The domain of 'prompt engineering' is reasonably specific, but terms like 'improving LLM outputs' and 'maximize LLM performance' are broad enough to potentially overlap with skills related to general LLM usage, AI development, or model evaluation. | 2 / 3 |
| Total | | 9 / 12 (Passed) |
Implementation
14%. Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill is an overly verbose, encyclopedic reference on prompt engineering that explains many concepts Claude already knows deeply. While it contains some executable code examples, it lacks clear workflows, validation steps, and proper content organization. The skill would be far more effective as a concise overview pointing to separate pattern files, focusing only on project-specific conventions or non-obvious techniques rather than general prompt engineering knowledge.
Suggestions
Reduce content by 70%+ by removing sections Claude already knows (Best Practices, Common Pitfalls, Success Metrics, Core Capabilities bullet lists) and focus only on project-specific patterns or non-obvious techniques.
Add a clear iterative workflow with validation steps, e.g.: 1. Start with simple prompt → 2. Test on 5 representative inputs → 3. Identify failure modes → 4. Add constraints/examples → 5. Re-test and compare metrics.
Split the 6+ patterns into separate files (e.g., patterns/structured-output.md, patterns/few-shot.md) and keep SKILL.md as a concise overview with one-level-deep references.
Replace the generic code examples with more specific, copy-paste-ready templates that include error handling and real validation checkpoints rather than illustrative snippets.
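The iterative workflow in the second suggestion can be sketched as a small eval loop. This is a minimal illustration, not the skill's own code: `run_prompt` is a hypothetical stand-in for a real model call (here a canned classifier so the loop runs offline), and the refinement step simply appends failing cases as few-shot examples.

```python
def run_prompt(prompt_template: str, text: str) -> str:
    # Placeholder for a real LLM call; returns a canned label so the
    # surrounding workflow can be exercised without an API key.
    return "positive" if "love" in text.lower() else "negative"

def evaluate(prompt_template: str, cases: list[tuple[str, str]]) -> float:
    """Score a prompt against labelled (text, expected) cases; returns accuracy."""
    hits = sum(run_prompt(prompt_template, text) == expected
               for text, expected in cases)
    return hits / len(cases)

def refine(prompt_template: str, failures: list[tuple[str, str]]) -> str:
    """Naive refinement step: fold failing cases back in as few-shot examples."""
    examples = "\n".join(f"Text: {t}\nLabel: {e}" for t, e in failures)
    return f"{prompt_template}\n\nExamples:\n{examples}"

def iterate(prompt_template: str, cases: list[tuple[str, str]], rounds: int = 3):
    """Start simple, test, identify failures, add examples, re-test and compare."""
    best, best_score = prompt_template, evaluate(prompt_template, cases)
    for _ in range(rounds):
        failures = [(t, e) for t, e in cases if run_prompt(best, t) != e]
        if not failures:            # validation checkpoint: stop when clean
            break
        candidate = refine(best, failures)
        score = evaluate(candidate, cases)
        if score > best_score:      # keep only measurable improvements
            best, best_score = candidate, score
    return best, best_score
```

The point of the sketch is the shape of the loop (test, diagnose, refine, re-test against a fixed set of representative inputs), not the toy refinement strategy.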
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | This skill is extremely verbose (~400+ lines) and largely explains concepts Claude already knows well—prompt engineering patterns, chain-of-thought, few-shot learning, system prompt design. Much of the content is tutorial-level explanation of widely known techniques. The 'When to Use This Skill', 'Best Practices', 'Common Pitfalls', and 'Success Metrics' sections are almost entirely things Claude already understands. The bullet-point lists of capabilities add little actionable value. | 1 / 3 |
| Actionability | The code examples are mostly executable and use real libraries (LangChain, Anthropic SDK, Pydantic), which is good. However, many patterns are fairly generic templates rather than production-ready solutions—e.g., the sentiment analysis example uses raw JSON parsing instead of Anthropic's native structured output, and the few-shot example requires specific dependencies without installation guidance. The PROMPT_LEVELS dictionary is illustrative but not directly actionable for a specific task. | 2 / 3 |
| Workflow Clarity | There is no clear multi-step workflow or sequenced process. The skill reads as a reference catalog of patterns rather than a guided workflow. There are no validation checkpoints, no 'do this then verify that' sequences, and no feedback loops for iterating on prompts. The 'Iterative refinement workflows' mentioned in Core Capabilities is never actually described with steps. | 1 / 3 |
| Progressive Disclosure | Everything is inlined in one massive monolithic file with no references to external files for detailed content. The Core Capabilities section lists topics at a high level but then the patterns section dumps all the detail inline. The external links at the bottom are to third-party docs, not to organized companion files. The content would benefit enormously from splitting patterns into separate files with a concise overview in the main skill. | 1 / 3 |
| Total | | 5 / 12 (Passed) |
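The Actionability row faults the skill's sentiment example for raw JSON parsing without error handling. A stdlib-only sketch of the missing hardening might look like the following (illustrative, not the skill's code; `parse_sentiment` and the `label`/`confidence` fields are assumptions about that example's output shape):

```python
import json
from dataclasses import dataclass

ALLOWED_LABELS = {"positive", "negative", "neutral"}

@dataclass
class Sentiment:
    label: str
    confidence: float

def parse_sentiment(raw: str) -> Sentiment:
    """Parse and validate a model's JSON reply; raise ValueError on bad output."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"model returned non-JSON output: {exc}") from exc
    label = data.get("label")
    confidence = data.get("confidence")
    if label not in ALLOWED_LABELS:          # validation checkpoint 1
        raise ValueError(f"unexpected label: {label!r}")
    if not isinstance(confidence, (int, float)) or not 0.0 <= confidence <= 1.0:
        raise ValueError(f"confidence out of range: {confidence!r}")  # checkpoint 2
    return Sentiment(label=label, confidence=float(confidence))
```

In a real template the caller would catch `ValueError` and retry with a repair prompt (or, as the review suggests, sidestep hand parsing entirely via the SDK's native structured-output support).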
Validation
100%. Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.