Deploy, evaluate, fine-tune, and manage Foundry agents end-to-end: Docker build, ACR push, hosted/prompt agent create, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, agent.yaml, dataset curation from traces, model fine-tuning (SFT/DPO/RFT). USE FOR: deploy agent, hosted agent, create agent, add tool to agent, invoke agent, evaluate agent, continuous eval, continuous monitoring, optimize prompt, improve prompt, optimize agent instructions, agent optimizer, deploy model, Foundry project, RBAC, role assignment, permissions, quota, capacity, region, troubleshoot agent, deployment failure, AI Services, create Foundry resource, provision, knowledge index, customize deployment, onboard, availability, fine-tune, SFT, DPO, RFT, training-data, grader, distillation, fine-tuned model, large file upload. DO NOT USE FOR: Azure Functions, App Service, general Azure deploy (use azure-deploy), general Azure prep (use azure-prepare).
59
67%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./plugin/skills/microsoft-foundry/SKILL.mdQuality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is an excellent skill description that comprehensively covers specific capabilities, provides extensive natural trigger terms, clearly answers both what and when, and explicitly disambiguates from related skills. The 'USE FOR' and 'DO NOT USE FOR' clauses are particularly effective for skill selection. The only minor concern is that the description is quite dense, but the information density is justified given the breadth of the skill.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | The description lists numerous specific concrete actions: Docker build, ACR push, hosted/prompt agent create, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, dataset curation from traces, model fine-tuning (SFT/DPO/RFT), and more. This goes well beyond naming a domain and provides comprehensive action coverage. | 3 / 3 |
Completeness | The description clearly answers 'what' (deploy, evaluate, fine-tune, manage Foundry agents with specific sub-capabilities) and 'when' (explicit 'USE FOR' clause with detailed trigger scenarios). It also includes a 'DO NOT USE FOR' clause that further clarifies boundaries, which is excellent for disambiguation. | 3 / 3 |
Trigger Term Quality | The 'USE FOR' section provides extensive natural trigger terms users would actually say: 'deploy agent', 'create agent', 'evaluate agent', 'optimize prompt', 'fine-tune', 'troubleshoot agent', 'deployment failure', etc. It also includes acronyms (SFT, DPO, RFT, RBAC) and variations that cover many user phrasings. | 3 / 3 |
Distinctiveness Conflict Risk | The description is highly specific to Foundry agents and explicitly disambiguates from related skills with the 'DO NOT USE FOR' clause referencing azure-deploy and azure-prepare. This clear niche and explicit exclusions make it very unlikely to conflict with other skills. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
35%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill functions well as a routing hub with comprehensive sub-skill references and clear intent-to-workflow mapping tables. However, it suffers from significant verbosity — the 7-step context resolution process and metadata management rules are overly detailed for a top-level skill file and should be extracted into reference documents. The lack of executable examples and validation checkpoints weakens actionability and workflow clarity.
Suggestions
Extract Steps 1-7 ('Common Project Context Resolution') into a separate reference file (e.g., references/context-resolution.md) and replace with a 3-5 line summary linking to it — this alone would cut the skill by ~60%.
Add a concrete executable example showing a typical MCP tool invocation pattern (e.g., the mandatory `foundry` discovery call) so Claude has a copy-paste-ready starting point.
Move the '.foundry Workspace Standard' section and metadata overlay rules to a reference file, keeping only a 2-line summary with a link in the main skill.
Add explicit validation checkpoints to the context resolution workflow, e.g., 'After Step 2, verify project endpoint is reachable before proceeding' with specific tool calls to validate.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose at ~250+ lines. It includes extensive metadata resolution logic (Steps 1-7) that reads more like an internal design document than actionable guidance. Much of the content (environment resolution, metadata overlay rules, legacy migration logic) could be dramatically condensed or moved to reference files. Claude doesn't need paragraphs explaining priority-to-tag mapping rules inline. | 1 / 3 |
Actionability | The routing tables and sub-skill references are concrete and useful, and the tool usage conventions provide some actionable guidance. However, the skill lacks executable code examples or specific MCP tool invocation patterns. Steps 1-7 describe resolution logic in prose rather than providing concrete commands or code snippets Claude can execute directly. | 2 / 3 |
Workflow Clarity | The Agent Development Lifecycle table provides clear sequencing of sub-skills, and Steps 1-7 outline a resolution process. However, validation checkpoints are largely absent — there's no explicit 'verify this succeeded before proceeding' pattern. The resolution steps describe complex conditional logic but lack error recovery or feedback loops for when resolution fails or conflicts arise beyond 'stop and ask.' | 2 / 3 |
Progressive Disclosure | The skill correctly delegates to 16+ sub-skill documents via a well-organized table with clear routing descriptions. However, the main SKILL.md itself contains far too much inline content — the entire 7-step 'Common Project Context Resolution' section and the '.foundry Workspace Standard' section should be in separate reference files. The balance between overview and detail is off, with the main file acting as both router and detailed specification. | 2 / 3 |
Total | 7 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
dd731da
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.