Deploy, evaluate, fine-tune, and manage Foundry agents end-to-end: Docker build, ACR push, hosted/prompt agent create, container start, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, agent.yaml, dataset curation from traces, model fine-tuning (SFT/DPO/RFT). USE FOR: deploy agent, hosted agent, create agent, add tool to agent, invoke agent, evaluate agent, continuous eval, continuous monitoring, optimize prompt, improve prompt, optimize agent instructions, deploy model, Foundry project, RBAC, role assignment, permissions, quota, capacity, region, troubleshoot agent, deployment failure, AI Services, create Foundry resource, provision, knowledge index, agent monitoring, customize deployment, onboard, availability, fine-tune, SFT, DPO, RFT, training-data, grader, distillation, fine-tuned model, large file upload. DO NOT USE FOR: Azure Functions, App Service, general Azure deploy (use azure-deploy), general Azure prep (use azure-prepare).
61
71%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Optimize this skill with Tessl
npx tessl skill review --optimize ./plugin/skills/microsoft-foundry/SKILL.mdQuality
Discovery
100%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong, well-crafted skill description that excels across all dimensions. It provides highly specific capabilities, extensive natural trigger terms via the 'USE FOR' block, clear 'what' and 'when' guidance, and explicit conflict avoidance with the 'DO NOT USE FOR' clause. The only minor concern is that the description is quite dense and long, but the information density is justified given the breadth of the skill.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: Docker build, ACR push, hosted/prompt agent create, container start, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, dataset curation from traces, model fine-tuning (SFT/DPO/RFT). These are highly specific and actionable. | 3 / 3 |
Completeness | Clearly answers 'what' (deploy, evaluate, fine-tune, manage Foundry agents with specific sub-tasks) and 'when' via an explicit 'USE FOR:' clause with extensive trigger terms. Also includes a 'DO NOT USE FOR:' clause that further clarifies boundaries, which is excellent for routing. | 3 / 3 |
Trigger Term Quality | Excellent coverage of natural terms users would say: 'deploy agent', 'create agent', 'evaluate agent', 'optimize prompt', 'fine-tune', 'troubleshoot agent', 'deployment failure', 'RBAC', 'permissions', 'knowledge index', etc. Covers many variations and synonyms a user might naturally use. | 3 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a clear niche around Foundry agents specifically. The explicit 'DO NOT USE FOR' clause with redirects to other skills (azure-deploy, azure-prepare) directly addresses potential conflicts and makes routing unambiguous. | 3 / 3 |
Total | 12 / 12 Passed |
Implementation
42%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This skill functions well as a routing hub with excellent progressive disclosure across many sub-skills, but the main document itself is excessively verbose — particularly the 7-step 'Common Project Context Resolution' section, which reads like an internal specification rather than concise operational guidance. The skill would benefit significantly from moving the detailed resolution logic into a referenced sub-document and tightening the main skill to focus on routing and essential conventions.
Suggestions
Extract the 'Common Project Context Resolution' (Steps 1-7) into a separate referenced document (e.g., references/context-resolution.md) and replace it with a 5-10 line summary in the main skill, dramatically reducing token usage.
Add explicit validation checkpoints to the context resolution workflow, e.g., 'After Step 2, confirm resolved values with the user before proceeding' or 'If azd env get-values returns empty, stop and troubleshoot.'
Replace verbose prose descriptions of metadata migration rules (testSuites→evaluationSuites, legacy testCases normalization) with a compact table or move them to a migration reference document.
Add at least one concrete executable example (e.g., an actual azd command or MCP tool invocation with parameters) to the main skill to improve actionability.
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is extremely verbose with extensive metadata resolution logic, environment bootstrapping steps, and configuration layering that reads more like an internal design document than actionable guidance. Much of the content (e.g., legacy migration rules for testSuites→evaluationSuites, detailed metadata overlay write rules) could be in referenced sub-documents. The 'Common Project Context Resolution' section alone is massive and explains procedural logic Claude could follow from a much more compact specification. | 1 / 3 |
Actionability | The skill provides concrete routing tables, specific MCP tool names, azd variable names, and file paths, which is useful. However, it lacks executable code examples or copy-paste-ready commands — it's mostly procedural prose describing what to do rather than showing it. The sub-skill references delegate actual execution details, but the main skill itself is light on directly executable guidance. | 2 / 3 |
Workflow Clarity | The Agent Development Lifecycle table provides clear sequencing of sub-skills, and the 7-step context resolution process is well-ordered. However, there are no explicit validation checkpoints or feedback loops — for example, after resolving metadata or azd context, there's no 'verify these values are correct before proceeding' step. The context resolution steps describe what to do but lack error recovery or validation gates. | 2 / 3 |
Progressive Disclosure | The skill excels at progressive disclosure with a clear routing table of 16+ sub-skills, each with a one-line description and direct file reference. The main skill serves as an index/router, delegating detailed workflows to sub-skill documents. References are one level deep and clearly signaled with both descriptions and file paths. | 3 / 3 |
Total | 8 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
915f809
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.