CtrlK
BlogDocsLog inGet started
Tessl Logo

microsoft-foundry

Deploy, evaluate, fine-tune, and manage Foundry agents end-to-end: Docker build, ACR push, hosted/prompt agent create, container start, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, agent.yaml, dataset curation from traces, model fine-tuning (SFT/DPO/RFT). USE FOR: deploy agent, hosted agent, create agent, add tool to agent, invoke agent, evaluate agent, continuous eval, continuous monitoring, optimize prompt, improve prompt, optimize agent instructions, deploy model, Foundry project, RBAC, role assignment, permissions, quota, capacity, region, troubleshoot agent, deployment failure, AI Services, create Foundry resource, provision, knowledge index, agent monitoring, customize deployment, onboard, availability, fine-tune, SFT, DPO, RFT, training-data, grader, distillation, fine-tuned model, large file upload. DO NOT USE FOR: Azure Functions, App Service, general Azure deploy (use azure-deploy), general Azure prep (use azure-prepare).

61

Quality

71%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugin/skills/microsoft-foundry/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong, well-crafted skill description that excels across all dimensions. It provides highly specific capabilities, extensive natural trigger terms via the 'USE FOR' block, clear 'what' and 'when' guidance, and explicit conflict avoidance with the 'DO NOT USE FOR' clause. The only minor concern is that the description is quite dense and long, but the information density is justified given the breadth of the skill.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: Docker build, ACR push, hosted/prompt agent create, container start, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, dataset curation from traces, model fine-tuning (SFT/DPO/RFT). These are highly specific and actionable.

3 / 3

Completeness

Clearly answers 'what' (deploy, evaluate, fine-tune, manage Foundry agents with specific sub-tasks) and 'when' via an explicit 'USE FOR:' clause with extensive trigger terms. Also includes a 'DO NOT USE FOR:' clause that further clarifies boundaries, which is excellent for routing.

3 / 3

Trigger Term Quality

Excellent coverage of natural terms users would say: 'deploy agent', 'create agent', 'evaluate agent', 'optimize prompt', 'fine-tune', 'troubleshoot agent', 'deployment failure', 'RBAC', 'permissions', 'knowledge index', etc. Covers many variations and synonyms a user might naturally use.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive with a clear niche around Foundry agents specifically. The explicit 'DO NOT USE FOR' clause with redirects to other skills (azure-deploy, azure-prepare) directly addresses potential conflicts and makes routing unambiguous.

3 / 3

Total

12

/

12

Passed

Implementation

42%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill functions well as a routing hub with excellent progressive disclosure across many sub-skills, but the main document itself is excessively verbose — particularly the 7-step 'Common Project Context Resolution' section, which reads like an internal specification rather than concise operational guidance. The skill would benefit significantly from moving the detailed resolution logic into a referenced sub-document and tightening the main skill to focus on routing and essential conventions.

Suggestions

Extract the 'Common Project Context Resolution' (Steps 1-7) into a separate referenced document (e.g., references/context-resolution.md) and replace it with a 5-10 line summary in the main skill, dramatically reducing token usage.

Add explicit validation checkpoints to the context resolution workflow, e.g., 'After Step 2, confirm resolved values with the user before proceeding' or 'If azd env get-values returns empty, stop and troubleshoot.'

Replace verbose prose descriptions of metadata migration rules (testSuites→evaluationSuites, legacy testCases normalization) with a compact table or move them to a migration reference document.

Add at least one concrete executable example (e.g., an actual azd command or MCP tool invocation with parameters) to the main skill to improve actionability.

DimensionReasoningScore

Conciseness

The skill is extremely verbose with extensive metadata resolution logic, environment bootstrapping steps, and configuration layering that reads more like an internal design document than actionable guidance. Much of the content (e.g., legacy migration rules for testSuites→evaluationSuites, detailed metadata overlay write rules) could be in referenced sub-documents. The 'Common Project Context Resolution' section alone is massive and explains procedural logic Claude could follow from a much more compact specification.

1 / 3

Actionability

The skill provides concrete routing tables, specific MCP tool names, azd variable names, and file paths, which is useful. However, it lacks executable code examples or copy-paste-ready commands — it's mostly procedural prose describing what to do rather than showing it. The sub-skill references delegate actual execution details, but the main skill itself is light on directly executable guidance.

2 / 3

Workflow Clarity

The Agent Development Lifecycle table provides clear sequencing of sub-skills, and the 7-step context resolution process is well-ordered. However, there are no explicit validation checkpoints or feedback loops — for example, after resolving metadata or azd context, there's no 'verify these values are correct before proceeding' step. The context resolution steps describe what to do but lack error recovery or validation gates.

2 / 3

Progressive Disclosure

The skill excels at progressive disclosure with a clear routing table of 16+ sub-skills, each with a one-line description and direct file reference. The main skill serves as an index/router, delegating detailed workflows to sub-skill documents. References are one level deep and clearly signaled with both descriptions and file paths.

3 / 3

Total

8

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
microsoft/github-copilot-for-azure
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.