CtrlK
BlogDocsLog inGet started
Tessl Logo

microsoft-foundry

Deploy, evaluate, fine-tune, and manage Foundry agents end-to-end: Docker build, ACR push, hosted/prompt agent create, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, agent.yaml, dataset curation from traces, model fine-tuning (SFT/DPO/RFT). USE FOR: deploy agent, hosted agent, create agent, add tool to agent, invoke agent, evaluate agent, continuous eval, continuous monitoring, optimize prompt, improve prompt, optimize agent instructions, agent optimizer, deploy model, Foundry project, RBAC, role assignment, permissions, quota, capacity, region, troubleshoot agent, deployment failure, AI Services, create Foundry resource, provision, knowledge index, customize deployment, onboard, availability, fine-tune, SFT, DPO, RFT, training-data, grader, distillation, fine-tuned model, large file upload. DO NOT USE FOR: Azure Functions, App Service, general Azure deploy (use azure-deploy), general Azure prep (use azure-prepare).

59

Quality

67%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugin/skills/microsoft-foundry/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that comprehensively covers specific capabilities, provides extensive natural trigger terms, clearly answers both what and when, and explicitly disambiguates from related skills. The 'USE FOR' and 'DO NOT USE FOR' clauses are particularly effective for skill selection. The only minor concern is that the description is quite dense, but the information density is justified given the breadth of the skill.

DimensionReasoningScore

Specificity

The description lists numerous specific concrete actions: Docker build, ACR push, hosted/prompt agent create, batch eval, continuous eval, prompt optimizer, Agent Optimizer scaffold, dataset curation from traces, model fine-tuning (SFT/DPO/RFT), and more. This goes well beyond naming a domain and provides comprehensive action coverage.

3 / 3

Completeness

The description clearly answers 'what' (deploy, evaluate, fine-tune, manage Foundry agents with specific sub-capabilities) and 'when' (explicit 'USE FOR' clause with detailed trigger scenarios). It also includes a 'DO NOT USE FOR' clause that further clarifies boundaries, which is excellent for disambiguation.

3 / 3

Trigger Term Quality

The 'USE FOR' section provides extensive natural trigger terms users would actually say: 'deploy agent', 'create agent', 'evaluate agent', 'optimize prompt', 'fine-tune', 'troubleshoot agent', 'deployment failure', etc. It also includes acronyms (SFT, DPO, RFT, RBAC) and variations that cover many user phrasings.

3 / 3

Distinctiveness Conflict Risk

The description is highly specific to Foundry agents and explicitly disambiguates from related skills with the 'DO NOT USE FOR' clause referencing azure-deploy and azure-prepare. This clear niche and explicit exclusions make it very unlikely to conflict with other skills.

3 / 3

Total

12

/

12

Passed

Implementation

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This skill functions well as a routing hub with comprehensive sub-skill references and clear intent-to-workflow mapping tables. However, it suffers from significant verbosity — the 7-step context resolution process and metadata management rules are overly detailed for a top-level skill file and should be extracted into reference documents. The lack of executable examples and validation checkpoints weakens actionability and workflow clarity.

Suggestions

Extract Steps 1-7 ('Common Project Context Resolution') into a separate reference file (e.g., references/context-resolution.md) and replace with a 3-5 line summary linking to it — this alone would cut the skill by ~60%.

Add a concrete executable example showing a typical MCP tool invocation pattern (e.g., the mandatory `foundry` discovery call) so Claude has a copy-paste-ready starting point.

Move the '.foundry Workspace Standard' section and metadata overlay rules to a reference file, keeping only a 2-line summary with a link in the main skill.

Add explicit validation checkpoints to the context resolution workflow, e.g., 'After Step 2, verify project endpoint is reachable before proceeding' with specific tool calls to validate.

DimensionReasoningScore

Conciseness

The skill is extremely verbose at ~250+ lines. It includes extensive metadata resolution logic (Steps 1-7) that reads more like an internal design document than actionable guidance. Much of the content (environment resolution, metadata overlay rules, legacy migration logic) could be dramatically condensed or moved to reference files. Claude doesn't need paragraphs explaining priority-to-tag mapping rules inline.

1 / 3

Actionability

The routing tables and sub-skill references are concrete and useful, and the tool usage conventions provide some actionable guidance. However, the skill lacks executable code examples or specific MCP tool invocation patterns. Steps 1-7 describe resolution logic in prose rather than providing concrete commands or code snippets Claude can execute directly.

2 / 3

Workflow Clarity

The Agent Development Lifecycle table provides clear sequencing of sub-skills, and Steps 1-7 outline a resolution process. However, validation checkpoints are largely absent — there's no explicit 'verify this succeeded before proceeding' pattern. The resolution steps describe complex conditional logic but lack error recovery or feedback loops for when resolution fails or conflicts arise beyond 'stop and ask.'

2 / 3

Progressive Disclosure

The skill correctly delegates to 16+ sub-skill documents via a well-organized table with clear routing descriptions. However, the main SKILL.md itself contains far too much inline content — the entire 7-step 'Common Project Context Resolution' section and the '.foundry Workspace Standard' section should be in separate reference files. The balance between overview and detail is off, with the main file acting as both router and detailed specification.

2 / 3

Total

7

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
microsoft/github-copilot-for-azure
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.