Configure Azure API Management as an AI Gateway for AI models, MCP tools, and agents. WHEN: semantic caching, token limit, content safety, load balancing, AI model governance, MCP rate limiting, jailbreak detection, add Azure OpenAI backend, add AI Foundry model, test AI gateway, LLM policies, configure AI backend, token metrics, AI cost control, convert API to MCP, import OpenAPI to gateway.
Install with Tessl CLI
npx tessl i github:microsoft/github-copilot-for-azure --skill azure-aigateway95
Does it follow best practices?
Evaluation — 95%
↑ 1.72xAgent success when using this skill
Validation for skill structure
Full AI governance policy composition
Auth before cache
0%
100%
Cache before token limit
100%
100%
Token limit before content policy
100%
100%
Metrics last in inbound
0%
100%
Correct auth resource
100%
100%
Cache lookup attributes
100%
100%
Cache store in outbound
100%
100%
Token metric dimensions
100%
100%
Token limit has counter-key
100%
100%
Streaming incompatibility noted
100%
100%
Backend timeout set
0%
100%
estimate-prompt-tokens enabled
100%
100%
Without context: $0.4936 · 2m 45s · 18 turns · 22 in / 9,302 out tokens
With context: $0.5051 · 1m 50s · 18 turns · 2,950 in / 6,534 out tokens
Content safety and jailbreak protection policies
Uses llm-content-safety
0%
100%
All four categories present
30%
100%
False-positive thresholds raised
44%
100%
On-error jailbreak handler
10%
100%
Blocked content returns 400
100%
100%
Rate limit by agent ID
90%
100%
Rate limit includes retry header
0%
100%
Content safety in inbound
100%
100%
Auth policy present
37%
100%
Policy order respected
30%
100%
Threshold reasoning in notes
100%
100%
Without context: $0.2029 · 1m 36s · 9 turns · 14 in / 4,652 out tokens
With context: $0.4424 · 1m 38s · 22 turns · 817 in / 5,111 out tokens
Multi-region load balancing and multi-tenant governance
Hash-based backend selection
0%
0%
Retry on 429
100%
100%
Retry cycles to next backend
100%
100%
Per-tenant token limit
0%
100%
Token limit value per tenant
0%
100%
Tenant dimension in metrics
22%
100%
Managed identity auth
100%
100%
Correct RBAC role in script
0%
88%
Backend create commands
100%
100%
estimate-prompt-tokens set
0%
100%
Backend timeout elevated
0%
100%
Policy ordering correct
33%
50%
Without context: $0.4017 · 2m 14s · 11 turns · 14 in / 9,083 out tokens
With context: $0.8018 · 3m 55s · 19 turns · 2,951 in / 13,974 out tokens
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.