databricks-model-serving

Deploy and query Databricks Model Serving endpoints. Use when (1) deploying MLflow models or AI agents to endpoints, (2) creating ChatAgent/ResponsesAgent agents, (3) integrating UC Functions or Vector Search tools, (4) querying deployed endpoints, (5) checking endpoint status. Covers classical ML models, custom pyfunc, and GenAI agents.

Quality

71%

Does it follow best practices?

Run evals on this skill

Adds up to 20 points to the overall score

View guide

Securityby

Passed

No findings from the security scan

Fix and improve this skill with Tessl

tessl review fix ./databricks-skills/databricks-model-serving/SKILL.md

Quality

Content

57%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is highly actionable with concrete code and MCP tooling, but it is hurt by redundancy, an oversized inline model table, and—most seriously—a broken reference structure where all nine cited bundle files are absent.

Suggestions

Create the references/ directory and add the nine referenced files (e.g. 1-classical-ml.md, 7-deployment.md, 9-package-requirements.md), or remove the dead reference links from the body.

Dedupe the querying examples: keep one canonical manage_serving_endpoint query example and reference it from the Quick Start and Common Workflows sections instead of repeating it three times.

Add an explicit validation feedback loop for deployment (e.g. poll manage_serving_endpoint action="get"; if NOT_READY after N attempts, surface the deployment job error and retry) to support the async job-based deploy path.

Dimension	Reasoning	Score
Conciseness	Core guidance is efficient, but a ~25-row foundation-model endpoint table and querying examples repeated three times (Quick Start, MCP Tools, Common Workflows) consume tokens that could be tightened or offloaded.	2 / 3
Actionability	Provides copy-paste-ready pip installs, MCP tool calls with parameters, executable code blocks, and a concrete troubleshooting table with fixes.	3 / 3
Workflow Clarity	A clear 7-step deploy sequence with polling guidance exists, but there is no explicit validate→fix→retry feedback loop for the async deployment, which the rubric expects for risky/batch operations.	2 / 3
Progressive Disclosure	The body references nine files (1-classical-ml.md through 9-package-requirements.md) but no references/ directory exists, so every reference is a dead link and detail stays trapped inline rather than split one level deep.	1 / 3
	Total	8 / 12 Passed

Description

85%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

A strong, well-scoped description that answers both what and when with concrete, numbered triggers. Trigger-term quality is slightly dragged down by product-jargon phrases mixed in with the natural user phrasing.

Dimension	Reasoning	Score
Specificity	Lists multiple concrete actions across five numbered cases ("deploying MLflow models or AI agents", "creating ChatAgent/ResponsesAgent agents", "integrating UC Functions or Vector Search tools", "querying deployed endpoints", "checking endpoint status") plus three model types.	3 / 3
Completeness	Clearly states what it does ("Deploy and query Databricks Model Serving endpoints") and when to use it via an explicit "Use when (1)...(5)..." trigger clause.	3 / 3
Trigger Term Quality	Includes natural terms ("deploy MLflow models", "query deployed endpoints", "check endpoint status") but leans on technical product jargon ("ResponsesAgent", "UC Functions", "Vector Search tools") that not every user would naturally voice.	2 / 3
Distinctiveness Conflict Risk	Scoped tightly to Databricks Model Serving with product-specific triggers, making it unlikely to fire for unrelated skills.	3 / 3
	Total	11 / 12 Passed

Validation

93%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 15 / 16 Passed

Validation for skill structure

Criteria	Description	Result
relative_links	Relative link issues: 15 missing, 5 suspicious	Warning

	Total	15 / 16 Passed

Repository: databricks-solutions/ai-dev-kit
Path: databricks-skills/databricks-model-serving/SKILL.md
Commit: a7e1d51

Reviewed: about 2 hours ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.