CtrlK
BlogDocsLog inGet started
Tessl Logo

databricks-pipelines

Develop Lakeflow Spark Declarative Pipelines (formerly Delta Live Tables) on Databricks. Use when building batch or streaming data pipelines with Python or SQL. Invoke BEFORE starting implementation.

71

Quality

87%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Content

85%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a strong, well-structured skill that serves as an effective hub for Lakeflow Spark Declarative Pipelines development. Its decision tree, common traps, and comprehensive API reference tables provide genuinely novel guidance that Claude wouldn't know independently. The main weakness is verbosity in the API reference tables — the deprecated columns and extensive repetition of skill links across multiple tables could be condensed without losing clarity.

Suggestions

Consider condensing the API reference tables by removing or collapsing the deprecated columns into a single 'Migration Notes' section, since the Import/Module APIs table already covers the deprecated→current mapping.

DimensionReasoningScore

Conciseness

The skill is mostly efficient and contains genuinely useful decision trees, trap lists, and API tables that Claude wouldn't inherently know. However, the comprehensive API reference tables are extremely verbose with deprecated columns that could be condensed, and some explanations in Common Traps repeat information already conveyed in the decision tree. The sheer length (~300+ lines) includes some redundancy.

2 / 3

Actionability

The skill provides concrete, executable commands (bundle init with exact flags, bundle deploy/run commands, YAML config examples), specific API syntax for both Python and SQL, and a clear decision tree for choosing dataset types. The scaffolding section includes copy-paste ready bash commands and file templates.

3 / 3

Workflow Clarity

The development workflow is clearly sequenced (validate → deploy → run → check status) with an explicit warning that deployment must happen before running. The decision tree provides unambiguous sequencing for choosing dataset types. Destructive operations (full refresh) are explicitly flagged with warnings about data loss, and selective refresh is recommended as the safer default.

3 / 3

Progressive Disclosure

The skill is an excellent overview that points to well-organized, one-level-deep reference files for each feature (streaming tables, materialized views, auto loader, auto CDC, expectations, sinks, etc.). References are clearly signaled in both the API tables and the Pipeline API Reference section at the bottom, with links to both Python and SQL variants. The MANDATORY instruction to read linked references before implementing is a strong navigation signal.

3 / 3

Total

11

/

12

Passed

Description

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid skill description with excellent trigger term coverage (including the former product name), clear when-to-use guidance, and strong distinctiveness. Its main weakness is that it could be more specific about the concrete actions it enables — it describes the domain well but doesn't enumerate specific capabilities like defining streaming tables, setting data quality expectations, or configuring pipeline settings.

Suggestions

Add 2-3 specific concrete actions to improve specificity, e.g., 'Define streaming tables, materialized views, and data quality expectations using Lakeflow Spark Declarative Pipelines (formerly Delta Live Tables) on Databricks.'

DimensionReasoningScore

Specificity

The description names the domain (Lakeflow Spark Declarative Pipelines / Delta Live Tables on Databricks) and mentions batch/streaming data pipelines with Python or SQL, but does not list specific concrete actions like 'create tables', 'define expectations', 'configure materialized views', etc.

2 / 3

Completeness

Clearly answers both 'what' (develop Lakeflow Spark Declarative Pipelines on Databricks) and 'when' (when building batch or streaming data pipelines with Python or SQL, invoked before starting implementation). The explicit 'Use when...' and 'Invoke BEFORE...' clauses provide clear trigger guidance.

3 / 3

Trigger Term Quality

Includes strong natural trigger terms: 'Lakeflow', 'Spark Declarative Pipelines', 'Delta Live Tables', 'Databricks', 'batch', 'streaming', 'data pipelines', 'Python', 'SQL'. The inclusion of the former name 'Delta Live Tables' is especially helpful since many users will still use that term.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive — targets a very specific Databricks technology (Lakeflow Spark Declarative Pipelines / Delta Live Tables) which is unlikely to conflict with general data engineering or other pipeline skills. The combination of platform and technology creates a clear niche.

3 / 3

Total

11

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

10

/

11

Passed

Repository
databricks/devhub
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.