CtrlK
BlogDocsLog inGet started
Tessl Logo

long-running-server

Enable long-running background task support with LongRunningAgentServer. Use when: (1) Agent tasks may exceed HTTP timeout (~120s), (2) User wants background/async execution, (3) User says 'long running', 'background tasks', or 'async agent'.

65

Quality

77%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./agent-migration-from-model-serving/.claude/skills/long-running-server/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured, highly actionable skill with complete executable code for both SDK variants. Its main weaknesses are verbosity from inlining two full SDK implementations and the lack of explicit validation/verification steps in the workflow. The troubleshooting table and constructor reference are valuable additions.

Suggestions

Add explicit validation checkpoints after key steps, e.g., 'Start the server locally and test: curl -X POST localhost:8000/responses -d {"background": true, ...}' to verify the setup works before deploying.

Consider moving the two SDK-specific implementations (OpenAI vs LangGraph) into separate referenced files to reduce the main skill's length and improve progressive disclosure.

Trim the LakebaseConfig boilerplate — a brief description of the required env vars and a one-line note like 'Create a frozen dataclass reading these env vars' would suffice given Claude's ability to generate such code.

DimensionReasoningScore

Conciseness

The skill is fairly long (~200+ lines) with two full SDK variants (OpenAI and LangGraph) shown inline, which adds significant bulk. The table explaining request patterns and the constructor reference table are useful but could be more compact. Some sections like the LakebaseConfig dataclass are boilerplate that Claude could generate from a brief description.

2 / 3

Actionability

The skill provides fully executable, copy-paste-ready code for every step — complete Python files for start_server.py, utility functions, YAML config snippets, and .env examples. Both SDK variants have concrete, runnable implementations with proper imports and module structure.

3 / 3

Workflow Clarity

The 7-step sequence is clearly numbered and logically ordered, but there are no explicit validation checkpoints or feedback loops. For a multi-step process involving database configuration and deployment, there should be verification steps (e.g., 'test the background endpoint', 'verify Lakebase tables were created'). The troubleshooting table partially compensates but doesn't substitute for inline validation.

2 / 3

Progressive Disclosure

The skill appropriately references the lakebase-setup skill for prerequisite steps and defers some configuration details, which is good. However, the two full SDK variants (OpenAI and LangGraph) are shown inline, making the document very long. These could be split into separate referenced files. No bundle files are provided to support progressive disclosure.

2 / 3

Total

9

/

12

Passed

Description

89%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a solid description with excellent completeness and trigger term coverage, clearly specifying both what the skill does and when to use it. Its main weakness is that the 'what' portion could be more specific about the concrete actions or capabilities provided beyond just 'enabling support'. The distinctiveness is strong due to the specialized domain.

Suggestions

Add specific concrete actions to improve specificity, e.g., 'Queue tasks, poll execution status, retrieve results for agent tasks that exceed HTTP timeout limits.'

DimensionReasoningScore

Specificity

Names the domain ('long-running background task support') and mentions a specific component ('LongRunningAgentServer'), but doesn't list multiple concrete actions—it describes a capability area rather than specific operations like 'queue tasks', 'poll status', 'retrieve results'.

2 / 3

Completeness

Clearly answers both 'what' (enable long-running background task support with LongRunningAgentServer) and 'when' with explicit numbered trigger conditions including timeout scenarios and specific user phrases.

3 / 3

Trigger Term Quality

Includes natural keywords users would say: 'long running', 'background tasks', 'async agent', 'HTTP timeout', and 'background/async execution'. These cover common variations of how a user would describe this need.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive niche—'LongRunningAgentServer', 'HTTP timeout', 'background tasks', and 'async agent' are specific enough to avoid conflicts with other skills. Unlikely to be confused with general task or server skills.

3 / 3

Total

11

/

12

Passed

Validation

100%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository
databricks/app-templates
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.