CtrlK
BlogDocsLog inGet started
Tessl Logo

phoenix-server

Backend development guide for the Phoenix AI observability platform (Strawberry GraphQL, SQLAlchemy async, FastAPI). Use this skill when writing or modifying Python server code in the phoenix repo — adding mutations, types, migrations, or tests. Trigger on any backend task touching src/phoenix/server/, src/phoenix/db/, or tests/unit/server/.

67

Quality

82%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Content

64%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-organized backend development overview that efficiently maps the codebase structure and routes developers to appropriate references. Its strengths are conciseness, clear directory layout, and the critical security rule about mutations vs queries. Its weaknesses are the lack of concrete executable examples in the SKILL.md itself and the absence of explicit multi-step workflows with validation checkpoints for common tasks like adding a mutation or running a migration.

Suggestions

Add a brief concrete workflow example for the most common task (e.g., adding a mutation: create file → register in __init__.py → make graphql → make test-python → verify), with explicit validation checkpoints.

Consider moving the Naming and Docstrings sections to a separate style-guide reference file to keep SKILL.md focused on actionable development workflows.

Include at least one minimal executable code snippet (e.g., a skeleton mutation class) so the SKILL.md is independently useful without requiring reference file access.

DimensionReasoningScore

Conciseness

The content is lean and well-organized. It assumes Claude's competence with Python, SQLAlchemy, Strawberry, and FastAPI without explaining what they are. Every section earns its place — directory layout, command reference, task routing table, and hard rules are all high-signal content.

3 / 3

Actionability

The skill provides concrete commands (make dev-backend, uv run pytest, etc.) and a clear directory map, but the actual implementation guidance is deferred entirely to reference files. The SKILL.md itself doesn't contain executable code examples for any of the core tasks (adding a mutation, writing a test, etc.), so a developer would need to follow references for any concrete guidance.

2 / 3

Workflow Clarity

The task routing table clearly directs users to the right reference for each task type, and the hard rules section highlights a critical security constraint. However, there are no explicit multi-step workflows with validation checkpoints in the SKILL.md itself — no 'add mutation → regenerate schema → run tests → verify' sequence, which is important for operations like adding migrations or modifying the GraphQL schema.

2 / 3

Progressive Disclosure

The skill has a well-structured overview with a clear task-to-reference routing table pointing to four reference files. However, since no bundle files were provided, we cannot verify these references exist or assess their quality. The structure is good in principle but the references are unverifiable, and some content like the Docstrings and Naming sections feel like they could be in a separate style guide reference rather than inline.

2 / 3

Total

9

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly identifies the technology stack, specific actions, and precise trigger conditions including directory paths. It uses third person voice appropriately and provides both 'what' and 'when' guidance with explicit trigger clauses. The description is concise yet comprehensive, making it easy for Claude to select this skill accurately.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: 'adding mutations, types, migrations, or tests' and names specific technologies (Strawberry GraphQL, SQLAlchemy async, FastAPI). Also references specific directory paths.

3 / 3

Completeness

Clearly answers both 'what' (backend development guide for Phoenix AI observability platform with specific technologies) and 'when' (explicit 'Use this skill when...' and 'Trigger on...' clauses with specific paths and task types).

3 / 3

Trigger Term Quality

Includes natural keywords a developer would use: 'Python server code', 'phoenix repo', 'mutations', 'types', 'migrations', 'tests', 'backend', plus specific directory paths that serve as precise triggers. These are terms developers would naturally reference.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive with specific repo name (phoenix), specific directory paths (src/phoenix/server/, src/phoenix/db/), and specific technology stack (Strawberry GraphQL, SQLAlchemy async, FastAPI). Very unlikely to conflict with other skills.

3 / 3

Total

12

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

metadata_version

'metadata.version' is missing

Warning

metadata_field

'metadata' should map string keys to string values

Warning

Total

9

/

11

Passed

Repository
Arize-ai/phoenix
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.