CtrlK
BlogDocsLog inGet started
Tessl Logo

phoenix-client-development

Guide for the phoenix-client TypeScript package — experiment lifecycle, tracer provider management, and test conventions.

68

1.23x
Quality

52%

Does it follow best practices?

Impact

100%

1.23x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./js/packages/phoenix-client/.agents/skills/phoenix-client-development/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

32%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description identifies a specific package and lists topic areas but lacks concrete actions (verbs) and has no 'Use when...' clause to guide skill selection. It reads more like a document title than a functional skill description, making it difficult for Claude to know precisely when to invoke this skill versus other TypeScript or testing-related skills.

Suggestions

Add an explicit 'Use when...' clause, e.g., 'Use when the user asks about phoenix-client setup, running experiments, configuring tracer providers, or writing tests for Phoenix integrations.'

Replace high-level category names with concrete actions, e.g., 'Creates and runs experiments, configures and shuts down tracer providers, and writes tests following phoenix-client conventions.'

Include natural trigger terms users might say, such as 'Phoenix', 'Arize', 'tracing', 'LLM observability', 'experiment tracking', or 'phoenix-client npm package'.

DimensionReasoningScore

Specificity

Names the domain ('phoenix-client TypeScript package') and some areas ('experiment lifecycle, tracer provider management, test conventions'), but these are high-level categories rather than concrete actions. No specific verbs describing what actions are performed.

2 / 3

Completeness

Describes 'what' at a high level (guide for phoenix-client covering certain topics) but completely lacks a 'Use when...' clause or any explicit trigger guidance for when Claude should select this skill. Per rubric guidelines, missing 'Use when' caps completeness at 2, and the 'what' is also weak, so this scores 1.

1 / 3

Trigger Term Quality

Includes relevant terms like 'phoenix-client', 'TypeScript', 'experiment lifecycle', 'tracer provider', and 'test conventions', but misses common variations users might say (e.g., 'Phoenix', 'tracing', 'observability', 'LLM experiments', 'Arize'). The terms are somewhat technical but relevant to the domain.

2 / 3

Distinctiveness Conflict Risk

The mention of 'phoenix-client TypeScript package' is fairly specific and narrows the domain, but 'test conventions' and 'tracer provider management' could overlap with general TypeScript testing or OpenTelemetry tracing skills.

2 / 3

Total

7

/

12

Passed

Implementation

72%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-structured routing/overview skill that excels at conciseness and progressive disclosure. Its main weakness is that it provides very little actionable guidance on its own — nearly all substance is delegated to rule files. The lack of any workflow description or validation steps for the complex operations mentioned (experiment lifecycle, tracer management) is a gap.

Suggestions

Add a brief high-level workflow or sequence overview (e.g., experiment lifecycle steps) so the SKILL.md provides some standalone actionable guidance beyond routing to rule files.

Consider adding a minimal concrete example (e.g., a quick-start code snippet for a common task like running an experiment) to improve actionability at the top level.

DimensionReasoningScore

Conciseness

Very lean and efficient. Every line serves a purpose — no explanation of what TypeScript is, what Phoenix does in detail, or how vitest works. The one-liner 'Read existing code in the directory you're working in before writing new code' is a useful behavioral directive without padding.

3 / 3

Actionability

The build/test command is concrete and executable, and the rule file table gives clear routing. However, the skill itself delegates almost all substantive guidance to rule files (experiments.md, tracing.md, testing.md), so the SKILL.md alone provides limited actionable instruction beyond running tests.

2 / 3

Workflow Clarity

There's no explicit multi-step workflow or validation checkpoints described. The skill is essentially a routing document pointing to rule files. For a skill that covers experiment lifecycle and tracer provider management, some sequencing or workflow guidance in the overview would be expected.

2 / 3

Progressive Disclosure

Excellent progressive disclosure structure: a concise overview with a well-organized table clearly signaling when to read each rule file. References are one level deep and clearly labeled with their trigger conditions.

3 / 3

Total

10

/

12

Passed

Validation

90%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation10 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

metadata_field

'metadata' should map string keys to string values

Warning

Total

10

/

11

Passed

Repository
Arize-ai/phoenix
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.