CtrlK
BlogDocsLog inGet started
Tessl Logo

arbiter

Push decisions to Arbiter Zebu for async human review. Use when you need human input on plans, architectural choices, or approval before proceeding.

79

Quality

76%

Does it follow best practices?

Impact

Pending

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

Optimize this skill with Tessl

npx tessl skill review --optimize ./public/skills/5hanth/arbiter/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Discovery

75%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a functional description that clearly communicates its purpose and when to use it. The explicit 'Use when...' clause with specific trigger scenarios is a strength. However, it could benefit from more specific action verbs and additional natural trigger terms users might employ when needing human review.

Suggestions

Add more natural trigger terms users might say, such as 'review', 'feedback', 'sign-off', 'checkpoint', or 'decision point'

Expand the specific actions beyond 'push decisions' to include what happens in the workflow (e.g., 'submits decisions, tracks review status, retrieves approvals')

DimensionReasoningScore

Specificity

Names the domain (async human review) and some actions ('Push decisions', 'human input on plans, architectural choices, or approval'), but doesn't list multiple concrete specific actions like what format decisions take or what happens after review.

2 / 3

Completeness

Clearly answers both what ('Push decisions to Arbiter Zebu for async human review') and when ('Use when you need human input on plans, architectural choices, or approval before proceeding') with explicit trigger guidance.

3 / 3

Trigger Term Quality

Includes some relevant keywords ('human input', 'plans', 'architectural choices', 'approval'), but missing common variations users might say like 'review', 'feedback', 'sign-off', 'decision point', or 'checkpoint'.

2 / 3

Distinctiveness Conflict Risk

The specific tool name 'Arbiter Zebu' and the async human review niche make this highly distinctive. The combination of 'async', 'human review', and 'approval before proceeding' creates a clear, unique trigger profile unlikely to conflict with other skills.

3 / 3

Total

10

/

12

Passed

Implementation

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is a well-documented, highly actionable skill with excellent concrete examples and clear workflows. The main weakness is length - installation instructions, environment variables, file locations, and troubleshooting could be moved to separate reference files to improve token efficiency. The core tool documentation is strong and immediately usable.

Suggestions

Move installation instructions to a separate INSTALL.md and reference it with a one-liner in the main skill

Move file locations, environment variables, and troubleshooting tables to a REFERENCE.md file

Consider condensing the 'When to Use' section to just the positive cases, as Claude can infer when NOT to use it

DimensionReasoningScore

Conciseness

The skill is reasonably efficient but includes some unnecessary content like the full installation section (3 methods), environment variables table, and file locations that could be in a separate reference. The 'When to Use' section is helpful but slightly verbose.

2 / 3

Actionability

Excellent executable examples with complete CLI commands, JSON structures, and bash scripts. Every tool has concrete, copy-paste ready examples with expected return values clearly documented.

3 / 3

Workflow Clarity

Clear multi-step workflows shown in examples (push → check status → get answers). Example 2 demonstrates proper conditional flow with status checking before proceeding. The heartbeat integration provides a feedback loop for notifications.

3 / 3

Progressive Disclosure

Content is well-organized with clear sections, but the skill is monolithic - installation details, troubleshooting, and detailed API reference could be split into separate files. The 'See Also' links are good but the main file is too long.

2 / 3

Total

10

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

metadata_version

'metadata.version' is missing

Warning

metadata_field

'metadata' should map string keys to string values

Warning

Total

9

/

11

Passed

Repository
Demerzels-lab/elsamultiskillagent
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.