CtrlK
BlogDocsLog inGet started
Tessl Logo

backtesting-trading-strategies

Backtest crypto and traditional trading strategies against historical data. Calculates performance metrics (Sharpe, Sortino, max drawdown), generates equity curves, and optimizes strategy parameters. Use when user wants to test a trading strategy, validate signals, or compare approaches. Trigger with phrases like "backtest strategy", "test trading strategy", "historical performance", "simulate trades", "optimize parameters", or "validate signals".

71

Quality

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

SKILL.md
Quality
Evals
Security

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is highly actionable with a clear sequenced workflow, but it loses points on conciseness due to stray injected comments and on progressive disclosure because implementation.md is unreferenced and detail is kept inline. It is not level 1 because the guidance is concrete and well-structured, not vague or monolithic.

Suggestions

Remove the stray injected comments ('# 10000: 10 seconds in ms', '# HTTP 200 OK') from the code blocks — they waste tokens and the one after a line-continuation backslash would break execution.

Surface references/implementation.md from the body (e.g., link it from the Supported Strategies or Files section) so all bundle files are navigable, or move the inline strategy/metric detail there to better split overview from reference material.

Add a brief note in the Optimize step on how to validate/interpret grid-search results so the batch optimization step has a verification checkpoint.

DimensionReasoningScore

Conciseness

The body is mostly lean (tables and executable code, no over-explanation of concepts Claude knows), but stray injected comments such as '# 10000: 10 seconds in ms' and '# HTTP 200 OK' add tokens that do not earn their place, so it does not meet the 'every token earns its place' bar of level 3.

2 / 3

Actionability

Provides concrete, copy-paste-ready commands with specific flags and example param JSON — e.g. 'python ${CLAUDE_SKILL_DIR}/scripts/fetch_data.py --symbol BTC-USD --period 2y --interval 1d' and the optimize --param-grid example — fully executable rather than pseudocode.

3 / 3

Workflow Clarity

A clear numbered sequence — fetch data, run backtest, analyze saved results, optimize via grid search — each step with an executable command and described outputs; backtesting is a read-only, non-destructive analysis so the missing explicit validation checkpoint does not cap the score at 2.

3 / 3

Progressive Disclosure

errors.md and examples.md are clearly signaled one-level-deep references and both exist in the bundle, but references/implementation.md is an orphaned bundle file never linked from the body, and strategy/metric/output detail is kept inline rather than split out, so navigation and content splitting fall short of level 3.

2 / 3

Total

10

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

A strong, specific description that states concrete capabilities, gives explicit 'Use when...' trigger guidance with natural phrases, and occupies a distinct niche. It is not the level below because it answers both what and when explicitly rather than leaving the trigger implied.

DimensionReasoningScore

Specificity

Lists multiple concrete actions — 'Backtest crypto and traditional trading strategies', 'Calculates performance metrics (Sharpe, Sortino, max drawdown)', 'generates equity curves', and 'optimizes strategy parameters' — matching the level-3 anchor of multiple specific concrete actions.

3 / 3

Completeness

Explicitly answers both what ('Backtest... strategies', 'Calculates performance metrics', 'generates equity curves', 'optimizes... parameters') and when ('Use when user wants to test a trading strategy, validate signals, or compare approaches') with explicit triggers.

3 / 3

Trigger Term Quality

Natural user-facing phrases are provided — 'backtest strategy', 'test trading strategy', 'historical performance', 'simulate trades', 'optimize parameters', 'validate signals' — giving good coverage of terms a user would actually say.

3 / 3

Distinctiveness Conflict Risk

Backtesting trading strategies is a clear niche with distinct triggers unlikely to fire for unrelated skills; it is well-distinguishable from generic data or document skills.

3 / 3

Total

12

/

12

Passed

Validation

87%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation14 / 16 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

14

/

16

Passed

Repository
jeremylongshore/claude-code-plugins-plus-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.