CtrlK
BlogDocsLog inGet started
Tessl Logo

backtesting-trading-strategies

Backtest crypto and traditional trading strategies against historical data. Calculates performance metrics (Sharpe, Sortino, max drawdown), generates equity curves, and optimizes strategy parameters. Use when user wants to test a trading strategy, validate signals, or compare approaches. Trigger with phrases like "backtest strategy", "test trading strategy", "historical performance", "simulate trades", "optimize parameters", or "validate signals".

57

Quality

67%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Passed

No known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./plugins/crypto/trading-strategy-backtester/skills/backtesting-trading-strategies/SKILL.md
SKILL.md
Quality
Evals
Security

Quality

Content

35%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The skill attempts a comprehensive backtesting framework but suffers from significant verbosity—metric description tables and example output blocks explain concepts Claude already knows. The referenced scripts and files don't exist in the bundle, making the actionable commands non-executable. Nonsensical inline comments ('# 10000: 10 seconds in ms', '# HTTP 200 OK') suggest quality issues in the content generation.

Suggestions

Remove the three metric description tables (Performance, Risk, Trade Statistics) and the example output block—Claude already knows what Sharpe ratio and max drawdown are. This would cut ~50 lines.

Provide the actual script files (backtest.py, fetch_data.py, optimize.py, strategies.py, metrics.py) in the bundle so the CLI commands are executable rather than referencing non-existent code.

Add validation checkpoints: verify data was fetched successfully before backtesting, sanity-check results (e.g., warn if Sharpe > 5 suggesting overfitting), and validate parameter grid inputs.

Remove the nonsensical inline comments ('# 10000: 10 seconds in ms', '# HTTP 200 OK') which appear to be artifacts and add confusion.

DimensionReasoningScore

Conciseness

The content is excessively verbose with large tables explaining metrics Claude already knows (Sharpe ratio, max drawdown, VaR, etc.), a full example output block, and descriptions of basic concepts. The metric description tables alone consume ~30 lines that add no value for Claude. The inline comments like '# 10000: 10 seconds in ms' and '# HTTP 200 OK' appear to be hallucinated/nonsensical artifacts that add confusion.

1 / 3

Actionability

The skill provides concrete CLI commands with specific flags and parameters, which is good. However, no bundle files are provided, meaning none of the referenced scripts (backtest.py, fetch_data.py, optimize.py, etc.) actually exist, making the commands non-executable. The commands reference a framework that doesn't exist in the bundle.

2 / 3

Workflow Clarity

Steps are numbered and sequenced (fetch data → run backtest → analyze results → optimize), but there are no validation checkpoints or error recovery steps. For a financial backtesting workflow where bad data or misconfigured parameters could produce misleading results, there should be data validation steps and result sanity checks.

2 / 3

Progressive Disclosure

The skill references external files (errors.md, examples.md) and has a clear file structure table, which is good progressive disclosure design. However, no bundle files are provided, so all references point to non-existent files. The main SKILL.md also includes too much inline content (metric tables, example output) that should be in reference files.

2 / 3

Total

7

/

12

Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is an excellent skill description that clearly defines its domain (trading strategy backtesting), lists specific capabilities (performance metrics, equity curves, parameter optimization), and provides explicit trigger guidance with natural user phrases. It uses proper third-person voice throughout and covers both crypto and traditional trading contexts, making it comprehensive and distinctive.

DimensionReasoningScore

Specificity

Lists multiple specific concrete actions: backtesting strategies, calculating performance metrics (Sharpe, Sortino, max drawdown), generating equity curves, and optimizing strategy parameters. These are precise, domain-specific capabilities.

3 / 3

Completeness

Clearly answers both 'what' (backtest strategies, calculate metrics, generate equity curves, optimize parameters) and 'when' (explicit 'Use when' clause plus a 'Trigger with phrases like' section listing specific trigger phrases).

3 / 3

Trigger Term Quality

Excellent coverage of natural trigger terms including 'backtest strategy', 'test trading strategy', 'historical performance', 'simulate trades', 'optimize parameters', 'validate signals'. Also includes domain terms like 'crypto', 'trading strategies', 'Sharpe', 'Sortino', 'max drawdown', and 'equity curves' that users in this domain would naturally use.

3 / 3

Distinctiveness Conflict Risk

Highly distinctive niche combining trading strategy backtesting, specific financial metrics (Sharpe, Sortino, max drawdown), and historical simulation. Unlikely to conflict with other skills due to the very specific domain of quantitative trading backtesting.

3 / 3

Total

12

/

12

Passed

Validation

81%

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation9 / 11 Passed

Validation for skill structure

CriteriaDescriptionResult

allowed_tools_field

'allowed-tools' contains unusual tool name(s)

Warning

frontmatter_unknown_keys

Unknown frontmatter key(s) found; consider removing or moving to metadata

Warning

Total

9

/

11

Passed

Repository
jeremylongshore/claude-code-plugins-plus-skills
Reviewed

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.