firecrawl-load-scale

Load test and scale Firecrawl scraping pipelines with concurrency control and batching. Use when testing scraping throughput, planning capacity for large crawl jobs, or optimizing concurrent scrape performance. Trigger with phrases like "firecrawl load test", "firecrawl scale", "firecrawl throughput", "firecrawl capacity", "firecrawl concurrent".

Quality

—

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Advisory

Suggest reviewing before use

Quality

Content

80%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is lean and highly actionable with executable code and useful Firecrawl-specific data, but the multi-step load/batch workflow lacks inline validation checkpoints and all content lives in one file with no progressive disclosure into bundle references.

Suggestions

Add an explicit validation/feedback loop to the workflow — e.g., after measuring baseline throughput, check for 429s and, if present, reduce concurrency and re-run before proceeding — so batch/load operations have a validate → fix → retry checkpoint.

Split the detailed per-technique code (queue-based scraping, async crawl parallelism, capacity planning) into reference files under references/ and point to them from a concise overview, enabling a score 3 on progressive disclosure.

Tighten the reporting boilerplate (repeated console.log report blocks in measureThroughput and estimateCapacity) or move it into an examples file to further improve token efficiency.

Dimension	Reasoning	Score
Conciseness	Lean and free of concept-explanation padding; it conveys only Firecrawl-specific knowledge Claude lacks (per-plan rate limits, method names like batchScrapeUrls/asyncCrawlUrl) plus executable code, with each code block earning its place rather than the verbose score 1 style.	3 / 3
Actionability	All five steps provide complete, copy-paste-ready TypeScript calling real Firecrawl APIs (scrapeUrl, batchScrapeUrls, asyncCrawlUrl, checkCrawlStatus) and p-queue, with a runnable "Quick Load Test" example, matching the fully-executable anchor.	3 / 3
Workflow Clarity	Steps 1–5 are clearly sequenced (measure baseline → batch → queue → async crawls → capacity), but there are no embedded validation/feedback checkpoints within the batch and load-testing workflow; the rubric caps batch-operation workflows at 2 when validation loops are missing, and the separate Error Handling table is reference material rather than inline checkpoints.	2 / 3
Progressive Disclosure	Sections are clearly organized (Overview, Rate Limits, Instructions, Error Handling, Examples, Resources, Next Steps), but the skill is a monolithic ~220-line single file with no bundle files and no content split into one-level-deep references, fitting the score 2 anchor where content that could be separate stays inline.	2 / 3
	Total	10 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific, complete, and distinctive, with explicit what/when guidance and natural trigger phrases. It is held back only by minor formatting noise (stray blank lines and trailing whitespace inside the YAML string), which does not affect any rubric dimension.

Dimension	Reasoning	Score
Specificity	Names multiple concrete actions — "Load test and scale Firecrawl scraping pipelines with concurrency control and batching" plus "testing scraping throughput, planning capacity," and "optimizing concurrent scrape performance" — matching the multi-action anchor rather than the single-domain score 2.	3 / 3
Completeness	Answers both what ("Load test and scale Firecrawl scraping pipelines…") and when ("Use when testing scraping throughput, planning capacity for large crawl jobs, or optimizing concurrent scrape performance") with explicit triggers, matching the score 3 anchor exactly.	3 / 3
Trigger Term Quality	Explicitly lists natural user phrases ("firecrawl load test", "firecrawl scale", "firecrawl throughput", "firecrawl capacity", "firecrawl concurrent") giving good coverage of terms a user would actually say; not merely jargon like the score 1 anchor.	3 / 3
Distinctiveness Conflict Risk	A clearly scoped Firecrawl-only load/scale niche with dedicated "firecrawl …" triggers is unlikely to fire for unrelated skills, satisfying the clear-niche anchor rather than the overlapping score 2.	3 / 3
	Total	12 / 12 Passed

Validation

87%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 14 / 16 Passed

Validation for skill structure

Criteria	Description	Result
allowed_tools_field	'allowed-tools' contains unusual tool name(s)	Warning
frontmatter_unknown_keys	Unknown frontmatter key(s) found; consider removing or moving to metadata	Warning

	Total	14 / 16 Passed

Repository: jeremylongshore/claude-code-plugins-plus-skills
Commit: 6e3a65f

Reviewed: 4 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.