Latest AI models reference - Claude, OpenAI, Gemini, Eleven Labs, Replicate
Quality: 43% (Does it follow best practices?)
Impact: Pending (No eval scenarios have been run)
Passed: No known issues
Optimize this skill with Tessl:

`npx tessl skill review --optimize ./skills/ai-models/SKILL.md`

Quality
Discovery
22%
Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This description is extremely minimal and lacks both concrete actions and usage triggers. It reads more like a title or tag list than a functional skill description. The brand names provide some keyword value, but without explaining what the skill actually does or when to use it, Claude would struggle to select it appropriately.
Suggestions
- Add concrete actions describing what the skill does, e.g., 'Provides up-to-date model names, capabilities, pricing, and API details for Claude, OpenAI, Gemini, Eleven Labs, and Replicate models.'
- Add an explicit 'Use when...' clause, e.g., 'Use when the user asks about latest AI model versions, model comparisons, API references, or needs current model names and specifications.'
- Include natural trigger terms users would say, such as 'model comparison', 'latest model', 'API reference', 'model pricing', 'GPT-4', 'Claude 4', 'Gemini 2.5', to improve selection accuracy.
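Combining these suggestions, the skill's frontmatter might look like the following sketch (the `name` value and exact key layout are assumptions, not taken from the skill itself):

```yaml
---
name: ai-models
description: >-
  Provides up-to-date model names, capabilities, pricing, and API details for
  Claude, OpenAI, Gemini, Eleven Labs, and Replicate models. Use when the user
  asks about latest AI model versions, model comparisons, API references, or
  needs current model names and specifications (e.g., 'latest model',
  'model pricing', 'GPT-4', 'Claude 4', 'Gemini 2.5').
---
```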
| Dimension | Reasoning | Score |
|---|---|---|
| Specificity | The description names some AI model providers but describes no concrete actions. There are no verbs indicating what the skill does; it only says 'reference', which is extremely vague. | 1 / 3 |
| Completeness | The 'what' is barely addressed: 'reference' is not a clear capability description. There is no 'when' clause or any explicit trigger guidance whatsoever. | 1 / 3 |
| Trigger Term Quality | It includes some relevant brand names (Claude, OpenAI, Gemini, Eleven Labs, Replicate) that users might mention, but lacks natural trigger phrases like 'model comparison', 'which model', 'API pricing', or 'model capabilities' that would indicate when to use this skill. | 2 / 3 |
| Distinctiveness / Conflict Risk | The listing of specific AI provider names gives it some distinctiveness, but 'reference' is so vague it could overlap with any skill involving AI models, APIs, or documentation lookups. | 2 / 3 |
| Total | | 6 / 12 (Passed) |
Implementation
64%
Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a well-structured reference document with excellent actionability—every provider has concrete, executable code examples and specific model IDs. Its main weaknesses are redundancy across three different model recommendation sections (top matrix, per-provider selection, bottom quick reference) and being monolithic when it could benefit from splitting provider details into separate files. The time-sensitive nature of model versions is acknowledged with the 'Last Updated' header and update checklist, which is good practice.
Suggestions
- Split per-provider details into separate files (e.g., ANTHROPIC.md, OPENAI.md) and keep SKILL.md as the selection matrix and quick reference, with links to the provider files.
- Consolidate the three overlapping model recommendation sections (top matrix, per-provider model selection trees, bottom 'Best For Each Task') into a single authoritative recommendation section to reduce redundancy.
- Add a concrete validation step to the Model Update Checklist, such as a curl command to verify a model ID is valid against each provider's API.
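As a sketch of that validation step, something like the following could be added to the checklist. The endpoints shown are the providers' public model-lookup routes; `$MODEL_ID` is a placeholder for the ID being verified, and the API keys are assumed to be set in the environment.

```shell
# Sketch: check that a model ID still resolves before publishing an update.
# A 200 means the ID is valid; a 404 suggests it was renamed or deprecated.

# Anthropic
curl -s -o /dev/null -w "%{http_code}\n" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  "https://api.anthropic.com/v1/models/$MODEL_ID"

# OpenAI
curl -s -o /dev/null -w "%{http_code}\n" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  "https://api.openai.com/v1/models/$MODEL_ID"
```

Running this for every ID listed in the skill would catch stale model names before they reach agents.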
| Dimension | Reasoning | Score |
|---|---|---|
| Conciseness | The skill is a reference document, so length is somewhat justified, but there is redundancy between the model selection matrix at the top, the per-provider model selection sections, and the 'Best For Each Task' quick reference at the bottom. Some sections, like Stability AI, add marginal value. The tree-format model selection blocks repeat information already in the code constants. | 2 / 3 |
| Actionability | Every provider includes fully executable TypeScript code examples with proper imports, API initialization, and realistic usage patterns. Model ID strings are concrete and copy-paste ready, environment variable templates are provided, and pricing is specific. | 3 / 3 |
| Workflow Clarity | This is primarily a reference skill rather than a multi-step workflow, so the bar is lower. The 'Model Update Checklist' at the end provides a useful workflow but lacks validation steps (e.g., how to verify model IDs are valid, or what to do if a model is deprecated). The model selection guidance is clear but could benefit from a decision tree or explicit if/then logic. | 2 / 3 |
| Progressive Disclosure | The content is a monolithic ~500-line file that could benefit from splitting provider details into separate files (e.g., ANTHROPIC.md, OPENAI.md), with the main file serving as the selection matrix and quick reference. The mention of 'Load with: base.md + llm-patterns.md' hints at a multi-file system, but the main content doesn't leverage it for its own organization. | 2 / 3 |
| Total | | 9 / 12 (Passed) |
Validation
81%
Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 9 / 11 Passed
Validation for skill structure
| Criteria | Description | Result |
|---|---|---|
| skill_md_line_count | SKILL.md is long (685 lines); consider splitting into references/ and linking | Warning |
| frontmatter_unknown_keys | Unknown frontmatter key(s) found; consider removing or moving to metadata | Warning |
| Total | | 9 / 11 (Passed) |