introspect

Analyze Claude Code session logs - extract thinking blocks, tool usage stats, error patterns, debug trajectories. Triggers on: introspect, session logs, trajectory, analyze sessions, what went wrong, tool usage, thinking blocks, session history, my reasoning, past sessions, what did I do.

1.08x

Quality

—

Does it follow best practices?

Impact

90%

1.08x

Average score across 3 eval scenarios

Securityby

Risky

Do not use without reviewing

Quality

Content

77%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

The body is highly actionable and clearly organized as executable jq/bash recipes, with no padding over concepts Claude already knows. Its main weaknesses are repetition that could be tightened and a long monolithic structure that would benefit from offloading detail into reference files.

Suggestions

Remove the per-snippet PROJECT/SESSION variable re-declarations in favor of a single 'Variables' setup block referenced once, and drop or trim the 'Quick Reference Commands' table that duplicates commands already shown in detail.

Move the 'Export Formats' and 'Advanced Analysis' sections into separate reference files (e.g. references/exports.md, references/advanced.md) linked from a concise overview, to apply progressive disclosure and shrink the main SKILL.md.

Factor the repeated 'select(.type == "assistant") | .message.content[]? | select(.type == "tool_use") | .name' tool-frequency pattern into a named jq filter or a small scripts/ helper to reduce duplication across the tool-stats and comparison sections.

Dimension	Reasoning	Score
Conciseness	The body is command-dense and assumes Claude's competence, but the repeated PROJECT/SESSION variable blocks in nearly every snippet and a 'Quick Reference Commands' table that restates already-shown commands are content that could be tightened, fitting the score-2 anchor over the 'every token earns its place' bar of score 3.	2 / 3
Actionability	Dozens of complete, copy-paste-ready jq/bash pipelines (e.g. 'select(.type == "assistant") \| .message.content[]? \| select(.type == "tool_use") \| .name \| sort \| uniq -c') with concrete example UUIDs/projects match the fully-executable score-3 anchor.	3 / 3
Workflow Clarity	Each analysis task is a clearly sequenced, self-contained recipe and the Usage Examples map natural queries to commands; operations are read-only log analysis with no destructive/batch steps requiring validation checkpoints, so the score-2 cap does not apply.	3 / 3
Progressive Disclosure	It is well-organized into clear sections (above the monolithic score-1 anchor), but at ~300 lines with no bundle files and no external references it keeps detail that could be split (export formats, advanced analysis) inline, falling short of the one-level-deep-reference structure expected at score 3.	2 / 3
	Total	10 / 12 Passed

Description

100%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

The description is specific, uses natural trigger phrasings, and explicitly covers both capability and invocation context in third person. It is a strong, low-conflict skill description.

Dimension	Reasoning	Score
Specificity	Lists multiple concrete actions in third person ('extract thinking blocks', 'tool usage stats', 'error patterns', 'debug trajectories'), matching the score-3 anchor rather than the partial coverage of score 2.	3 / 3
Completeness	Explicitly states what it does ('Analyze Claude Code session logs - extract...') and when via an explicit 'Triggers on:' clause, satisfying both halves required for score 3 rather than only implied timing at score 2.	3 / 3
Trigger Term Quality	'Triggers on:' provides broad natural-language phrasings a user would actually say ('what went wrong', 'my reasoning', 'past sessions', 'what did I do'), exceeding the partial-coverage bar of score 2.	3 / 3
Distinctiveness Conflict Risk	The Claude Code session-log niche with distinctive triggers ('session logs', 'thinking blocks', 'introspect') is unlikely to fire for unrelated skills, matching the clear-niche anchor.	3 / 3
	Total	12 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 16 / 16 Passed

Validation for skill structure

No warnings or errors.

Repository: NeverSight/skills_feed
Commit: 26d37ce

Reviewed: 5 days ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.