This skill should be used for persistent semantic memory in agent systems: cross-session knowledge retention, entity tracking, temporal validity, graph or vector retrieval, memory consolidation, and memory benchmark selection. Route file-backed scratchpads to filesystem-context, handoff summaries to context-compression, and token-efficiency tactics to context-optimization.
59
67%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Optimize this skill with Tessl
npx tessl skill review --optimize ./skills/memory-systems/SKILL.mdQuality
Discovery
85%Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.
This is a strong skill description that clearly defines its niche in persistent semantic memory and provides explicit routing guidance to disambiguate from related skills. Its main weakness is that the trigger terms lean heavily toward technical jargon rather than natural language a user might employ. The routing instructions are a notable strength for reducing inter-skill conflicts.
Suggestions
Add more natural-language trigger terms users might say, such as 'remember across sessions', 'store knowledge', 'long-term memory', 'recall previous context', or 'knowledge graph'.
| Dimension | Reasoning | Score |
|---|---|---|
Specificity | Lists multiple specific concrete actions: cross-session knowledge retention, entity tracking, temporal validity, graph or vector retrieval, memory consolidation, and memory benchmark selection. Also provides explicit routing guidance for related but distinct concerns. | 3 / 3 |
Completeness | Clearly answers both 'what' (persistent semantic memory with specific sub-capabilities listed) and 'when' (opens with 'This skill should be used for...' and includes explicit routing guidance for when NOT to use it, directing to alternative skills). The 'Use when' equivalent is explicit. | 3 / 3 |
Trigger Term Quality | Includes some relevant technical keywords like 'semantic memory', 'entity tracking', 'graph or vector retrieval', 'memory consolidation', but these are fairly specialized terms. Missing more natural user phrases like 'remember across sessions', 'store knowledge', 'recall previous conversations'. The routing terms (filesystem-context, context-compression, context-optimization) are internal skill names, not user-facing trigger terms. | 2 / 3 |
Distinctiveness Conflict Risk | Highly distinctive with a clear niche in persistent semantic memory for agent systems. The explicit routing instructions to differentiate from filesystem-context, context-compression, and context-optimization skills directly address potential overlap and reduce conflict risk. | 3 / 3 |
Total | 11 / 12 Passed |
Implementation
50%Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.
This is a comprehensive memory system design skill that covers the topic thoroughly with useful framework comparisons, decision tables, and code examples. Its main weaknesses are verbosity (repeated 'start simple' messaging, explanatory prose Claude doesn't need, placeholder benchmark values), and a lack of concrete validation checkpoints in its workflows. The content would benefit from trimming redundant guidance, replacing placeholder benchmark values with either real numbers or removing the table, and adding explicit verification steps.
Suggestions
Remove the introductory paragraph explaining what memory is and the repeated 'start simple' advice that appears in at least four sections — consolidate into one clear statement in Practical Guidance
Either populate the benchmark table with actual numbers and citations or remove it entirely — placeholder values like 'Published high score' waste tokens without providing actionable information
Add explicit validation checkpoints to the escalation workflow, e.g., 'After step 2, run retrieval quality tests against your LoCoMo subset to confirm semantic search meets accuracy thresholds before proceeding to graph-based memory'
Move the detailed framework comparison table and benchmark table into a referenced file (e.g., ./references/frameworks.md) to reduce the main skill's length and improve progressive disclosure
| Dimension | Reasoning | Score |
|---|---|---|
Conciseness | The skill is reasonably well-organized but includes unnecessary verbosity in several places: the introductory paragraph explains what memory is to Claude, the 'When to Activate' section is lengthy, benchmark tables use vague placeholders like 'Published high score' that add tokens without actionable value, and some guidance is repeated across sections (e.g., 'start simple' appears in Core Concepts, Practical Guidance, Guidelines, and Gotchas). | 2 / 3 |
Actionability | The skill provides three concrete code examples (Mem0, temporal query, Cognee) which are helpful, but the benchmark table contains placeholder values rather than actual numbers, the framework comparison is descriptive rather than prescriptive with concrete integration steps, and the consolidation section defers to a reference file for 'working consolidation code' rather than providing inline examples. The decision tables are useful but much of the guidance remains at the advisory level rather than executable. | 2 / 3 |
Workflow Clarity | The escalation path (Prototype → Scale → Complex reasoning → Full control) provides a clear sequence, and the error recovery section has ordered fallback strategies. However, there are no explicit validation checkpoints — for example, no step says 'verify retrieval quality before proceeding' or 'run this test to confirm memory is working.' The consolidation workflow lacks concrete trigger thresholds or validation steps. | 2 / 3 |
Progressive Disclosure | The skill references an implementation file (./references/implementation.md) and related skills with clear 'Read when' annotations, which is good practice. However, no bundle files were provided to verify these references exist, the main SKILL.md is quite long (~300+ lines) with substantial inline content that could be split into reference files (e.g., the full benchmark tables, the detailed framework comparison), and the framework landscape section is essentially an inline reference document. | 2 / 3 |
Total | 8 / 12 Passed |
Validation
100%Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.
Validation — 11 / 11 Passed
Validation for skill structure
No warnings or errors.
25e1fa7
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.