Tests memory writes, confirms read-back accuracy, and validates retrieval success to ensure saved information can actually be recovered. Use when you need to verify memory was saved correctly, check if stored data can be retrieved, confirm a memory entry is discoverable, or escalate when saved information appears lost or corrupted. Covers write confirmation, read-back comparison, retrieval smoke testing, and failure escalation. Includes explicit untrusted-content/prompt-injection guardrails for third-party inputs.
92
Quality
90%
Does it follow best practices?
Impact
97%
1.19xAverage score across 5 eval scenarios
{
"context": "Evaluates whether the agent produced outputs aligned with the skill's reliability-check workflow.",
"type": "weighted_checklist",
"checklist": [
{
"name": "Roundtrip steps",
"description": "Defines write, read-back, and retrieval smoke-test flow",
"max_score": 25
},
{
"name": "Metadata requirements",
"description": "Specifies required metadata fields for entries",
"max_score": 25
},
{
"name": "Failure escalation",
"description": "Maps mismatch/retrieval failures to operational or critical paths",
"max_score": 25
},
{
"name": "Checksum validation",
"description": "Includes concrete checksum/hash validation example",
"max_score": 25
}
]
}