Capture and organize business meeting notes into durable decisions, action items, and takeaways.
98
100%
Does it follow best practices?
Impact
93%
1.20xAverage score across 3 eval scenarios
Passed
No known issues
{
"context": "Tests whether the agent correctly distinguishes durable content (incidents with root causes and fixes, architectural decisions, recurring patterns, compliance-driven constraints) from transient content (short-deadline action items, logistical facts, short-shelf-life items) when deciding what to save to institutional memory.",
"type": "weighted_checklist",
"checklist": [
{
"name": "Root cause captured",
"description": "The Redis connection pool root cause (pool size 5, exhaustion above 800 req/s, cascading timeouts) IS saved",
"max_score": 10
},
{
"name": "Fix captured",
"description": "The deployed fix (pool size bumped to 50 with adaptive scaling) IS saved alongside or as part of the incident record",
"max_score": 8
},
{
"name": "Recurring pattern captured",
"description": "The observation that this was the third Redis incident in four months and the case for a standard Redis config baseline IS saved",
"max_score": 12
},
{
"name": "Architectural decision captured",
"description": "The commitment to migrate billing and reporting monolith modules to separate services by Q3 IS saved",
"max_score": 10
},
{
"name": "Compliance driver captured",
"description": "The compliance isolation requirement driving the billing data migration IS saved (not just the migration target but the reason)",
"max_score": 8
},
{
"name": "Canary prerequisite captured",
"description": "The canary release prerequisite blocking the billing module from going live IS saved",
"max_score": 8
},
{
"name": "Reporting architecture captured",
"description": "The reporting service architecture (read-only, replicates from billing DB via CDC) IS saved",
"max_score": 8
},
{
"name": "Short-deadline actions excluded",
"description": "Named action items with near-term deadlines ('Tom to open Jira ticket this week', 'Anika to schedule follow-up by Thursday') are NOT saved as persistent entries",
"max_score": 13
},
{
"name": "Product/schedule items excluded",
"description": "Short-shelf-life product/schedule items (Raj's board deck due Friday, sprint ends Wednesday, 42-point velocity) are NOT saved",
"max_score": 13
},
{
"name": "Trivial facts excluded",
"description": "Obvious recoverable facts (coffee machine fixed, facilities ticket closed) are NOT saved",
"max_score": 10
}
]
}