Use when work needs to be handed off to another agent or another human. Produce a continuation-ready brief with the objective, completed work, assumptions, unresolved issues, and next action instead of a generic summary. Good triggers include "prepare a handoff", "make this resumable", and "summarize this for another agent".
92
100%
Does it follow best practices?
Impact
89%
1.41xAverage score across 8 eval scenarios
Passed
No known issues
{
"context": "The agent is asked to produce a handoff document for a Python 2-to-3 migration that is partially complete. This scenario tests whether the agent uses the prescribed six-section format, includes a Critical References section, and states a single concrete next action.",
"type": "weighted_checklist",
"checklist": [
{
"name": "Objective section present",
"description": "The handoff document contains a section explicitly labelled 'Objective' (or equivalent heading).",
"max_score": 8
},
{
"name": "Completed section present",
"description": "The handoff document contains a section explicitly labelled 'Completed' (or equivalent heading).",
"max_score": 8
},
{
"name": "Assumptions section present",
"description": "The handoff document contains a section explicitly labelled 'Assumptions' (or equivalent heading).",
"max_score": 8
},
{
"name": "Unresolved section present",
"description": "The handoff document contains a section explicitly labelled 'Unresolved' (or equivalent heading).",
"max_score": 8
},
{
"name": "Next Action section present",
"description": "The handoff document contains a section explicitly labelled 'Next Action' (or equivalent heading).",
"max_score": 8
},
{
"name": "Critical References section present",
"description": "The handoff document contains a section explicitly labelled 'Critical References' (or equivalent heading) that lists at least one file, command, or path.",
"max_score": 10
},
{
"name": "Next action is concrete",
"description": "The Next Action section contains a specific, actionable step (e.g. a file to edit, a command to run) rather than a vague directive.",
"max_score": 10
},
{
"name": "References are minimal",
"description": "Critical References contains only the minimum references needed for continuity — does NOT include every file or detail mentioned in the task description.",
"max_score": 10
},
{
"name": "No prose narrative",
"description": "The document does NOT contain long prose paragraphs narrating what happened; information is structured under the section headings rather than written as a story.",
"max_score": 10
},
{
"name": "Unresolved contains open questions",
"description": "The Unresolved section lists outstanding decisions or questions (e.g. the 'six' shim decision), not just remaining tasks.",
"max_score": 10
},
{
"name": "Assumptions section non-empty",
"description": "The Assumptions section contains at least one stated assumption (not just a placeholder).",
"max_score": 10
}
]
}evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
skills
compact-handoff