Run a safe-to-fail experiment for Complex domain problems where cause-and-effect is only visible in retrospect.
88
88%
Does it follow best practices?
Impact
Pending
No eval scenarios have been run
Passed
No known issues
Probe result: {refuted|surprise} What was learned: {what the probe eliminated or revealed} Why brainstorm: {need fresh hypotheses — old ones exhausted}
From: Complex (probe) → To: Complex (brainstorm) No domain shift — staying in Complex. Probe refuted hypothesis; returning to divergent exploration with new information.
Token guidance: target 300 tokens inline. For depth, use references — point to thinking artifact files rather than embedding full content. Soft cap: 600 tokens inline per handoff. If you need more, move detail to a knowledge file and reference it. Accumulated cap: 800 tokens across a chain — compress to 200 at cap (keep: decisions, constraints, rejected paths). References to thinking files do NOT count toward the cap.