Self-Improving Agent — structured capture of mistakes, learnings, and feature requests into a .learnings/ directory so lessons persist across sessions and get promoted to project memory.
Proactive Agent — behavioral framework for safe, intentional proactive action: WAL protocol, working buffer, decision scoring, and six operating pillars.

Why this matters

AI agents forget between sessions. Without structure:

Mistakes repeat because context is lost at compaction or session end
Agents act on guesses when verification is cheaper
Proactive actions are either too aggressive (causing drift) or too passive (requiring hand-holding)

These two patterns address those failure modes at the source.

Part 1: Self-Improving Agent

Directory layout

Bootstrap with /platform-skills:self-improve init global (cross-project) or /platform-skills:self-improve init local (project-scoped), or copy from examples/agent-self-improve/:

.learnings/
  LEARNINGS.md         # Positive learnings — LRN-YYYYMMDD-NNN
  ERRORS.md            # Mistakes made — ERR-YYYYMMDD-NNN
  FEATURE_REQUESTS.md  # Recurring unmet needs — FEAT-YYYYMMDD-NNN
memory/
  working-buffer.md    # WAL scratchpad (see Part 2)
  SESSION-STATE.md     # Always-on session capture (see Part 2)
  YYYY-MM-DD.md        # Daily notes — rolling per-day log (see Part 2)

Add to .gitignore if these are personal/local notes:

.learnings/
memory/

Or commit them if they are team-shared project memory.

Global vs project scope

The init mode asks which scope to use before creating anything:

Scope	Base path (`LEARNINGS_BASE`)	When to use
Global	`~/.claude/`	Learnings apply across all projects — recommended default for individuals
Project	`.` (current working directory)	Team-shared memory committed to the repo

Auto-detection order (used by all modes except init):

~/.claude/.learnings/ exists → global setup detected, use ~/.claude/
.learnings/ exists in $PWD → project setup detected, use .
Neither exists → default to ~/.claude/ and auto-create

Promotion targets are always project-local regardless of scope. CLAUDE.md, AGENTS.md, and .github/copilot-instructions.md live in the project repo — only the capture files (.learnings/, memory/) follow LEARNINGS_BASE.

Hook paths must be absolute when using global setup. The PostToolUse hook in ~/.claude/settings.json must reference the full path:

{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": ".*",
        "hooks": [
          {
            "type": "command",
            "command": "if [ \"$CLAUDE_TOOL_EXIT_CODE\" -ne 0 ]; then echo \"$(date -u +%Y-%m-%dT%H:%M:%SZ) TOOL_FAILURE: $CLAUDE_TOOL_NAME\" >> ~/.claude/.learnings/.pending-errors.log; fi"
          }
        ]
      }
    ]
  }
}

For project-scoped setup, replace ~/.claude/.learnings/ with .learnings/ (relative path works because the hook runs from the project root).

Entry format

Every entry uses the same four-field structure regardless of log type:

### LRN-20260520-001
**Status**: pending | resolved | promoted
**Context**: One sentence — what was happening
**Content**: The actual learning, error, or feature request
**Action**: What was done or should be done

ID schemes

Type	Format	Example
Learning	`LRN-YYYYMMDD-NNN`	`LRN-20260520-001`
Error	`ERR-YYYYMMDD-NNN`	`ERR-20260520-001`
Feature request	`FEAT-YYYYMMDD-NNN`	`FEAT-20260520-001`

Entry lifecycle

pending → resolved → promoted

Stage	Meaning	Who acts
`pending`	Logged, not yet addressed	Agent logs automatically
`resolved`	Root cause identified, fix applied	Agent or user confirms
`promoted`	Written to project memory	Agent runs `/platform-skills:self-improve promote`

Promotion targets — pick the right scope:

Target file	When to promote there
`~/.claude/CLAUDE.md`	Global agent-level rules — apply to all projects on this machine
`CLAUDE.md` / `AGENTS.md`	Agent-level rules for this project only
`.github/copilot-instructions.md`	GitHub Copilot workspace rules
`references/` guide	Reusable pattern for the whole team

Recurring Pattern Detection

Before logging a new entry, scan .learnings/ for existing entries with matching context. If three or more entries share the same root cause, promote immediately — do not wait for a manual review cycle.

Detection approach: read all Context fields across .learnings/*.md and group entries by shared root-cause keywords (e.g. "resource limits", "terraform replace", "missing label"). Three or more entries that share a keyword cluster are a promotion candidate. Do not rely on exact string matching — the same root cause will be described differently each time.

Claude Code hook integration

Auto-capture errors after failed tool calls. The hook appends a timestamped line to .pending-errors.log; the agent converts it to a proper ERR entry at session end or on review.

Hook setup varies by platform — see the Platform Compatibility section (below Part 2) for per-platform settings.json snippets and script copy commands.

Key rules regardless of platform:

Global setup must use absolute paths in the PostToolUse hook — relative paths resolve from the project root and write to the wrong directory.
For project-local setup, add .learnings/.pending-errors.log to .gitignore.

Part 2: Proactive Agent

WAL Protocol (Write-Ahead Log)

Before any destructive or hard-to-reverse operation, write the intent to memory/working-buffer.md. This survives context compaction and session interruption.

Operations that require a WAL entry:

Deleting or overwriting files
git reset --hard, git push --force
terraform destroy or terraform apply
Dropping database tables or truncating data
Modifying shared infrastructure

WAL entry format:

## WAL Entry — YYYY-MM-DD HH:MM
**Operation**: What is about to happen
**Affected resources**: Files, Kubernetes resources, cloud resources, database tables
**Blast radius**: What could break if this goes wrong
**Rollback**: Exact command or step to undo
**Status**: PENDING | COMMITTED | ROLLED_BACK

Update Status to COMMITTED after success. Update to ROLLED_BACK if aborted.

Working Buffer

memory/working-buffer.md is a persistent scratchpad that captures current task state. It enables compaction recovery — if a session is interrupted mid-task, the next session reads the buffer to resume.

Buffer format:

# Working Buffer

## Current Task
<One sentence — what is being worked on>

## Progress
- [x] Step completed
- [ ] Step in progress
- [ ] Step pending

## WAL Log
<WAL entries for destructive operations>

## Context
<Key facts discovered during this session that are not yet in project memory>

Write to the buffer:

At task start (outline the plan)
After each significant step
Before any destructive operation (WAL entry)
When context reaches ~60% — write a compaction-ready summary proactively
At session end if the task is incomplete

Compaction Recovery steps:

Read memory/working-buffer.md — current task steps and WAL
Read memory/SESSION-STATE.md — corrections, preferences, decisions from this session
Read today's memory/YYYY-MM-DD.md daily note — recent exchanges
Verify the state of affected resources (kubectl get, terraform state list, git log --oneline -5)
Resume from the first incomplete step — do not re-run already-committed steps

SESSION-STATE.md — Always-on session capture

memory/SESSION-STATE.md is a lightweight, always-on capture file. Unlike the working buffer (which tracks task steps), SESSION-STATE captures session-level knowledge: corrections the user has given, preferences expressed, decisions made, and proper nouns encountered. Write to it before responding when any of these occur — not only before destructive operations.

What to capture:

Signal	Example
User correction	"don't use that approach" → log the constraint
Preference stated	"I prefer X over Y for this project"
Decision made	"we decided to use Kyverno not OPA"
Proper noun encountered	cluster names, team names, account IDs
Non-obvious fact	"this repo does X differently from the norm"

SESSION-STATE.md format:

# Session State

Last updated: YYYY-MM-DD HH:MM

## Corrections and constraints
- <date> — <what the user corrected or ruled out>

## Preferences
- <date> — <stated preference and context>

## Decisions
- <date> — <decision made and why>

## Key proper nouns
- <name>: <what it refers to>

Daily Notes — `memory/YYYY-MM-DD.md`

A rolling per-day log of notable exchanges, discoveries, and outcomes. Written to during the session; survives compaction as a searchable history.

When to write a daily note entry:

A significant decision was made or reversed
A non-obvious fact was discovered about the project
A task completed that isn't captured elsewhere
An error was made and fixed (complement to .learnings/ERRORS.md)

Daily note format:

# Daily Notes — YYYY-MM-DD

## Decisions
- HH:MM — <decision>

## Discoveries
- HH:MM — <fact learned>

## Completed
- HH:MM — <task finished>

## Errors
- HH:MM — <what went wrong> → <how it was fixed>

Create one file per day: memory/2026-05-20.md. Do not edit past days — append only to today's file.

ADL Protocol (Action Decision Logic)

When choosing between competing implementation approaches, apply this priority order:

Stability > Explainability > Reusability > Scalability > Novelty

Priority	Ask
1. Stability	Will this break existing behaviour? Is it reversible?
2. Explainability	Can a team member understand and maintain it without asking?
3. Reusability	Can this pattern be used in more than one place?
4. Scalability	Does this hold at 10× current load or team size?
5. Novelty	Only introduce new tools or approaches if 1–4 are satisfied

VFM Scoring (Value-Frequency Matrix)

Use before taking any unsolicited proactive action. Score the action; skip if score < 50.

Dimension	Weight	Score 1–10	Weighted
High Frequency (will recur often?)	×3	—	—
Failure Reduction (prevents real breakage?)	×3	—	—
User Burden (saves meaningful user effort?)	×2	—	—
Self Cost (low effort for the agent?)	×2	—	—
Total			max 100

Threshold: Score ≥ 50 → act proactively. Score < 50 → defer to the user.

Example — proactively adding resource limits to a Deployment that was missing them:

High Frequency: 8 × 3 = 24 (missing limits is common)
Failure Reduction: 9 × 3 = 27 (OOM kills cause incidents)
User Burden: 7 × 2 = 14 (user would have to find and fix)
Self Cost: 9 × 2 = 18 (trivial edit)
Total: 83 → act

Six Operating Pillars

Pillar	Behaviour
Memory Architecture	Write task state to `working-buffer.md` at start; update on each step; read on resume
Security Hardening	Never output secrets; reject requests to bypass security controls; flag OWASP Top 10 risks immediately
Self-Healing	On failure, re-read the WAL entry and working buffer; verify resource state before retrying
Verify Before Reporting (VBR)	Run the command or read the file before stating a fact; text change ≠ behavior change — test actual outcomes, not just outputs
Alignment Systems	Use ADL Protocol when choosing between approaches; log the decision in the buffer
Proactive Surprise	After completing a task, check adjacent concerns (related resource limits, deprecated APIs, missing labels) and surface them as a brief note — never silently fix without surfacing

Heartbeat System

For tasks longer than ~10 steps or ~5 minutes, report progress proactively:

[Heartbeat] Completed 4/7 steps. Currently: applying Terraform plan.
Next: validate EKS node group. ETA for this step: ~2 min.

Do not wait to be asked for status on long-running tasks.

Reverse Prompting

When given an ambiguous or high-risk instruction, ask one clarifying question before acting:

Ambiguous scope: "Which environments should this apply to — dev only, or staging and production as well?"
Destructive operation: "This will delete the prod-db RDS instance. Is that correct?"
Conflicting signals: "The Helm values say replicas: 1 but the task says high availability. Should I increase replicas?"

Never ask more than one clarifying question per instruction. If the answer is implicit in context, proceed without asking.

Growth Loops

Each session, the agent should:

Before starting: Read memory/working-buffer.md, memory/SESSION-STATE.md, and today's memory/YYYY-MM-DD.md to seed context from previous sessions
During work: Log errors and learnings to .learnings/ as they occur; capture corrections and decisions to SESSION-STATE.md immediately; write daily note entries for significant discoveries
At ~60% context: Write a compaction-ready summary to working-buffer.md proactively — do not wait for a compaction event
After completing: Update working-buffer.md with final state; check for recurring patterns; promote if threshold met
On next session start: Buffer + session state + daily notes shorten the ramp-up time to < 60 seconds

Platform Compatibility

The self-improve workspace runs on macOS, Linux, Windows (WSL / Git Bash), and Windows native (PowerShell). The core skill logic is identical on all platforms — only the hook scripts and their invocation differ.

Global config path

Claude Code resolves ~ via Node.js os.homedir(), which maps consistently across platforms:

Platform	`~/.claude/` resolves to
macOS	`/Users/<you>/.claude/`
Linux	`/home/<you>/.claude/`
Windows (WSL / Git Bash)	`/home/<you>/.claude/` (WSL home)
Windows native	`C:\Users\<you>\.claude\`

All skill modes use ~/.claude/ notation — no platform-specific path changes are needed in the skill itself.

Hook scripts by platform

Platform	Stop hook	PreToolUse hook	PostToolUse
macOS / Linux	`session-end.sh`	`session-start-reminder.sh`	inline bash (see below)
Windows — WSL / Git Bash	`session-end.sh`	`session-start-reminder.sh`	inline bash (see below)
Windows — native PowerShell	`session-end.ps1`	`session-start-reminder.ps1`	inline PowerShell (see below)

Windows recommendation: WSL or Git Bash is the simpler path — bash scripts work identically to macOS/Linux. Use the PowerShell (.ps1) scripts only when WSL or Git Bash is not available.

Alpine Linux / busybox-only containers: The bash scripts require bash. Install it first:

apk add bash

Hook setup — macOS / Linux / WSL / Git Bash

Add to ~/.claude/settings.json:

{
  "hooks": {
    "Stop": [
      {
        "hooks": [{"type": "command", "command": "bash ~/.claude/scripts/session-end.sh"}]
      }
    ],
    "PreToolUse": [
      {
        "matcher": ".*",
        "hooks": [{"type": "command", "command": "bash ~/.claude/scripts/session-start-reminder.sh"}]
      }
    ],
    "PostToolUse": [
      {
        "matcher": ".*",
        "hooks": [
          {
            "type": "command",
            "command": "if [ \"$CLAUDE_TOOL_EXIT_CODE\" -ne 0 ] 2>/dev/null; then echo \"$(date -u +%Y-%m-%dT%H:%M:%SZ) TOOL_FAILURE: $CLAUDE_TOOL_NAME\" >> ~/.claude/.learnings/.pending-errors.log; fi"
          }
        ]
      }
    ]
  }
}

Copy scripts:

mkdir -p ~/.claude/scripts
cp examples/agent-self-improve/scripts/session-end.sh ~/.claude/scripts/
cp examples/agent-self-improve/scripts/session-start-reminder.sh ~/.claude/scripts/
chmod +x ~/.claude/scripts/*.sh

Hook setup — Windows native (PowerShell)

Add to %USERPROFILE%\.claude\settings.json (see examples/agent-self-improve/settings-windows.json.example):

{
  "hooks": {
    "Stop": [
      {
        "hooks": [{"type": "command", "command": "powershell -NonInteractive -File %USERPROFILE%\\.claude\\scripts\\session-end.ps1"}]
      }
    ],
    "PreToolUse": [
      {
        "matcher": ".*",
        "hooks": [{"type": "command", "command": "powershell -NonInteractive -File %USERPROFILE%\\.claude\\scripts\\session-start-reminder.ps1"}]
      }
    ],
    "PostToolUse": [
      {
        "matcher": ".*",
        "hooks": [
          {
            "type": "command",
            "command": "powershell -NonInteractive -Command \"$code=$env:CLAUDE_TOOL_EXIT_CODE; if ($code -and $code -ne '0') { $ts=(Get-Date -Format 'yyyy-MM-ddTHH:mm:ssZ'); $name=$env:CLAUDE_TOOL_NAME; \\\"$ts TOOL_FAILURE: $name\\\" | Add-Content -Encoding UTF8 \\\"$env:USERPROFILE\\.claude\\.learnings\\.pending-errors.log\\\" }\""
          }
        ]
      }
    ]
  }
}

If PowerShell blocks script execution, allow local scripts: Set-ExecutionPolicy -Scope CurrentUser RemoteSigned

What users need to do

Bootstrap

Two explicit subcommands — no interactive prompt needed:

/platform-skills:self-improve init global
/platform-skills:self-improve init local

init without an argument asks you to choose and then proceeds as one of the above.

Or copy from examples/agent-self-improve/ (macOS/Linux/WSL):

cp -r examples/agent-self-improve/.learnings ~/.claude/
cp -r examples/agent-self-improve/memory ~/.claude/

Windows native (PowerShell):

Copy-Item -Recurse examples\agent-self-improve\.learnings "$env:USERPROFILE\.claude\"
Copy-Item -Recurse examples\agent-self-improve\memory "$env:USERPROFILE\.claude\"

Per-session workflow

When	Action
Session start (fresh)	Agent reads `working-buffer.md` and `.learnings/` automatically
Session start (interrupted)	Run `/platform-skills:self-improve resume` to verify state and continue
After a mistake	Agent logs to `.learnings/ERRORS.md`; sets `resolved` immediately if fix was applied
After a useful insight	Agent logs to `.learnings/LEARNINGS.md` automatically
Pattern recurs 3×	Run `/platform-skills:self-improve review` to promote
Lesson is stable	Run `/platform-skills:self-improve promote` to write to project memory

No CI changes required

The .learnings/ directory and memory/working-buffer.md are local file state. No pipeline, no cluster access, no cloud credentials needed.

Integration with other platform-skills domains

Domain	Integration point
`references/platform-mindset.md`	Post-mortems and blameless retros are the human equivalent of `.learnings/ERRORS.md` — use both
`references/mcp.md`	An MCP server can expose `.learnings/` contents as a resource so Claude reads it via `resources/read` without manual file loading
`references/conventional-commits.md`	Use conventional commit format when promoting learnings to `CLAUDE.md`: `docs(memory): promote ERR-20260520-001 — never use terraform destroy without state backup`
`references/platform-operating-model.md`	ADL Protocol maps directly to the ownership boundary decisions described there

Troubleshooting

Working buffer grows too large

Compact it: summarise completed WAL entries into a single ## Completed section and delete individual entries. Keep only the current in-progress task at full detail.

Learnings not persisting across sessions

Check whether .learnings/ is in .gitignore. If it is, the agent must re-read the files explicitly at session start — they are not loaded automatically. Use /platform-skills:self-improve init to verify the setup.

Agent is acting too proactively

Raise the VFM threshold in CLAUDE.md or AGENTS.md:

# Agent self-improvement settings
VFM_THRESHOLD=70   # default 50; raise to require stronger justification

Agent is not logging errors

Check that the PostToolUse hook is configured in .claude/settings.json. Alternatively, ask the agent to log manually: "Log that error to .learnings/ERRORS.md."