CtrlK
BlogDocsLog inGet started
Tessl Logo

jbaruch/koog

Koog 1.0 idioms, gotchas, and scaffolding skills for Kotlin agents on the JVM

86

1.86x
Quality

88%

Does it follow best practices?

Impact

86%

1.86x

Average score across 45 eval scenarios

SecuritybySnyk

Advisory

Suggest reviewing before use

Overview
Quality
Evals
Security
Files

criteria.jsonevals/scenario-31/

{
  "context": "Tests whether the agent reaches for the planner's PlannerNode tree (the right depth — parallel + sequential composition modeled explicitly) AND uses storage keys to track in-flight subtasks for the debugging requirement — rather than building this as a graph strategy with hand-wired conditional edges.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Uses PlannerNode.parallel inside PlannerNode.sequential",
      "description": "Composes the plan via PlannerNode.sequential(...) wrapping PlannerNode.parallel(...) — explicit tree composition. Does NOT model the angles as conditional edges in a strategy { ... } graph (which serializes them) and does NOT use PlannerNode.sequential alone (would serialize the angles)",
      "max_score": 30
    },
    {
      "name": "Builds the agent with PlannerAIAgent / Planners.*",
      "description": "Constructs the agent through the top-level AIAgent(...) factory with a strategy from Planners.llmBased or AIAgentPlannerStrategy(..., planner). Does NOT use GraphAIAgent (which would mean hand-rolling planner behavior in a graph)",
      "max_score": 20
    },
    {
      "name": "Tracks current subtask via storage key",
      "description": "Declares a typed storage key (createStorageKey<PlannerNode.Builder.Reference>(\"...\")) and stores the current subtask reference on entry. Without this the debugging requirement (\"know which subtask is executing\") is unmet",
      "max_score": 20
    },
    {
      "name": "Adds the planner module dependency",
      "description": "Adds ai.koog:agents-planner at 1.0.0+ to build.gradle.kts. The umbrella does not include it",
      "max_score": 15
    },
    {
      "name": "Sets maxIterations well above the default",
      "description": "Raises maxIterations significantly above the factory default of 50 (e.g., 200+). Each subtask plus the planning step costs multiple iterations; a tree of 4 angles plus composition needs much more than 50",
      "max_score": 10
    },
    {
      "name": "Does not over-engineer with graph subgraphs",
      "description": "Does not author a strategy { ... } block with subgraphWithTask on top of the planner — the planner already handles the tree internally; adding subgraphs on top is double-modeling",
      "max_score": 5
    }
  ]
}

evals

README.md

tile.json