CtrlK
BlogDocsLog inGet started
Tessl Logo

tessl/npm-google--gemini-cli-core

Gemini CLI Core - Core functionality library for the open-source AI agent that brings the power of Gemini directly into your terminal.

Overall
score

87%

Evaluation87%

1.01x

Agent success when using this tile

Overview
Eval results
Files

rubric.jsonevals/scenario-9/

{
  "context": "This evaluation assesses how effectively the engineer uses the @iarna/toml package to implement a policy rule validation system. The criteria focus on proper TOML parsing, error handling, and implementing the core policy engine logic (pattern matching, priority evaluation, and mode filtering) as specified in the problem.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "TOML parsing setup",
      "description": "Uses @iarna/toml package (specifically the parse or parseAsync function) to parse TOML content into JavaScript objects in the parseTomlRules function",
      "max_score": 15
    },
    {
      "name": "Parse error handling",
      "description": "Implements proper error handling for TOML parsing errors and validates required fields (toolName, decision, priority) exist with appropriate types in parsed rules",
      "max_score": 15
    },
    {
      "name": "Decision validation",
      "description": "Validates that the decision field contains only valid values ('allow', 'deny', or 'ask_user') and throws an error for invalid values",
      "max_score": 10
    },
    {
      "name": "Wildcard pattern matching",
      "description": "Implements pattern matching in matchesToolPattern that correctly handles both exact string matches and wildcard patterns (e.g., 'github__*' matching 'github__create_issue')",
      "max_score": 15
    },
    {
      "name": "Priority-based evaluation",
      "description": "In evaluatePolicy, correctly sorts or filters rules by priority value (highest priority first) and returns the highest priority matching rule",
      "max_score": 15
    },
    {
      "name": "Mode filtering logic",
      "description": "Correctly filters rules based on approval mode - rules with undefined/empty modes array match all modes, rules with specific modes array only match if current mode is in the array",
      "max_score": 15
    },
    {
      "name": "Command regex matching",
      "description": "When commandRegex is specified in a rule, uses JavaScript RegExp to test if the provided command matches the pattern before considering the rule as matching",
      "max_score": 10
    },
    {
      "name": "Complete evaluation logic",
      "description": "The evaluatePolicy function integrates all matching criteria (tool name pattern, command regex if present, mode filtering) and returns null when no rules match",
      "max_score": 5
    }
  ]
}

Install with Tessl CLI

npx tessl i tessl/npm-google--gemini-cli-core

tile.json