Ctrl + k

or run

tessl search
Log in

Version

Workspace
tessl
Visibility
Public
Created
Last updated
Describes
pypipkg:pypi/crosshair-tool@0.0.x
tile.json

tessl/pypi-crosshair-tool

tessl install tessl/pypi-crosshair-tool@0.0.0

Analyze Python code for correctness using symbolic execution and SMT solving to automatically find counterexamples for functions with type annotations and contracts.

Agent Success

Agent success rate when using this tile

86%

Improvement

Agent success rate improvement when using this tile compared to baseline

1.25x

Baseline

Agent success rate without this tile

69%

rubric.jsonevals/scenario-10/

{
  "context": "This criteria evaluates how effectively the engineer uses CrossHair's test generation capabilities to automatically create comprehensive test suites with high code coverage. The focus is on proper use of the crosshair cover command, pytest output format generation, and coverage-driven test creation.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Uses crosshair cover",
      "description": "Implementation invokes the crosshair cover command (via subprocess or similar mechanism) to generate tests for the target module",
      "max_score": 30
    },
    {
      "name": "Pytest format output",
      "description": "Implementation configures or uses CrossHair to generate test output in pytest format (complete test functions with imports and assertions)",
      "max_score": 25
    },
    {
      "name": "Writes output file",
      "description": "Implementation correctly captures the generated test output and writes it to the specified output_path file",
      "max_score": 20
    },
    {
      "name": "Handles module path",
      "description": "Implementation correctly passes the module_path parameter to the crosshair cover command for test generation",
      "max_score": 15
    },
    {
      "name": "Returns success status",
      "description": "Implementation returns True when test generation succeeds and False when it fails, properly handling command execution status",
      "max_score": 10
    }
  ]
}