coding-agent-helpers/regression-scout

Use when the user wants regression hunting after a change. Identify nearby flows, shared code paths, error states, and configuration edges that may have broken even if the main fix works. Good triggers include "check for regressions", "what else might this have broken", and "test the surrounding area".

Quality: 94% (Does it follow best practices?)

Impact: 98%, 2.72x (average score across 8 eval scenarios)

Security: Passed, no known issues

A data engineering team optimized a Python batch job that processes daily transaction records. The job previously read all records into memory at once; it now uses a streaming/chunked approach (reading 1000 records at a time). The job runs nightly and the team wants a regression scout before the next scheduled run. Write findings to report.md.

=============== FILE: inputs/batch-job.md ===============

Transaction Batch Job Overview

Name: coding-agent-helpers/regression-scout
Rating: 96.8 (1 reviews)
Author: coding-agent-helpers

Change

jobs/process_transactions.py — main job file (CHANGED: added chunked reading)
Reads 1000 rows at a time using OFFSET/LIMIT instead of SELECT *
Downstream: writes results to reports/daily_summary.json and inserts to processed_transactions table

Job Pipeline

Read transactions from transactions table
Apply enrichment from merchant_lookup table (JOIN)
Calculate per-merchant summaries
Write summary to reports/daily_summary.json
Insert enriched records to processed_transactions (upsert on transaction_id)

Known Edge Cases

Duplicate transaction_id values exist in source (job uses upsert)
Some transactions have NULL merchant_id (handled by left join, produces "unknown" merchant)
Report file is overwritten atomically at job end
Database uses READ COMMITTED isolation; other processes may write during job run

Performance Notes

Old approach: loaded ~500k rows into memory, occasionally hit 8GB limit
New approach: processes in chunks, but OFFSET performance degrades for large offsets
Nightly window: job must complete within 2-hour maintenance window

=============== END FILE ===============
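One check a regression scout might propose for this change is an equivalence test: the chunked OFFSET/LIMIT read should produce the same per-merchant summary as the old full read, including the "unknown" bucket for NULL merchant_id. The sketch below is illustrative only. It uses an in-memory sqlite3 database as a stand-in for the real one, and the `amount` column, the merchant names, and the helper functions `make_db`, `summarize_full`, and `summarize_chunked` are all assumptions that do not come from the job description.

```python
import sqlite3
from collections import defaultdict

CHUNK_SIZE = 1000  # chunk size stated in the job overview

# Hypothetical schema: only the table names and the transaction_id and
# merchant_id columns come from the overview; `amount` and `name` are assumed.
BASE_QUERY = """
    SELECT t.transaction_id, COALESCE(m.name, 'unknown') AS merchant, t.amount
    FROM transactions AS t
    LEFT JOIN merchant_lookup AS m ON m.merchant_id = t.merchant_id
    ORDER BY t.transaction_id
"""


def make_db(n_rows=2500):
    """Build an in-memory stand-in for transactions and merchant_lookup."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE transactions "
        "(transaction_id INTEGER, merchant_id INTEGER, amount INTEGER)"
    )
    conn.execute(
        "CREATE TABLE merchant_lookup (merchant_id INTEGER PRIMARY KEY, name TEXT)"
    )
    conn.executemany(
        "INSERT INTO merchant_lookup VALUES (?, ?)", [(1, "acme"), (2, "globex")]
    )
    rows = []
    for i in range(n_rows):
        # Every tenth row has a NULL merchant_id, as in the known edge cases.
        merchant_id = None if i % 10 == 0 else 1 + (i % 2)
        rows.append((i, merchant_id, 100 + i % 7))
    conn.executemany("INSERT INTO transactions VALUES (?, ?, ?)", rows)
    return conn


def summarize_full(conn):
    """Old behaviour: read everything in one query and total per merchant."""
    totals = defaultdict(int)
    for _tid, merchant, amount in conn.execute(BASE_QUERY):
        totals[merchant] += amount
    return dict(totals)


def summarize_chunked(conn, chunk_size=CHUNK_SIZE):
    """New behaviour: page with LIMIT/OFFSET, accumulating totals across chunks."""
    totals = defaultdict(int)
    offset = 0
    while True:
        chunk = conn.execute(
            BASE_QUERY + " LIMIT ? OFFSET ?", (chunk_size, offset)
        ).fetchall()
        if not chunk:
            break
        for _tid, merchant, amount in chunk:
            totals[merchant] += amount
        offset += len(chunk)
    return dict(totals)


if __name__ == "__main__":
    conn = make_db()
    print("summaries match:", summarize_full(conn) == summarize_chunked(conn))
```

The sketch relies on `ORDER BY` to give every page a stable position. Because the source can contain duplicate transaction_id values, a real query would likely need an additional tie-breaking column to make the order total. The check also runs against a static table, so it does not cover rows inserted or deleted by other processes between chunks, which can shift OFFSET positions under READ COMMITTED.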
