Skill | Added | Review |
|---|---|---|
fix-ci-tests Diagnose and fix CI failures on a GitHub PR by analyzing failing checks, reading logs, and applying fixes | 61 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 00bdc03 | |
fix-local-tests Fix failing tests by prioritising shell implementation fixes to match bash behaviour | 50 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: 00bdc03 | |
review-fix-loop Self-review a PR, fix all issues, and re-review in a loop until clean. Coordinates code-review, address-pr-comments, and fix-ci-tests skills. | 54 Impact — No eval scenarios have been run Securityby Critical Do not install without reviewing Reviewed: Version: 00bdc03 | |
code-review Comprehensive code review covering security, correctness, bash compatibility, test coverage, and code quality. Use for PRs, commits, or any code changes. | 61 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 00bdc03 | |
improve-loop Systematically review and improve every shell feature and builtin command. Iterates through each feature/command, runs code-review, fixes issues, and re-reviews until clean. | 60 1.56x Agent success vs baseline Impact 91% 1.56xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: 00bdc03 | |
implement-posix-command Implement a new POSIX command as a builtin in the safe shell interpreter | 52 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 00bdc03 | |
improve-test-coverage Improve test coverage for shell features and commands using reference test suites from yash, GNU coreutils, and uutils/coreutils | 42 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 00bdc03 | |
address-pr-comments Read PR review comments, evaluate validity, implement fixes, push changes, and reply/resolve threads | 48 Impact — No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: 00bdc03 | |
gtfobins-validate Validate shell builtins against GTFOBins attack patterns to ensure exploits are blocked by the sandbox | 57 Impact — No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 00bdc03 |