Skill | Added | Review |
|---|---|---|
fix-ci-tests Diagnose and fix CI failures on a GitHub PR by analyzing failing checks, reading logs, and applying fixes | 77 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 729dfbb | |
fix-local-tests Fix failing tests by prioritising shell implementation fixes to match bash behaviour | 63 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: 729dfbb | |
review-fix-loop Self-review a PR, fix all issues, and re-review in a loop until clean. Coordinates code-review, address-pr-comments, and fix-ci-tests skills. | 68 Impact Pending No eval scenarios have been run Securityby Critical Do not install without reviewing Reviewed: Version: 729dfbb | |
code-review Comprehensive code review covering security, correctness, bash compatibility, test coverage, and code quality. Use for PRs, commits, or any code changes. | 77 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 729dfbb | |
improve-loop Systematically review and improve every shell feature and builtin command. Iterates through each feature/command, runs code-review, fixes issues, and re-reviews until clean. | 66 1.56x Agent success vs baseline Impact 91% 1.56xAverage score across 3 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: 729dfbb | |
implement-posix-command Implement a new POSIX command as a builtin in the safe shell interpreter | 65 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 729dfbb | |
improve-test-coverage Improve test coverage for shell features and commands using reference test suites from yash, GNU coreutils, and uutils/coreutils | 54 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 729dfbb | |
address-pr-comments Read PR review comments, evaluate validity, implement fixes, push changes, and reply/resolve threads | 68 Impact Pending No eval scenarios have been run Securityby Passed No known issues Reviewed: Version: 729dfbb | |
gtfobins-validate Validate shell builtins against GTFOBins attack patterns to ensure exploits are blocked by the sandbox | 73 Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Reviewed: Version: 729dfbb |