Tiles from experiments
| Tile | Score | Security | Updated |
|---|---|---|---|
experiments/eval-setup v0.3.1| Skills | 97 Quality 97% Does it follow best practices? Impact Pending No eval scenarios have been run Securityby Advisory Suggest reviewing before use Version: 0.3.1 | Advisory | |
experiments/eval-improve v0.5.0| Skills | 94 1.02x Agent success vs baseline Quality 90% Does it follow best practices? Impact 100% 1.02xAverage score across 5 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.5.0 | Passed |