Tiles from experiments
| Tile | Score | Impact | Security | Updated |
|---|---|---|---|---|
experiments/eval-setup v0.3.1| Skills | 97 Quality 96% Does it follow best practices? Impact Pending Average score across 0 eval scenarios Securityby Advisory Suggest reviewing before use Reviewed: Version: 0.3.1 | Advisory | ||
experiments/eval-improve v0.5.0| Skills | 94 1.02x Agent success vs baseline Quality 88% Does it follow best practices? Impact 100% 1.02xAverage score across 5 eval scenarios Securityby Passed No known issues Reviewed: Version: 0.5.0 | 1.02x | Passed |