Auto-generated tile from GitHub (107 skills)
Eval Run Status
Version
1ff5bc820f52cb485a7f88fce01c16d2b3e48847
Score
Agent success rate when using this tile
81%
Improvement
Agent success rate improvement when using this tile compared to baseline
1.8x
Baseline
Agent success rate without this tile
45%
Schema-first API inspection
Schema inspection step
15%
100%
Help command reference
100%
100%
Correct CLI resource syntax
0%
100%
Params flag used
0%
100%
Schema-driven flag construction
13%
100%
No secret output
100%
100%
Plain text vs rich text appending
+write for plain text
100%
100%
batchUpdate for table
100%
100%
Correct +write flags
100%
100%
batchUpdate uses --json
100%
100%
Reason for split documented
100%
100%
Confirmation before write
0%
0%
No hardcoded credentials
100%
100%
Batch update safety with dry-run
Dry-run preview pass
22%
100%
Dry-run is separate from live run
100%
100%
User confirmation before live run
0%
0%
Correct batchUpdate syntax
0%
100%
Loops over all IDs
100%
100%
Runbook dry-run explanation
27%
100%
No secrets exposed
100%
100%
Output formatting and pagination
--format flag used
0%
100%
--page-all flag present
0%
100%
--output flag for file saving
0%
0%
FORMAT argument controls --format
0%
100%
Pagination flags documented
0%
100%
Output filename matches format
100%
100%
Service account auth and PII screening
GOOGLE_APPLICATION_CREDENTIALS env var
72%
22%
No credential values output
90%
100%
--sanitize flag on get
0%
0%
Credential path as variable
100%
100%
Setup guide covers no-secret rule
100%
100%
Sanitize explained in setup guide
16%
100%