Skills for building AEM Edge Delivery Services sites — block development, content modeling, code review, testing, and page import.
82
76%
Does it follow best practices?
Impact
88%
1.04x
Average score across 6 eval scenarios
Advisory
Suggest reviewing before use
{
  "context": "Tests whether the agent produces structured acceptance criteria for an iOS Safari scroll-snap bug fix without writing code or proposing a solution. Covers bug analysis, root cause investigation, reproduction steps, cross-browser scope, and regression prevention.",
  "type": "weighted_checklist",
  "checklist": [
    {
      "name": "Bug clearly restated",
      "description": "Accurately summarizes the reported behavior (carousel snaps back to first item), the affected browser (iOS Safari only), and the trigger condition (swipe to next testimonial)",
      "max_score": 10
    },
    {
      "name": "Root cause analysis",
      "description": "Identifies the CSS change from overflow-x: auto to overflow-x: scroll as the likely cause; discusses the scroll-snap and overflow interaction on iOS Safari",
      "max_score": 15
    },
    {
      "name": "Expected behavior defined",
      "description": "Clearly states what correct scrolling behavior looks like: carousel advances to the next item on swipe and stays there, consistently across all browsers",
      "max_score": 12
    },
    {
      "name": "Reproduction criteria",
      "description": "Describes how to reproduce the bug: iOS Safari, swipe gesture on the carousel, and any specific conditions (e.g., number of items, scroll speed)",
      "max_score": 10
    },
    {
      "name": "Edge cases identified",
      "description": "Addresses edge cases such as rapid swiping, swiping past the last item, single-item carousel, orientation change during scroll, and programmatic scroll",
      "max_score": 12
    },
    {
      "name": "Cross-browser scope",
      "description": "Acceptance criteria ensure the fix does not break behavior on Chrome, Firefox, or desktop Safari",
      "max_score": 10
    },
    {
      "name": "No premature solution",
      "description": "Analyzes the problem without prescribing a specific CSS fix or implementation approach",
      "max_score": 10
    },
    {
      "name": "Responsive behavior considered",
      "description": "Mentions different viewport behaviors if relevant (e.g., touch vs non-touch, portrait vs landscape)",
      "max_score": 8
    },
    {
      "name": "Definition of done is testable",
      "description": "Includes a specific browser/device test matrix (e.g., iOS Safari 16+, Chrome latest, Firefox latest, desktop Safari)",
      "max_score": 8
    },
    {
      "name": "Regression prevention",
      "description": "Mentions the need for a test or verification step to prevent re-introduction of the bug in future refactors",
      "max_score": 5
    }
  ]
}
}
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
skills
analyze-and-plan
block-collection-and-party
block-inventory
building-blocks
code-review
content-driven-development
content-modeling
docs-search
find-test-content
generate-import-html
identify-page-structure
page-decomposition
page-import
preview-import
scrape-webpage
testing-blocks
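The `weighted_checklist` rubric above lists criteria with a `max_score` each (the ten values sum to 100). The page does not say how an eval harness turns per-criterion points into a percentage, so the following is only a minimal sketch under one plausible assumption: the score is points earned divided by points available. The function name `score_checklist` and the `awarded` mapping are hypothetical, and the rubric here is trimmed to three of the ten criteria for brevity.

```python
import json

# Hypothetical helper: not part of the skill package or of any real eval harness.
def score_checklist(rubric: dict, awarded: dict) -> float:
    """Return the percentage of available points earned.

    Assumes each criterion's points are clamped to [0, max_score] and that
    criteria missing from `awarded` earn zero.
    """
    total = 0
    earned = 0
    for item in rubric["checklist"]:
        max_score = item["max_score"]
        total += max_score
        points = awarded.get(item["name"], 0)
        earned += min(max(points, 0), max_score)
    if total == 0:
        raise ValueError("rubric has no points to award")
    return 100.0 * earned / total


# Three criteria copied from the rubric above (descriptions omitted).
rubric = json.loads("""
{
  "type": "weighted_checklist",
  "checklist": [
    {"name": "Bug clearly restated", "max_score": 10},
    {"name": "Root cause analysis", "max_score": 15},
    {"name": "Regression prevention", "max_score": 5}
  ]
}
""")

# Illustrative grades only; these are not results from the page.
awarded = {
    "Bug clearly restated": 10,
    "Root cause analysis": 12,
    "Regression prevention": 2,
}

print(score_checklist(rubric, awarded))  # prints 80.0
```

Clamping each criterion keeps one over-generous grade from masking gaps elsewhere; a real harness may weight or normalize differently.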