Iterative evaluation of Fusion Framework MCP search quality against documented domain patterns. Loads domain files from eval/index/, queries Fusion MCP for each pattern, validates recall against must/should requirements, and produces a human-readable pass/fail report. USE FOR: eval core, eval all, evaluate MCP index accuracy, validate search recall for a domain, check index freshness. DO NOT USE FOR: writing domain patterns, populating eval/index/ files, running CI pipelines, or batch automation.
72
88%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Advisory
Suggest reviewing before use
Use this skill to evaluate whether Fusion MCP search returns accurate, relevant results for documented framework patterns.
Typical triggers:
eval core — evaluate all patterns in eval/index/core.mdeval http-services — evaluate patterns in eval/index/http-services.mdeval all — evaluate every domain file in eval/index/Do not use this skill for:
eval/index/ (follow the README template)Collect before execution:
core, http-services) or allstrict (must patterns only) or full (must + should); default is fullIf the user says only eval with no domain, ask which domain to evaluate or whether to run all.
core), read eval/index/<domain>.mdall, list all .md files in eval/index/ except README.md and process each one sequentially## query sections, report it as empty and skipDomain files have a simple structure:
# Domain Name — the domain heading# — judgement instructions for how to evaluate results across all queries in this domain## <query> — each ## heading is a search query to run against MCP- must ... / - should ... — expectations for each query's resultsExtract the judgement instructions once (use them as context when evaluating every query). Then build a list of queries, each with its must and should expectations.
For each ## heading, spin up a query-judge sub-agent (see agents/query-judge.md). Pass it:
must and should bullets for that queryThe judge sub-agent will:
pass / partial / fail) with counts and explanationIf the runtime does not support sub-agents, follow the same workflow inline:
mcp_fusion_search_framework (preferred) or mcp_fusion_searchmust / should bulletCollect all verdicts before producing the report.
Produce a report following the template in assets/report-template.md. The report includes:
eval all, present a per-domain summary first, then offer to show detailed results for any domainmust failures prominently — these indicate critical index gapsA structured evaluation report containing:
eval coreUser: eval core
Workflow:
eval/index/core.md@equinor/fusion-framework and @equinor/fusion-framework-module..."## headingsmust/should bullets## Evaluation Report: core
Date: 2026-03-14
Strictness: full
Domain: eval/index/core.md
| # | Query | Verdict | Explanation |
|---|-------|---------|-------------|
| 1 | How to initialize Fusion Framework | pass | Results mention FrameworkConfigurator, init, configureMsal |
| 2 | How to create a custom module | pass | Module interface, BaseConfigBuilder, BaseModuleProvider all covered |
| 3 | Module lifecycle phases | partial | Lifecycle order shown but postConfigure/postInitialize hooks not detailed |
| 4 | How to configure an app with AppModuleInitiator | fail | No results reference AppModuleInitiator |
| 5 | How to listen to framework events | pass | addEventListener, dispatchEvent, event module all returned |
### Summary
- Queries: 5 | Pass: 3 | Partial: 1 | Fail: 1
- Must expectations met: 14/17 (82%)
### Recommendations
1. **[CRITICAL]** Query 4: Re-index `packages/app/src/types.ts` and `cookbooks/app-react/src/config.ts`
2. **[IMPROVE]** Query 3: Enrich lifecycle section in `packages/modules/module/README.md`partial with explanation060e3af
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.