Automated pipeline that takes a company name and produces a custom Tessl skill plus an eval report showing per-scenario lift (baseline agent vs with-skill agent). A1 MVP cell of the produce/consume × personalization 2x2.
88
86%
Does it follow best practices?
Impact
89%
1.45xAverage score across 13 eval scenarios
Advisory
Suggest reviewing before use
A sales team scraped a developer conference badge list and exported it into a text file. The list has 14 entries, but many are problematic: some appear to be marketing taglines rather than company names, some use "trading as" / "t/a" notation for legal entities, some are acronyms whose meaning isn't immediately clear, and a few may simply be typos with no identifiable referent.
The team needs a triage report to decide which entries are worth running through the discovery pipeline. For any entries that were ambiguous and required research to resolve, include a brief resolution note explaining what you found so the team can audit your reasoning. Entries that remain truly unresolvable after thorough research should be flagged for manual review.
The raw list is at inputs/companies.txt.
Produce two files:
triage-report.md — the classification report with all four sections. For any entry that was ambiguous and required research to resolve, include a parenthetical resolution note on the same line (e.g., - Liquid AI *(resolved from tagline "Purpose AI at Every Scale")*).
dedup-output.json — the raw JSON from the deduplication step (unique, count_in, count_out).
evals
scenario-1
scenario-2
scenario-3
scenario-4
scenario-5
scenario-6
scenario-7
scenario-8
scenario-9
scenario-10
scenario-11
scenario-12
scenario-13
skills
batch-driver
build-and-evaluate
company-list-filter
discovery
discovery-produce
select-target