test-oracle-generator

Generates test oracles — the "expected output" part of a test — by choosing among reference implementations, invariants, inverse functions, or differential comparison when the correct answer isn't obvious. Use when the hard part of testing is knowing what the right answer is, not generating inputs.

Quality

88%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Securityby

Passed

No known issues

Quality

Discovery

85%

Based on the skill's description, can an agent find and select it at the right time? Clear, specific descriptions lead to better discovery.

This is a strong description that clearly articulates a specific testing niche (the oracle problem) with concrete techniques and explicit usage triggers. The main weakness is trigger term coverage - while technically accurate, it could include more natural language variations that users might employ when struggling with test assertions or expected values.

Suggestions

Add natural trigger terms users might say: 'assertions', 'expected values', 'verify output', 'test expectations', 'what should the test check'

Dimension	Reasoning	Score
Specificity	Lists multiple specific concrete actions: 'choosing among reference implementations, invariants, inverse functions, or differential comparison' - these are distinct, concrete techniques for generating test oracles.	3 / 3
Completeness	Clearly answers both what ('Generates test oracles by choosing among reference implementations, invariants, inverse functions, or differential comparison') AND when ('Use when the hard part of testing is knowing what the right answer is, not generating inputs').	3 / 3
Trigger Term Quality	Includes some relevant terms like 'test oracles', 'expected output', 'testing', and 'right answer', but misses common variations users might say like 'assertions', 'test expectations', 'verify results', or 'check output'.	2 / 3
Distinctiveness Conflict Risk	Highly distinctive niche focusing specifically on the oracle problem in testing, with clear differentiation from general test generation skills through the explicit 'when the correct answer isn't obvious' framing.	3 / 3
	Total	11 / 12 Passed

Implementation

92%

Reviews the quality of instructions and guidance provided to agents. Good implementation is clear, handles edge cases, and produces reliable results.

This is an excellent skill that efficiently teaches oracle selection with a clear decision tree and concrete, executable examples. The content is dense but well-structured, with the table providing quick reference and worked examples demonstrating each pattern. Minor improvement possible in progressive disclosure by extracting detailed examples to separate files.

Dimension	Reasoning	Score
Conciseness	Every section earns its place. The decision flow, table, and examples are dense with actionable information. No explanations of concepts Claude already knows (like what testing is or how assertions work).	3 / 3
Actionability	Provides fully executable Python code examples with Hypothesis for each oracle type. The worked examples are copy-paste ready with complete implementations of slow_median, invariant checks, and round-trip tests.	3 / 3
Workflow Clarity	The decision flow diagram provides an explicit sequence for choosing oracle types. Each branch has clear criteria and the 'Do not' section provides validation guidance against common mistakes.	3 / 3
Progressive Disclosure	Content is well-organized with clear sections, but everything is inline in one file. The metamorphic relation row references another skill, which is good, but the skill could benefit from separating the worked examples into a reference file for complex use cases.	2 / 3
	Total	11 / 12 Passed

Validation

100%

Warnings & errors only

Checks the skill against the spec for correct structure and formatting. All validation checks must pass before discovery and implementation can be scored.

Validation — 11 / 11 Passed

Validation for skill structure

No warnings or errors.

Repository: santosomar/general-secure-coding-agent-skills
Commit: 47d56bb

Reviewed: 3 months ago

Table of Contents

Discovery Implementation Validation

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.