Compare runtime behavior between original and migrated repositories to detect behavioral differences, regressions, and semantic changes. Use when validating code migrations, refactorings, language ports, framework upgrades, or any transformation that should preserve behavior. Automatically compares test results, execution traces, API responses, and observable outputs between two repository versions. Provides actionable guidance for fixing deviations and ensuring behavioral equivalence.
Install with Tessl CLI:

```
npx tessl i github:ArabelaTso/Skills-4-SE --skill behavior-preservation-checker83
```
Does it follow best practices?

If you maintain this skill, you can automatically optimize it using the Tessl CLI to improve its score:

```
npx tessl skill review --optimize ./path/to/skill
```

Validation for skill structure
Test-based comparison workflow

| Check | Without context | With context |
| --- | --- | --- |
| pytest json-report flag | 0% | 100% |
| json-report-file flag | 0% | 100% |
| compare_test_results.py usage | 0% | 50% |
| Report behavioral_equivalence | 50% | 100% |
| Report summary fields | 0% | 100% |
| Severity classification | 100% | 100% |
| Differences list | 0% | 100% |
| Recommendations list | 0% | 100% |
| Separate result files | 0% | 100% |
| Variance regression identified | 100% | 100% |
Without context: $0.4355 · 5m 54s · 22 turns · 27 in / 6,918 out tokens
With context: $1.1129 · 3m 43s · 39 turns · 4,129 in / 11,614 out tokens
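The test-based workflow above runs pytest with the `pytest-json-report` plugin's `--json-report --json-report-file=<path>` flags in each repository, then diffs the two reports. The helper below is a minimal sketch of what a `compare_test_results.py`-style comparison could look like; the field names (`behavioral_equivalence`, `differences`, `severity`) follow the checks listed in the table, but the skill's actual script may differ:

```python
import json

def load_outcomes(report_path):
    """Map test node IDs to outcomes from a pytest-json-report file."""
    with open(report_path) as f:
        report = json.load(f)
    return {t["nodeid"]: t["outcome"] for t in report.get("tests", [])}

def compare_reports(original_path, migrated_path):
    """Diff two pytest JSON reports and compute a behavioral-equivalence score."""
    original = load_outcomes(original_path)
    migrated = load_outcomes(migrated_path)
    shared = set(original) & set(migrated)
    differences = [
        {"test": t, "original": original[t], "migrated": migrated[t],
         # a pass-to-fail change is a regression; fail-to-pass is only minor
         "severity": "critical" if original[t] == "passed" else "minor"}
        for t in sorted(shared) if original[t] != migrated[t]
    ]
    equivalence = 100.0 * (len(shared) - len(differences)) / len(shared) if shared else 0.0
    return {"behavioral_equivalence": equivalence, "differences": differences}
```

Keeping each run's report in a separate result file (as the "Separate result files" check requires) makes the comparison reproducible without re-running either test suite.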
behavior_checker.py orchestrator and report format

| Check | Without context | With context |
| --- | --- | --- |
| behavior_checker.py invocation | 41% | 66% |
| behavior_report.json created | 100% | 100% |
| Report summary section | 20% | 100% |
| passed_original_failed_migrated count | 0% | 100% |
| Differences array | 0% | 100% |
| Severity on differences | 0% | 100% |
| Critical severity regression | 0% | 100% |
| Recommendations array | 0% | 100% |
| Guidance on differences | 0% | 100% |
| behavioral_equivalence percentage | 0% | 100% |
| Workflow log | 100% | 100% |
Without context: $0.6131 · 4m 31s · 25 turns · 32 in / 9,137 out tokens
With context: $1.1942 · 4m 51s · 40 turns · 4,088 in / 15,387 out tokens
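Based on the checks above, a `behavior_report.json` needs a `summary` section (with a `passed_original_failed_migrated` count and a `behavioral_equivalence` percentage), a `differences` array with severities, and a `recommendations` array. The sketch below assembles such a report; the schema is an assumption inferred from those checks, not the skill's canonical format:

```python
import json

def build_behavior_report(original_results, migrated_results):
    """Assemble a behavior_report.json-style dict.

    original_results / migrated_results: dict mapping test name to
    "passed" or "failed".
    """
    regressions = [t for t in sorted(original_results)
                   if original_results[t] == "passed"
                   and migrated_results.get(t) == "failed"]
    differences = [
        {"test": t,
         "original": "passed",
         "migrated": "failed",
         "severity": "critical",  # pass-to-fail regressions are always critical
         "recommendation": f"Inspect the migrated code paths exercised by {t}"}
        for t in regressions
    ]
    total = len(original_results)
    matching = sum(1 for t in original_results
                   if migrated_results.get(t) == original_results[t])
    return {
        "summary": {
            "total_tests": total,
            "passed_original_failed_migrated": len(regressions),
            "behavioral_equivalence": round(100.0 * matching / total, 1) if total else 0.0,
        },
        "differences": differences,
        "recommendations": [d["recommendation"] for d in differences],
    }
```

The orchestrator would then write the dict with `json.dump(report, f, indent=2)` to `behavior_report.json` alongside its workflow log.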
Property-based testing and multi-method comparison

| Check | Without context | With context |
| --- | --- | --- |
| hypothesis import | 0% | 100% |
| @given decorator used | 0% | 100% |
| strategies used | 0% | 100% |
| Equivalence assertion in property tests | 50% | 100% |
| Multiple comparison methods | 100% | 100% |
| Tolerance threshold applied | 100% | 100% |
| behavior_report.json present | 100% | 100% |
| Report differences populated | 100% | 100% |
| Report recommendations | 100% | 100% |
| Severity levels present | 100% | 100% |
| comparison_notes.md present | 100% | 100% |
Without context: $1.2700 · 9m 27s · 38 turns · 42 in / 20,108 out tokens
With context: $2.6824 · 12m 26s · 58 turns · 2,199 in / 31,020 out tokens
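The property-based scenario checks for a `hypothesis` import, the `@given` decorator, strategies, an equivalence assertion, and a tolerance threshold. A minimal sketch of such a property test; the `discount_*` functions are hypothetical stand-ins for an original and a migrated implementation:

```python
import math
from hypothesis import given, strategies as st

# Hypothetical original and migrated implementations under comparison.
def discount_original(price, rate):
    return price * (1.0 - rate)

def discount_migrated(price, rate):
    # Algebraically identical, but may round differently in floating point.
    return price - price * rate

TOLERANCE = 1e-9  # threshold for acceptable numeric drift

@given(
    price=st.floats(min_value=0.0, max_value=1000.0, allow_nan=False),
    rate=st.floats(min_value=0.0, max_value=1.0, allow_nan=False),
)
def test_discount_equivalence(price, rate):
    # Equivalence assertion: outputs must agree within the tolerance.
    assert math.isclose(discount_original(price, rate),
                        discount_migrated(price, rate),
                        abs_tol=TOLERANCE)
```

Hypothesis generates hundreds of inputs per run and shrinks any failure to a minimal counterexample, which is why it complements the exact-match and tolerance-based comparison methods above for numeric code.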
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.