CtrlK
BlogDocsLog inGet started
Tessl Logo

sharaf/llm-learning-system-auditor

Use when the user wants to review, audit, or check safety for an AI memory system, agent learning pipeline, prompt-tuning workflow, skill builder, trace-mining tool, or eval/feedback loop. Produces an evidence-led audit report with learning-loop map, evidence inventory, maturity scorecard, severity-ranked findings, privacy/provenance gaps, counterfactual/eval coverage, and Stabilize/Standardize/Scale roadmap.

100

1.28x
Quality

100%

Does it follow best practices?

Impact

100%

1.28x

Average score across 3 eval scenarios

SecuritybySnyk

Passed

No known issues

Overview
Quality
Evals
Security
Files

README.md

LLM Learning System Auditor

Use this Tessl skill to audit systems that learn from LLM session history, tool-use traces, feedback, evals, or production failures. It focuses on whether raw experience can safely become durable memories, rules, prompts, skills, evals, or policy changes.

When To Use

Use when reviewing an agent memory system, trace store, skill builder, rule learner, prompt optimizer, eval pipeline, session-history mining workflow, or agent observability stack. It is also suited to symptoms such as stale memories, bad learned rules, unsafe skill activation, weak evals, missing provenance, cost spikes, context bloat, unreviewed prompt changes, or unclear rollback.

Install

tessl install sharaf/llm-learning-system-auditor

Organization

PathPurpose
skills/llm-learning-system-auditor/SKILL.mdActivation, first actions, workflow, maturity scoring, finding contract
skills/llm-learning-system-auditor/references/evidence-inventory.mdEvidence table and inventory rules
skills/llm-learning-system-auditor/references/audit-domains.mdDomain-by-domain audit checks
skills/llm-learning-system-auditor/references/generated-skill-checks.mdGenerated executable skill and registry rollout checks
skills/llm-learning-system-auditor/references/findings-and-roadmap.mdSeverity classification and roadmap sequencing
skills/llm-learning-system-auditor/references/report-template-and-guardrails.mdFinal report template, guardrails, and success criteria

Output

The skill produces an evidence-led audit with a learning-loop brief, evidence inventory, maturity scorecard, severity-ranked findings, domain assessment, privacy/provenance notes, counterfactual coverage, observability notes, failure mode review, prioritized roadmap, and open questions.

Eval Results (v0.2.20)

Full-tile verification run: 019e5647-40cf-77ef-83aa-9a32d780c092.

ScenarioBaselineWith Skill
MemoryHarvester rule and memory audit74%100%
SkillForge generated skill audit83%100%
PromptTuner optimization audit78%100%

With-context average: 100%.

Registry search reports aggregate 100%, Quality 100%, Impact 100%, and 3 eval scenarios.

README.md

tile.json