CtrlK
BlogDocsLog inGet started
Tessl Logo

autolab-reporter

Operate the local Trackio reporter for Autolab HF Jobs. Use when a reporter or planner needs to inspect scores, active jobs, worker anomalies, duplicate launches, or the overall experiment board.

70

Quality

88%

Does it follow best practices?

Impact

No eval scenarios have been run

SecuritybySnyk

Advisory

Suggest reviewing before use

SKILL.md
Quality
Evals
Security

Evaluation results

94%

66%

Experiment Fleet Status Script

Fleet-check shell script workflow

Criteria
Without context
With context

Credential sourcing

0%

100%

Sync command present

0%

100%

Summary command present

0%

100%

uv run invocation

0%

100%

Correct project flag

0%

50%

max-jobs flag value

0%

90%

Workflow order

70%

100%

No hardcoded credentials

100%

100%

README produced

100%

100%

Executable script

100%

100%

100%

92%

Continuous Experiment Monitoring Setup

Continuous monitoring and dashboard setup

Criteria
Without context
With context

Credential sourcing

0%

100%

Watch mode flag

0%

100%

Interval value

0%

100%

Dashboard command

0%

100%

MCP server flag

0%

100%

No-footer flag

0%

100%

uv run invocation

0%

100%

Correct project flag

0%

100%

Duplicate job monitoring

0%

100%

Capacity assessment guidance

0%

100%

Notes file produced

100%

100%

56%

-8%

Experiment Fleet Anomaly Report

Fleet anomaly detection and launch decision

Criteria
Without context
With context

Duplicate job detection

25%

25%

Duplicate dropout jobs

20%

10%

Prepare job with experiment label

91%

41%

Prepare jobs waste identified

25%

12%

Leaderboard beats promoted master

100%

100%

Experiment vs non-experiment breakdown

100%

100%

Reporter as source of truth

75%

75%

Fix-before-launch guardrail

91%

100%

Duplicates not real parallelism

20%

20%

Specific job IDs cited

90%

90%

Repository
huggingface/context-course
Evaluated
Agent
Claude Code
Model
Claude Sonnet 4.6

Table of Contents

Is this your skill?

If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.