autolab-reporter

Operate the local Trackio reporter for Autolab HF Jobs. Use when a reporter or planner needs to inspect scores, active jobs, worker anomalies, duplicate launches, or the overall experiment board.

Quality

88%

Does it follow best practices?

Impact

—

No eval scenarios have been run

Security

Advisory

Suggest reviewing before use

Evaluation results

94%

66%

Experiment Fleet Status Script

Fleet-check shell script workflow

Criteria

Without context

With context

Credential sourcing

100%

Sync command present

100%

Summary command present

100%

uv run invocation

100%

Correct project flag

50%

max-jobs flag value

90%

Workflow order

70%

100%

No hardcoded credentials

100%

README produced

100%

Executable script

100%

92%

Continuous Experiment Monitoring Setup

Continuous monitoring and dashboard setup

Criteria

Without context

With context

Credential sourcing

100%

Watch mode flag

100%

Interval value

100%

Dashboard command

100%

MCP server flag

100%

No-footer flag

100%

uv run invocation

100%

Correct project flag

100%

Duplicate job monitoring

100%

Capacity assessment guidance

100%

Notes file produced

100%

56%

-8%

Experiment Fleet Anomaly Report

Fleet anomaly detection and launch decision

Criteria

Without context

With context

Duplicate job detection

25%

Duplicate dropout jobs

20%

10%

Prepare job with experiment label

91%

41%

Prepare jobs waste identified

25%

12%

Leaderboard beats promoted master

100%

Experiment vs non-experiment breakdown

100%

Reporter as source of truth

75%

Fix-before-launch guardrail

91%

100%

Duplicates not real parallelism

20%

Specific job IDs cited

90%

Repository: huggingface/context-course
Commit: 0448a7c

Evaluated: 26 days ago
Agent: Claude Code
Model: Claude Sonnet 4.6
