
streaming-inference-setup

Streaming Inference Setup: auto-activating skill for ML Deployment. Triggers on: streaming inference setup. Part of the ML Deployment skill category.

36

1.02x

Quality: 3% (Does it follow best practices?)

Impact: 97% (1.02x), average score across 3 eval scenarios

Security by Snyk: Passed, no known issues

Optimize this skill with Tessl

npx tessl skill review --optimize ./planned-skills/generated/08-ml-deployment/streaming-inference-setup/SKILL.md

Evaluation results

Real-Time Text Generation Service

Production-ready streaming server

Score with context: 100%

| Criteria | Without context | With context |
|---|---|---|
| Streaming protocol | 100% | 100% |
| Error handling | 100% | 100% |
| Request validation | 100% | 100% |
| Logging | 100% | 100% |
| Health check endpoint | 100% | 100% |
| Model loaded at startup | 100% | 100% |
| Externalized configuration | 100% | 100% |
| Correct streaming headers | 100% | 100% |
| Generation limits | 100% | 100% |
| Step-by-step README | 100% | 100% |

Without context: $0.4282 · 1m 52s · 21 turns · 22 in / 6,782 out tokens

With context: $0.5238 · 2m 29s · 27 turns · 26 in / 8,452 out tokens

Visibility into Production Inference Service

Monitoring and observability setup

Score with context: 94% (+7% versus without context)

| Criteria | Without context | With context |
|---|---|---|
| Latency metric | 100% | 100% |
| Throughput metric | 100% | 100% |
| Error rate metric | 100% | 100% |
| Standard metrics format | 100% | 100% |
| Structured logging | 50% | 80% |
| Health or readiness endpoint | 100% | 100% |
| Alert rules defined | 100% | 100% |
| Response validation | 70% | 60% |
| Resource metrics | 50% | 100% |
| Numbered setup steps | 100% | 100% |

Without context: $0.7110 · 3m 27s · 28 turns · 28 in / 13,083 out tokens

With context: $0.7965 · 3m 29s · 30 turns · 29 in / 13,597 out tokens

Reusable MLOps Pipeline Template for Streaming Inference

End-to-end MLOps pipeline coverage

Score with context: 99% (-1% versus without context)

| Criteria | Without context | With context |
|---|---|---|
| Model serving stage | 100% | 100% |
| MLOps pipeline stages | 100% | 100% |
| Monitoring component | 100% | 100% |
| Production optimization | 100% | 100% |
| Validation step | 100% | 100% |
| Model versioning | 100% | 100% |
| Rollback or safe deployment | 100% | 100% |
| Production-ready config | 100% | 90% |
| Numbered pipeline guide | 100% | 100% |
| All four domains covered | 100% | 100% |

Without context: $1.4772 · 6m 15s · 45 turns · 44 in / 27,022 out tokens

With context: $1.4624 · 6m 28s · 43 turns · 43 in / 26,961 out tokens
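
The headline figures (97% impact, 1.02x) appear to follow from the per-criterion scores in the three scenarios above. The sketch below reproduces that arithmetic under two assumptions that the page does not state: every criterion is weighted equally within a scenario, and the multiplier is the ratio of the with-context average to the without-context average. The variable and function names are illustrative only.

```python
from statistics import mean

# Per-criterion scores (%) copied from the three scenarios: (without, with).
scenarios = {
    "Real-Time Text Generation Service": [(100, 100)] * 10,
    "Visibility into Production Inference Service": [
        (100, 100), (100, 100), (100, 100), (100, 100), (50, 80),
        (100, 100), (100, 100), (70, 60), (50, 100), (100, 100),
    ],
    "Reusable MLOps Pipeline Template for Streaming Inference":
        [(100, 100)] * 7 + [(100, 90)] + [(100, 100)] * 2,
}

def scenario_means(rows):
    """Return (without-context mean, with-context mean), assuming equal weights."""
    without = mean(w for w, _ in rows)
    with_ctx = mean(c for _, c in rows)
    return without, with_ctx

per_scenario = {name: scenario_means(rows) for name, rows in scenarios.items()}

# Average the scenario-level scores, then take the ratio (assumed definition).
avg_without = mean(w for w, _ in per_scenario.values())
avg_with = mean(c for _, c in per_scenario.values())
multiplier = avg_with / avg_without

for name, (without, with_ctx) in per_scenario.items():
    print(f"{name}: {without:.0f}% -> {with_ctx:.0f}% ({with_ctx - without:+.0f}%)")
print(f"average with context: {avg_with:.1f}%, multiplier: {multiplier:.2f}x")
```

Under these assumptions the per-scenario results match the page (100%, 94% with a 7-point gain, 99% with a 1-point loss), and the multiplier rounds to 1.02x. The with-context average works out to roughly 97.7%, so the displayed 97% is consistent only if the page truncates rather than rounds, which is a guess.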

Repository: jeremylongshore/claude-code-plugins-plus-skills

Evaluated
Agent: Claude Code
Model: Claude Sonnet 4.6
