infrastructure-maintainer

Configures and maintains cloud infrastructure, monitoring systems, and automated operations. Sets up Prometheus/Grafana alerting, writes Terraform IaC for AWS/GCP/Azure resources, manages auto-scaling groups, load balancers, VPCs, and databases, and implements encrypted backup pipelines with S3 offload. Troubleshoots deployment failures, optimizes resource right-sizing, enforces security hardening (SOC2/ISO27001), and produces capacity planning and cost analysis reports. Use when the user asks about server setup, infrastructure provisioning, monitoring and alerting configuration, Terraform or CloudFormation, CI/CD pipeline issues, Kubernetes or container orchestration, cloud cost optimization, backup and disaster recovery, or database performance and scaling.

Quality: 100% (Does it follow best practices?)
Impact: 98%, 1.42x (average score across 3 eval scenarios)
Security by Snyk: Passed, no known issues
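
The 98% impact figure is described as an average across 3 eval scenarios. The arithmetic can be checked with a minimal sketch, assuming the first percentage listed for each scenario under Evaluation results (100%, 100%, 95%) is that scenario's with-context score; the variable names are illustrative only.

```python
# Per-scenario scores as listed under "Evaluation results".
# Assumption: these are the with-context scores for the three scenarios.
scenario_scores = [100, 100, 95]

average = sum(scenario_scores) / len(scenario_scores)
print(f"{average:.2f}% -> reported as {round(average)}%")  # 98.33% -> reported as 98%
```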

Evaluation results

100%

32%

Automated Database and Filesystem Backup Pipeline

Encrypted backup pipeline

| Criteria | Without context | With context |
|---|---|---|
| GPG AES256 cipher | 100% | 100% |
| SHA512 key derivation | 0% | 100% |
| pg_dump for database | 100% | 100% |
| tar for filesystem | 100% | 100% |
| S3 STANDARD_IA storage class | 0% | 100% |
| Integrity verification | 100% | 100% |
| 30-day local retention | 0% | 100% |
| Slack webhook notification | 100% | 100% |
| Secrets via env vars | 100% | 100% |
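
Several of the criteria above map onto concrete command-line options. The following is a rough sketch of what a pipeline meeting the cipher, key-derivation, storage-class, and retention criteria might look like. It is not taken from the skill itself: the function names, the choice to pipe the passphrase on stdin, and the file names are assumptions made for illustration. The commands are only built as argument lists here, not executed.

```python
from __future__ import annotations

import time
from pathlib import Path

RETENTION_DAYS = 30  # from the "30-day local retention" criterion


def gpg_encrypt_cmd(src: str, dest: str) -> list[str]:
    """Build a symmetric gpg command using AES256 and a SHA512 S2K digest."""
    return [
        "gpg", "--batch", "--yes",
        "--symmetric",
        "--cipher-algo", "AES256",
        "--s2k-digest-algo", "SHA512",
        "--pinentry-mode", "loopback",
        "--passphrase-fd", "0",  # assumption: passphrase is read from an env var and piped on stdin
        "--output", dest,
        src,
    ]


def s3_upload_cmd(path: str, bucket: str, key: str) -> list[str]:
    """Build an AWS CLI upload command that requests the STANDARD_IA storage class."""
    return [
        "aws", "s3", "cp", path, f"s3://{bucket}/{key}",
        "--storage-class", "STANDARD_IA",
    ]


def expired_backups(directory: Path, now: float | None = None) -> list[Path]:
    """Return files in `directory` last modified more than RETENTION_DAYS ago."""
    cutoff = (time.time() if now is None else now) - RETENTION_DAYS * 86400
    return sorted(
        p for p in directory.iterdir()
        if p.is_file() and p.stat().st_mtime < cutoff
    )
```

A caller would pass each list to `subprocess.run` and delete whatever `expired_backups` returns once the upload and integrity check have succeeded.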

100%

12%

Monitoring and Alerting Setup for a Production Application Stack

Prometheus monitoring config

| Criteria | Without context | With context |
|---|---|---|
| Node exporter scrape job | 100% | 100% |
| App scrape job on :8080 | 100% | 100% |
| PostgreSQL exporter scrape job | 100% | 100% |
| Alertmanager endpoint | 100% | 100% |
| CPU alert threshold >80% | 100% | 100% |
| Memory alert threshold >90% | 0% | 100% |
| Disk alert threshold >85% | 100% | 100% |
| Service-down alert | 100% | 100% |
| Evaluation interval configured | 100% | 100% |
| Alert rules file referenced | 100% | 100% |
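
The three threshold criteria above (CPU >80%, memory >90%, disk >85%) amount to a strict greater-than comparison per resource. The sketch below illustrates only that comparison in plain Python. It is not Prometheus rule syntax, and the `THRESHOLDS` mapping and `firing_alerts` function are hypothetical names introduced here.

```python
from __future__ import annotations

# Usage percentages above which an alert should fire, per the criteria table.
THRESHOLDS = {"cpu": 80.0, "memory": 90.0, "disk": 85.0}


def firing_alerts(usage: dict[str, float]) -> list[str]:
    """Return the resources whose usage percentage strictly exceeds its threshold."""
    return [
        name for name, limit in THRESHOLDS.items()
        if usage.get(name, 0.0) > limit
    ]


# Memory at exactly 90% does not fire because the comparison is strict.
print(firing_alerts({"cpu": 85.0, "memory": 90.0, "disk": 10.0}))
```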

95%

42%

Infrastructure Change Runbook for Application Server Scaling

Infrastructure change workflow

| Criteria | Without context | With context |
|---|---|---|
| Pre-change state capture | 37% | 100% |
| CloudWatch baseline metrics | 0% | 100% |
| terraform fmt and validate | 0% | 100% |
| terraform plan with output file | 100% | 100% |
| Staging before production | 100% | 100% |
| Staging health check | 62% | 100% |
| ELB target health validation | 100% | 100% |
| Prometheus service-up check | 0% | 100% |
| ASG launch template rollback | 20% | 100% |
| Instance refresh MinHealthyPercentage:90 | 100% | 100% |
| Security group/IAM review step | 100% | 100% |
| Rollback documented before production | 11% | 44% |
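
Some of the workflow criteria above correspond to specific Terraform and AWS CLI invocations. As a hedged sketch, the function below lists one plausible ordering of those commands for a single environment. The ordering, the inclusion of `terraform apply`, the default plan file name, and the function name `change_commands` are all assumptions made for illustration, not the skill's documented runbook.

```python
from __future__ import annotations

import json


def change_commands(
    asg_name: str,
    target_group_arn: str,
    plan_file: str = "change.tfplan",  # hypothetical file name
) -> list[list[str]]:
    """Return an ordered list of CLI invocations for one environment."""
    return [
        ["terraform", "fmt", "-check"],
        ["terraform", "validate"],
        ["terraform", "plan", f"-out={plan_file}"],
        ["terraform", "apply", plan_file],  # assumption: apply the saved plan
        [
            "aws", "autoscaling", "start-instance-refresh",
            "--auto-scaling-group-name", asg_name,
            "--preferences", json.dumps({"MinHealthyPercentage": 90}),
        ],
        [
            "aws", "elbv2", "describe-target-health",
            "--target-group-arn", target_group_arn,
        ],
    ]
```

Per the "Staging before production" criterion, such a sequence would be run against staging and checked for health first, then repeated for production.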

Repository: OpenRoster-ai/awesome-agents
Evaluated
Agent: Claude Code
Model: Claude Sonnet 4.6
