Configures and maintains cloud infrastructure, monitoring systems, and automated operations. Sets up Prometheus/Grafana alerting, writes Terraform IaC for AWS/GCP/Azure resources, manages auto-scaling groups, load balancers, VPCs, and databases, and implements encrypted backup pipelines with S3 offload. Troubleshoots deployment failures, optimizes resource right-sizing, enforces security hardening (SOC2/ISO27001), and produces capacity planning and cost analysis reports. Use when the user asks about server setup, infrastructure provisioning, monitoring and alerting configuration, Terraform or CloudFormation, CI/CD pipeline issues, Kubernetes or container orchestration, cloud cost optimization, backup and disaster recovery, or database performance and scaling.
98
100%
Does it follow best practices?
Impact
98%
1.42xAverage score across 3 eval scenarios
Passed
No known issues
Encrypted backup pipeline
GPG AES256 cipher
100%
100%
SHA512 key derivation
0%
100%
pg_dump for database
100%
100%
tar for filesystem
100%
100%
S3 STANDARD_IA storage class
0%
100%
Integrity verification
100%
100%
30-day local retention
0%
100%
Slack webhook notification
100%
100%
Secrets via env vars
100%
100%
Prometheus monitoring config
Node exporter scrape job
100%
100%
App scrape job on :8080
100%
100%
PostgreSQL exporter scrape job
100%
100%
Alertmanager endpoint
100%
100%
CPU alert threshold >80%
100%
100%
Memory alert threshold >90%
0%
100%
Disk alert threshold >85%
100%
100%
Service-down alert
100%
100%
Evaluation interval configured
100%
100%
Alert rules file referenced
100%
100%
Infrastructure change workflow
Pre-change state capture
37%
100%
CloudWatch baseline metrics
0%
100%
terraform fmt and validate
0%
100%
terraform plan with output file
100%
100%
Staging before production
100%
100%
Staging health check
62%
100%
ELB target health validation
100%
100%
Prometheus service-up check
0%
100%
ASG launch template rollback
20%
100%
Instance refresh MinHealthyPercentage:90
100%
100%
Security group/IAM review step
100%
100%
Rollback documented before production
11%
44%
010799b
Table of Contents
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.