Production-grade platform engineering handbook — Kubernetes, Terraform, Flux CD, GitHub Actions, AWS, and more.
67
84%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Status: Stable
Baseline manifests for Kubernetes platform patterns. Apply across EKS, AKS, GKE, and vanilla clusters. Each file is a self-contained, production-hardened example you can copy directly into your GitOps repo.
| File | What it shows | Key patterns |
|---|---|---|
| namespace-baseline.yaml | Namespace with ownership labels and pod security enforcement | pod-security.kubernetes.io/enforce: restricted, team/env labels |
| deployment-baseline.yaml | Deployment with resource limits, probes, and locked-down security context | runAsNonRoot, readOnlyRootFilesystem, capabilities.drop: ALL, no CPU limit |
| network-policy-default-deny.yaml | Default-deny ingress + example allow rule for ingress controller | Applied before workload; explicit allow for ingress controller namespace |
| pod-disruption-budget.yaml | PDB protecting minimum availability during node drain | minAvailable: 1 — prevents simultaneous eviction |
# 1. Apply namespace first — sets pod security admission before workloads land
kubectl apply -f namespace-baseline.yaml
# 2. Deploy the baseline workload
kubectl apply -f deployment-baseline.yaml
# 3. Lock down network — default deny then explicit allow
kubectl apply -f network-policy-default-deny.yaml
# 4. Protect availability during drain / rolling update
kubectl apply -f pod-disruption-budget.yaml
# Verify security context is in effect
kubectl get pod -n app-team -o jsonpath='{.items[0].spec.containers[0].securityContext}'securityContext:
runAsNonRoot: true
runAsUser: 1000
allowPrivilegeEscalation: false
readOnlyRootFilesystem: true
capabilities:
drop: ["ALL"]resources:
requests:
cpu: "100m"
memory: "128Mi"
limits:
memory: "256Mi" # CPU limit intentionally omittedlivenessProbe:
httpGet:
path: /healthz
port: 8080
initialDelaySeconds: 10
periodSeconds: 10
readinessProbe:
httpGet:
path: /ready
port: 8080
initialDelaySeconds: 5
periodSeconds: 5team, env)restricted mode)default service accountrunAsNonRoot, allowPrivilegeEscalation: false, capabilities.drop: ALLreadOnlyRootFilesystem: true (add emptyDir mounts for writable paths)replicas >= 2/platform-skills:review — production-readiness review of any manifest/platform-skills:debug — structured diagnosis for Kubernetes issues.claude-plugin
.github
commands
docs
examples
agent-self-improve
argocd
awesome-docs
aws
cloudfront
functions
lambda-edge
functions
azure
compliance
conventional-commits
datadog
llm-observability
demo
documentation
dora
dynatrace
fluxcd
github-actions
composite-actions
configure-cloud
db-migrate
docker-build-push
k8s-deploy
notify-slack
pr-comment
release-tag
security-scan
setup-env
setup-terraform
terraform-plan
helm
web-service
templates
kubernetes
kyverno
mcp
observability
openshift
pr-review
ownership
runtime-security
supply-chain
terraform
references
scripts
skills
platform-skills
tests