Production-grade platform engineering handbook — Kubernetes, Terraform, Flux CD, GitHub Actions, AWS, and more.
67
84%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Author: @dave (human reviewer)
PR: Adds a new Deployment for the notifications service
This Deployment has no PodDisruptionBudget. During a node drain, all replicas could be evicted simultaneously. Worth adding in a follow-up to protect against that.
Reason: The suggestion is valid but explicitly marked as "follow-up" by the reviewer.
Adding a PDB requires knowing the correct minAvailable value, which requires load testing
this new service first. Out of scope for an initial rollout PR.
Agreed — a PDB is the right call once we have a baseline for this service.
Adding it now would require picking a
minAvailablevalue blind, which risks either breaking rolling deploys (too high) or providing no real protection (too low). The plan is to run load tests in staging during the first week, then add the PDB in a follow-up PR with measured values.Tracked in issue #112.
ℹ️ Thread resolved — no code change needed.
.claude-plugin
.github
commands
docs
examples
agent-self-improve
argocd
awesome-docs
aws
cloudfront
functions
lambda-edge
functions
azure
compliance
conventional-commits
datadog
llm-observability
demo
documentation
dora
dynatrace
fluxcd
github-actions
composite-actions
configure-cloud
db-migrate
docker-build-push
k8s-deploy
notify-slack
pr-comment
release-tag
security-scan
setup-env
setup-terraform
terraform-plan
helm
web-service
templates
kubernetes
kyverno
mcp
observability
openshift
pr-review
ownership
runtime-security
supply-chain
terraform
references
scripts
skills
platform-skills
tests