pantheon-ai/cfn-template-compare

Compares deployed CloudFormation templates with locally synthesized CDK templates to detect drift, validate changes, and ensure consistency before deployment. Use when the user wants to compare CDK output with a deployed stack, check for infrastructure drift, run a pre-deployment validation, audit IAM or security changes, investigate a failing deployment, or perform a 'cdk diff'-style review. Triggered by phrases like 'compare templates', 'check for drift', 'cfn drift', 'stack comparison', 'infrastructure drift detection', 'safe to deploy', or 'what changed in my CDK stack'.

Overall
score

95%

Review — 93%

Does it follow best practices?

Validation — 11 / 11 Passed

Validation for skill structure

name:: cfn-template-compare
description:: Compares deployed CloudFormation templates with locally synthesized CDK templates to detect drift, validate changes, and ensure consistency before deployment. Use when the user wants to compare CDK output with a deployed stack, check for infrastructure drift, run a pre-deployment validation, audit IAM or security changes, investigate a failing deployment, or perform a 'cdk diff'-style review. Triggered by phrases like 'compare templates', 'check for drift', 'cfn drift', 'stack comparison', 'infrastructure drift detection', 'safe to deploy', or 'what changed in my CDK stack'.

CloudFormation Template Comparison Skill

Name: pantheon-ai/cfn-template-compare
Author: pantheon-ai

Quick Start

# 1. Retrieve the deployed template
aws cloudformation get-template \
  --stack-name <stack-name> \
  --region <region> \
  --profile <profile> \
  --query TemplateBody \
  --output json > deployed.json

# 2. Synthesize the local CDK template
make synth
cp cdk.out/<stack-name>.template.json local.json

# 3. Compare structure
jq 'keys' deployed.json
jq 'keys' local.json

# 4. Compare resource counts
jq '.Resources | length' deployed.json
jq '.Resources | length' local.json

# 5. Find added/removed resource IDs
diff <(jq -r '.Resources | keys[]' deployed.json | sort) \
     <(jq -r '.Resources | keys[]' local.json | sort)

# 6. Deep diff of a specific resource
diff <(jq '.Resources.<ResourceId>' deployed.json) \
     <(jq '.Resources.<ResourceId>' local.json)

Expected Workflow

Step 1: Preparation — verify prerequisites

# Check AWS credentials
aws sts get-caller-identity --profile <profile>
# → If this fails: verify AWS_PROFILE or --profile value before proceeding

# Confirm stack exists
aws cloudformation describe-stacks \
  --stack-name <stack-name> --region <region> --profile <profile> \
  --query 'Stacks[0].StackStatus'
# → If StackNotFoundException: check stack name and region

# Confirm CDK project synthesises cleanly
make synth
# → If synth fails: fix missing env vars in env-local.mk / env.mk before proceeding

Step 2: Retrieval

# Deployed template
aws cloudformation get-template \
  --stack-name <stack-name> --region <region> --profile <profile> \
  --query TemplateBody --output json > deployed.json
# → Validate: jq '.' deployed.json >/dev/null || echo "ERROR: invalid JSON"

# Local template
cp cdk.out/<stack-name>.template.json local.json
# → Validate: jq '.' local.json >/dev/null || echo "ERROR: invalid JSON"

Step 3: Hierarchical Analysis

# 1. Structure (top-level keys)
diff <(jq 'keys' deployed.json) <(jq 'keys' local.json)

# 2. Resource count
echo "Deployed: $(jq '.Resources | length' deployed.json)"
echo "Local:    $(jq '.Resources | length' local.json)"

# 3. Added / removed resources
comm -3 \
  <(jq -r '.Resources | keys[]' deployed.json | sort) \
  <(jq -r '.Resources | keys[]' local.json | sort)

# 4. Security — CDK Nag suppressions
diff \
  <(jq '[.Resources[].Metadata."cdk_nag" // empty]' deployed.json) \
  <(jq '[.Resources[].Metadata."cdk_nag" // empty]' local.json)

# 5. IAM roles and policies
diff \
  <(jq '[.Resources | to_entries[] | select(.value.Type | startswith("AWS::IAM"))]' deployed.json) \
  <(jq '[.Resources | to_entries[] | select(.value.Type | startswith("AWS::IAM"))]' local.json)

Step 4: Risk Assessment

Categorise each difference before reporting:

Category	Examples	Action
Expected	Environmental tags, GitRef, stack-name in resource IDs	Auto-approve
Low risk	Display names, cosmetic metadata	Note and approve
Medium risk	Alarm thresholds, EventBridge schedules, Lambda config	Review and approve
High risk	IAM policies, encryption settings	Require explicit sign-off
Critical	CDK Nag suppressions, public access flags, resource removal	Block — get InfoSec/stakeholder approval

Step 5: Save Artifacts

# Timestamped directory for audit trail
BRANCH=$(git rev-parse --abbrev-ref HEAD)
DIR="cfn-compare-results/$(date +%Y-%m-%d-%H%M%S)_deployed-main_local-${BRANCH}"
mkdir -p "$DIR"

cp deployed.json "$DIR/"
cp local.json    "$DIR/"
jq -r '.Resources | keys[]' deployed.json | sort > "$DIR/deployed-resources.txt"
jq -r '.Resources | keys[]' local.json    | sort > "$DIR/local-resources.txt"

# Write report
cat > "$DIR/comparison-report.md" <<EOF
# CloudFormation Template Comparison

## Summary
- Deployed: <stack-name> ($(jq '.Resources|length' deployed.json) resources)
- Local:    <stack-name> ($(jq '.Resources|length' local.json) resources)
- Status: ✅ Safe to deploy | ⚠️ Review required | ❌ Critical issues

## Differences
<!-- Populate from Step 3 output -->

## Recommendations
<!-- List required actions -->

## Deployment Decision
<!-- Approve | Reject | Conditional — reasoning -->
EOF

Decision Framework

When to Use This Skill

Use CloudFormation template comparison when:

Pre-deployment validation: Verify CDK changes match expectations before deploying to prod
Drift detection: Investigate whether console changes have diverged from IaC
Security audits: Check for unauthorized IAM policy modifications or CDK Nag suppression changes
Deployment troubleshooting: Understand why cdk deploy is failing or showing unexpected diffs
Change review: Provide stakeholders with concrete before/after comparison for approval

When NOT to Use This Skill

Skip template comparison when:

Initial stack deployment: No deployed template exists yet — synthesize and deploy directly
Cross-account comparisons: Different account IDs/ARNs make diffs noisy and unreliable
Frequently changing resources: Dynamic autoscaling groups, ephemeral Lambdas — accept constant drift
CloudFormation managed entirely outside CDK: If stack wasn't created via CDK, comparison won't map correctly

Risk Assessment Strategy

Always categorize diffs by risk level before approving:

Auto-approve (green): Environmental tags, GitRef, timestamps — expected variance
Review (yellow): Config changes (alarms, schedules) — verify intent, then approve
Block (red): IAM policies, CDK Nag suppressions, resource deletions — require explicit sign-off

Anti-Patterns

NEVER compare templates without verifying both sources are valid JSON first

WHY: invalid JSON from AWS CLI or CDK synth causes cryptic jq errors that waste time debugging.
BAD: jq '.Resources' deployed.json → parse error: Expected separator between values at line 1, column 3.
GOOD: jq '.' deployed.json >/dev/null && echo "valid" || echo "INVALID" before any comparison.

NEVER rely on line-by-line diff for large templates

WHY: 5000+ line diffs are unreadable and hide critical changes in noise.
BAD: diff deployed.json local.json → terminal flooded with irrelevant formatting differences.
GOOD: hierarchical comparison (Step 3) — resource counts, added/removed IDs, then targeted deep diffs per resource.

NEVER approve deployments with unexplained IAM policy changes

WHY: unauthorized privilege escalation or resource exposure can occur through subtle IAM modifications.
BAD: diff shows IAM role trust policy changed → "looks fine, deploying" → security breach.
GOOD: extract IAM diff specifically (jq filter for AWS::IAM::*), document justification, get InfoSec approval before deploy.

NEVER skip saving comparison artifacts before deployment

WHY: if deployment goes wrong, you lose the evidence of what changed and can't rollback confidently.
BAD: run comparison in terminal, approve deploy, stack fails → no record of what was attempted.
GOOD: timestamped directory with deployed.json, local.json, diff report — audit trail for incident investigation.

Error Recovery

Error	Cause	Fix
`Stack not found`	Wrong name/region	Verify `--stack-name`, `--region`, `--profile`
`CDK synth failed`	Missing env var	Check `env-local.mk` and `env.mk`
`jq: parse error`	Invalid JSON from CLI	Use `--output json` and `--query TemplateBody`
`Diff > 5000 lines`	Template too large	Switch to hierarchical comparison (Step 3) instead of line diff

Required Tools

aws CLI — configured with appropriate profile
jq — JSON query and transformation
make — CDK synthesis via make synth
bash — shell scripting
diff / comm — comparison utilities

Common Scenarios

Clean deployment: identical resource counts, only expected environmental differences → approve
Drift detected: deployed threshold differs from local → revert console change or update CDK
New CDK Nag suppression: requires documented justification and InfoSec approval
Resource removal: block deployment, data-loss risk — review with stakeholders first

References

Automated script: scripts/compare-cfn-templates.sh
Real-world examples: references/compare-cfn-templates.md
CI/CD integration: .gitlab-ci.yml validate-template stage

Install with Tessl CLI

npx tessl i pantheon-ai/cfn-template-compare@0.2.0

evals

references

scripts

SKILL.md

tile.json