Scan a project to identify Kafka applications, extract schemas from data models, tag PII fields, generate Terraform for Confluent Schema Registry registration, and produce a migration report with rollout ordering. Use this skill when a user asks to analyze a folder or repo for Kafka usage, extract schemas, audit producer/consumer configurations, or generate Terraform for Schema Registry.
68
81%
Does it follow best practices?
Impact
—
No eval scenarios have been run
Passed
No known issues
Scan a project to identify Kafka applications, extract schemas, generate Terraform for Schema Registry registration, and produce a comprehensive analysis report.
Invoke this skill when:
This skill produces 3 outputs in the target project:
schema-report.md — Full analysis report with findings, risks, and upgrade recommendationsschemas/ — Extracted schema files (Avro, JSON Schema, Protobuf) with PII taggingterraform/ — Terraform configs using Confluent provider to register schemasIf the user asks for their application code to be updated to integrate Schema Registry, use the Code Migration Reference to update the code with proper Schema Registry integration patterns.
schema.yaml and schemas/ directory manuallypom.xml, build.gradle, requirements.txt, package.json, etc.spring-kafka, confluent-kafka, kafkajs, etc.KafkaTemplate, Producer(, producer.send, etc.value.serializer, KafkaAvroSerializer, custom serializersDetailed patterns: Detection Patterns Reference
App catalog structure:
app_name: module name
language: Java | Python | .NET | Go | Node/TS
role: producer | consumer | both
topics: [list of topics]
serializer_class: value.serializer used
custom_serializer: true | false
schema_format: AVRO | JSON | PROTOBUF | UNKNOWN
sr_integrated: true | false
category: A | B | C | D | E # REQUIREDMulti-schema topic detection:
oneOf/union/oneofschema_reference blocksSearch for:
auto.register.schemas=true — Uncontrolled schema evolution (Category C)use.latest.version — Eases migration when setRecord file path, line number, and affected topics for each occurrence.
Patterns: Detection Patterns Reference
For each producer:
**/*.avsc, **/*.proto, **/*.schema.jsonemail, ssn, phone, address, etc.PII tagging: Add confluent:tags (PII, PRIVATE, SENSITIVE, PHI) to detected fields.
Detailed inference patterns: Schema Inference Reference
Classify each producer:
| Category | Criteria |
|---|---|
| A: Compliant | Confluent serializer + SR + no auto.register |
| A→Header | Already on SR, migrating to headers |
| B: Schema in code, no SR | Data models exist, but no SR integration |
| C: Auto-register | auto.register.schemas=true |
| D: No schema | Raw strings/bytes, no data model |
| E: Custom serializer | Custom Serializer<T> or inline serialization without SR |
CRITICAL: Use exact phrase "Category X" in:
Details: Categorization Reference
Directory structure:
schemas/
├── avro/
│ └── {topic}-value.avsc
├── json/
│ └── {topic}-value.json
└── proto/
└── {topic}-value.protoFile naming: MUST use kebab-case (lowercase with hyphens):
{topic}-value.{ext}{topic}-key.{ext}order-events-value.avsc, user-notifications-value.jsonInitialize: Create schema.yaml.
Validate: Call schema_lint(path: schemas/, fix: true) if available.
File structure (MANDATORY separate files):
terraform/
├── providers.tf # Provider config
├── variables.tf # Variable definitions
├── tags.tf # confluent_tag resources (if PII exists)
├── schemas.tf # Active schemas (A, B, E)
├── flagged-auto-register.tf # Category C only (commented out)
├── outputs.tf # Output values
└── import.sh # Import scriptCRITICAL:
schemas.tf = Categories A, B, E — NOT commented outflagged-auto-register.tf = Category C ONLY — MUST be commented outtags.tf = MUST exist if ANY schema uses confluent:tagsTemplates: Terraform Templates Reference
Create schema-report.md with:
CRITICAL formatting requirements:
Template: Report Template Reference
Details: Categorization Reference
{project_root}/
├── schema-report.md # Analysis report
├── schemas/
│ ├── schema.yaml # Schema project config
│ ├── avro/
│ │ └── {topic}-value.avsc
│ ├── json/
│ │ └── {topic}-value.json
│ └── proto/
│ └── {topic}-value.proto
└── terraform/
├── providers.tf
├── variables.tf
├── tags.tf # PII/PRIVATE/SENSITIVE tags
├── schemas.tf # Active schemas (depends_on tags)
├── flagged-auto-register.tf # Commented-out Category C
├── outputs.tf
└── import.sh # Import existing schemasNo need to use Agent tool — this skill is self-contained and uses direct tool calls.
8b85616
If you maintain this skill, you can claim it as your own. Once claimed, you can manage eval scenarios, bundle related skills, attach documentation or rules, and ensure cross-agent compatibility.