tessl/npm-pulumi--aws

A Pulumi package for creating and managing Amazon Web Services (AWS) cloud resources with infrastructure-as-code.

AWS Glue (Services Overview)

AWS Glue is a serverless data integration service for ETL workloads.

Glue Services

Glue Data Catalog

A centralized metadata repository that holds database and table definitions for your data.

See the Glue documentation for details.

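import * as aws from "@pulumi/aws";

// Catalog database that groups related table definitions.
const database = new aws.glue.CatalogDatabase("catalog", {
    name: "data_catalog",
});

// Table metadata describing data stored under an S3 prefix.
const table = new aws.glue.CatalogTable("table", {
    name: "events",
    databaseName: database.name,
    storageDescriptor: {
        location: "s3://bucket/data/", // placeholder bucket and prefix
        columns: [
            { name: "id", type: "string" },
            { name: "timestamp", type: "timestamp" },
        ],
    },
});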
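When a table has many columns, the `columns` array can be generated from a plain name-to-type map instead of being written out by hand. The following is a minimal sketch in plain TypeScript; `GlueColumn` and `toGlueColumns` are hypothetical names introduced for illustration and are not part of `@pulumi/aws`.

```typescript
// Hypothetical helper (not part of @pulumi/aws): converts a
// { columnName: glueType } map into the { name, type } objects that
// storageDescriptor.columns expects.
interface GlueColumn {
    name: string;
    type: string;
}

function toGlueColumns(schema: { [column: string]: string }): GlueColumn[] {
    // Object.keys keeps insertion order for non-numeric string keys.
    return Object.keys(schema).map((name) => ({ name, type: schema[name] }));
}

// Same two columns as the table definition above.
const eventColumns = toGlueColumns({
    id: "string",
    timestamp: "timestamp",
});
```

The result could then be passed as `storageDescriptor: { location: "s3://bucket/data/", columns: eventColumns }`.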

Glue Crawler

Crawlers scan a data store, infer its schema, and write the resulting table definitions to the Data Catalog.

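// `role` is an aws.iam.Role that Glue can assume; it is not defined in this snippet.
const crawler = new aws.glue.Crawler("crawler", {
    name: "data-crawler",
    role: role.arn,
    databaseName: database.name, // discovered tables are written to this database
    s3Targets: [{ path: "s3://bucket/data/" }], // placeholder bucket and prefix
});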
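The crawler snippet references a `role` that is never defined. A role usable by Glue needs a trust policy that names the Glue service as principal. The sketch below builds that policy document in plain TypeScript; `glueAssumeRolePolicy` and `glueServicePolicyArn` are hypothetical names chosen for this example, and the exact permissions your crawler needs (for instance, read access to the S3 path) are an assumption you should check against your own setup.

```typescript
// Hypothetical helper (not part of @pulumi/aws): returns the IAM trust
// policy that lets the AWS Glue service assume a role.
function glueAssumeRolePolicy(): string {
    return JSON.stringify({
        Version: "2012-10-17",
        Statement: [
            {
                Effect: "Allow",
                Action: "sts:AssumeRole",
                Principal: { Service: "glue.amazonaws.com" },
            },
        ],
    });
}

// AWS-managed policy that grants the baseline permissions Glue uses.
const glueServicePolicyArn =
    "arn:aws:iam::aws:policy/service-role/AWSGlueServiceRole";
```

With `@pulumi/aws`, this would typically be wired up as `new aws.iam.Role("glue-role", { assumeRolePolicy: glueAssumeRolePolicy() })`, followed by an `aws.iam.RolePolicyAttachment` that sets `role: role.name` and `policyArn: glueServicePolicyArn`.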

Glue ETL Jobs

Jobs run ETL scripts, such as a Python script stored in S3, on infrastructure managed by Glue.

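// `role` is an aws.iam.Role that Glue can assume; it is not defined in this snippet.
const job = new aws.glue.Job("etl", {
    name: "transform-data",
    roleArn: role.arn,
    command: {
        scriptLocation: "s3://bucket/scripts/transform.py", // placeholder script path
        pythonVersion: "3",
    },
    glueVersion: "4.0",
});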
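Jobs usually also need runtime settings, which Glue reads from a string map supplied through the job's `defaultArguments` property. The sketch below assembles such a map using a few of Glue's special parameters; `JobArgOptions` and `glueDefaultArguments` are hypothetical names for illustration, and the temporary directory path is a placeholder.

```typescript
// Hypothetical helper (not part of @pulumi/aws): builds the string map
// for a Glue job's `defaultArguments`.
interface JobArgOptions {
    tempDir: string;          // S3 path Glue may use for temporary files
    enableBookmarks: boolean; // whether to track already-processed data
}

function glueDefaultArguments(opts: JobArgOptions): { [key: string]: string } {
    return {
        "--job-language": "python",
        "--TempDir": opts.tempDir,
        "--job-bookmark-option": opts.enableBookmarks
            ? "job-bookmark-enable"
            : "job-bookmark-disable",
    };
}

// Placeholder bucket; replace with a location the job role can write to.
const jobArguments = glueDefaultArguments({
    tempDir: "s3://bucket/tmp/",
    enableBookmarks: true,
});
```

The map could then be added to the job definition as `defaultArguments: jobArguments`.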

Use Cases

  • Data Lake ETL: Transform and catalog data
  • Schema Discovery: Automatic schema inference
  • Data Preparation: Clean and structure data

Install with Tessl CLI

npx tessl i tessl/npm-pulumi--aws
