A Pulumi package for creating and managing Amazon Web Services (AWS) cloud resources with infrastructure-as-code.
AWS Glue is a serverless data integration service for ETL workloads.
The Glue Data Catalog is a centralized metadata repository for databases and tables.
See the AWS Glue documentation for details.
import * as aws from "@pulumi/aws";

// Catalog database that holds the table definitions
const database = new aws.glue.CatalogDatabase("catalog", {
  name: "data_catalog",
});

// Table describing data stored under an S3 prefix
const table = new aws.glue.CatalogTable("table", {
  name: "events",
  databaseName: database.name,
  storageDescriptor: {
    location: "s3://bucket/data/",
    columns: [
      { name: "id", type: "string" },
      { name: "timestamp", type: "timestamp" },
    ],
  },
});

A crawler automatically discovers and catalogs data.
// `role` is assumed to be an IAM role that Glue can assume, defined elsewhere
const crawler = new aws.glue.Crawler("crawler", {
  name: "data-crawler",
  role: role.arn,
  databaseName: database.name,
  s3Targets: [{ path: "s3://bucket/data/" }],
});

A job runs ETL transformations.
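Both the crawler above and the job below reference a `role` that this section does not define. As a minimal sketch, assuming the role only needs to be assumable by the Glue service, the trust policy could be built like this. The helper name `glueAssumeRolePolicy` and the interface names are hypothetical and not part of `@pulumi/aws`.

```typescript
// Sketch only: builds the trust policy a Glue role would need.
// `glueAssumeRolePolicy`, `PolicyStatement` and `PolicyDocument` are
// hypothetical names introduced for illustration.
interface PolicyStatement {
  Effect: "Allow" | "Deny";
  Action: string;
  Principal: { Service: string };
}

interface PolicyDocument {
  Version: string;
  Statement: PolicyStatement[];
}

function glueAssumeRolePolicy(): PolicyDocument {
  return {
    Version: "2012-10-17",
    Statement: [
      {
        Effect: "Allow",
        Action: "sts:AssumeRole",
        // Service principal used by AWS Glue
        Principal: { Service: "glue.amazonaws.com" },
      },
    ],
  };
}

// Serialized form, suitable for an IAM role's trust policy argument
const assumeRolePolicyJson: string = JSON.stringify(glueAssumeRolePolicy());
```

Assuming `@pulumi/aws` is imported as `aws`, the resulting string could be passed as `assumeRolePolicy` to `new aws.iam.Role(...)`. Permissions such as the AWS managed policy `AWSGlueServiceRole` and read access to the S3 bucket would still need to be attached separately. The job definition that follows passes the same role as `roleArn`.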
const job = new aws.glue.Job("etl", {
  name: "transform-data",
  roleArn: role.arn,
  command: {
    scriptLocation: "s3://bucket/scripts/transform.py",
    pythonVersion: "3",
  },
  glueVersion: "4.0",
});

Install with Tessl CLI
npx tessl i tessl/npm-pulumi--aws