A command-line tool to help deploy AWS Glue jobs with ease :)

Project description

glueetl

Install

$ pip install glueetl

How to develop a Glue job

You can develop a Glue job by following the steps below.

1. Set up AWS Credentials and Region

Before you can deploy a Glue job to AWS Glue, you must set up AWS Credentials and Region.

$ vim ~/.aws/credentials

[default]
aws_access_key_id=<AWS_ACCESS_KEY_ID>
aws_secret_access_key=<AWS_SECRET_ACCESS_KEY>
region=<REGION>
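
Before deploying, it can help to check that the profile above is complete. The following is a minimal sketch using only the Python standard library; the helper name missing_keys and the sample values are hypothetical and are not part of glueetl.

```python
import configparser

# Keys shown in the credentials example above.
REQUIRED_KEYS = ("aws_access_key_id", "aws_secret_access_key", "region")


def missing_keys(credentials_text, profile="default"):
    """Return the required keys that are absent or empty in the given profile."""
    # RawConfigParser avoids '%' interpolation, which secret keys may contain.
    parser = configparser.RawConfigParser()
    parser.read_string(credentials_text)
    if not parser.has_section(profile):
        return list(REQUIRED_KEYS)
    section = parser[profile]
    return [key for key in REQUIRED_KEYS if not section.get(key, "").strip()]


# Placeholder values for illustration only.
sample = """
[default]
aws_access_key_id=AKIAEXAMPLE
aws_secret_access_key=secretexample
region=us-east-1
"""

if __name__ == "__main__":
    print(missing_keys(sample))
```

To check a real file, read ~/.aws/credentials into a string and pass it to missing_keys.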

2. Initialize a Glue job

$ mkdir sample
$ cd sample
$ glueetl init
.
├── README.md
├── config.yaml
└── script.py

config.yaml holds the job properties. The following properties are currently supported:

job: 
  name: sample-glue-job
  role_name: AWSGlueServiceRole
  script_location: s3://glue-job-scripts/sample-glue-job/script.py
  max_concurrent_runs: 10
  command_name: glueetl
  max_retries: 0
  timeout: 28800
  max_capacity: 10
  connections:
    - first_connection
    - second_connection
  default_arguments:
    argument1: value1
    argument2: value2
  non_overridable_arguments:
    argument1: value1
    argument2: value2
  trigger:
    name: trigger-sample-glue-job
    schedule: cron(5 * * * ? *)
  tags:
    key1: value1
    key2: value2

Change the default values in config.yaml and write your job logic in script.py.
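
To make the properties above more concrete, here is a sketch of how they could map onto the keyword arguments of boto3's Glue create_job call. This mapping is an assumption for illustration; the source does not show how glueetl translates config.yaml internally, and the function name to_create_job_kwargs is hypothetical. The config is written as a Python dict so the example needs no YAML library.

```python
def to_create_job_kwargs(job):
    """Map the 'job' section of config.yaml to boto3 Glue create_job kwargs.

    Assumed mapping, not glueetl's confirmed behavior.
    """
    kwargs = {
        "Name": job["name"],
        "Role": job["role_name"],
        "Command": {
            "Name": job["command_name"],
            "ScriptLocation": job["script_location"],
        },
        "ExecutionProperty": {"MaxConcurrentRuns": job["max_concurrent_runs"]},
        "MaxRetries": job["max_retries"],
        "Timeout": job["timeout"],
        "MaxCapacity": job["max_capacity"],
    }
    # Optional sections are only included when present in the config.
    if job.get("connections"):
        kwargs["Connections"] = {"Connections": list(job["connections"])}
    if job.get("default_arguments"):
        kwargs["DefaultArguments"] = dict(job["default_arguments"])
    if job.get("non_overridable_arguments"):
        kwargs["NonOverridableArguments"] = dict(job["non_overridable_arguments"])
    if job.get("tags"):
        kwargs["Tags"] = dict(job["tags"])
    return kwargs


# The sample config from above, trimmed to a few properties.
sample_job = {
    "name": "sample-glue-job",
    "role_name": "AWSGlueServiceRole",
    "script_location": "s3://glue-job-scripts/sample-glue-job/script.py",
    "max_concurrent_runs": 10,
    "command_name": "glueetl",
    "max_retries": 0,
    "timeout": 28800,
    "max_capacity": 10,
    "connections": ["first_connection", "second_connection"],
    "tags": {"key1": "value1"},
}

if __name__ == "__main__":
    print(to_create_job_kwargs(sample_job))
```

The trigger section is left out here because a trigger is created through a separate Glue API call rather than as part of the job definition.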

3. Deploy a Glue job

$ cd sample
$ glueetl deploy

Your job will be deployed to AWS Glue.
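
A deploy step presumably needs script.py to be available at script_location. That is an assumption, since the source does not describe what deploy does internally. As a sketch, the S3 URI from config.yaml can be split into the bucket and key that an S3 upload call expects; split_s3_uri is a hypothetical helper, not a glueetl function.

```python
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split 's3://bucket/path/to/key' into (bucket, key)."""
    parsed = urlparse(uri)
    bucket = parsed.netloc
    key = parsed.path.lstrip("/")
    if parsed.scheme != "s3" or not bucket or not key:
        raise ValueError(f"not a valid S3 object URI: {uri!r}")
    return bucket, key


if __name__ == "__main__":
    bucket, key = split_s3_uri("s3://glue-job-scripts/sample-glue-job/script.py")
    # With boto3 installed, the pair could then be passed to an S3 client,
    # e.g. s3_client.upload_file("script.py", bucket, key).
    print(bucket, key)
```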

4. Run a Glue job

You can manually run your Glue job like this.

$ cd sample
$ glueetl run --arg1=value1 --arg2=value2
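
Glue job arguments are conventionally named with a leading "--", so the flags above plausibly end up as a dictionary of job arguments for the run. Whether glueetl forwards them exactly this way is an assumption. The sketch below shows one way to turn such flags into a dictionary; parse_run_args is a hypothetical name.

```python
def parse_run_args(argv):
    """Turn ['--arg1=value1', ...] into {'--arg1': 'value1', ...}."""
    arguments = {}
    for token in argv:
        if not token.startswith("--") or "=" not in token:
            raise ValueError(f"expected --name=value, got {token!r}")
        # Split on the first '=' only, so values may themselves contain '='.
        name, value = token.split("=", 1)
        arguments[name] = value
    return arguments


if __name__ == "__main__":
    print(parse_run_args(["--arg1=value1", "--arg2=value2"]))
```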
