Skip to main content

ML & data helper code!

Project description

ML & data helper code!

Jakub Langr (c) 2021

This is a CLI utility for speeding up basic Data Engineering/Data Science tasks

usage: dt [-h] [-d] [-q] [-v]
          {cai,config,ec2,gcp,gh,hist,load,monitor,nf,py,run,s3,sftp,sg,ssh,td,viz}
          ...

ML & data helper code!

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           full application debug mode
  -q, --quiet           suppress all console output
  -v, --version         show program's version number and exit

sub-commands:
  {cai,config,ec2,gcp,gh,hist,load,monitor,nf,py,run,s3,sftp,sg,ssh,td,viz}
    cai                 Ops with Codex AI from OpenAI.
    config              Operations with config.
    ec2                 AWS EC2 helper commands.
    gcp                 GCP VM Instances helper commands.These are currently
                        in alpha and some may not work
    gh                  All Github related operations
    hist                Operations on CLI hist
    load                Appends defaults: from zshrc.txt to ~/.bashrc or
                        ~/.zshrc or tmux conf
    monitor             Monitor lack of GPU activity. When there is none, runs
                        -job
    nf                  Displays the number of files
    py                  (WIP/Pre-Alpha) Execute a Python command across all
                        files in current dir.
    run                 Operations with snippets
    s3                  Operations with s3 bucket creationlabs-raw-data
    sftp                Operations to easily work with remote servers /
                        devices
    sg                  AWS Security Groups helper functions
    ssh                 Operations on the SSH config
    td                  Manage TODOs using Google Keep
    viz                 Basic viz using streamlit for image comparisons

Usage: dt command [args] [kwargs]

The screen above can be called using:

dt -h

For example, to run to list the running ec2 isntances run

dt ec2 ls --profile <profile>

Installation

$ pip install data-toolkit

Tips and examples

Personally I find most helpful the following commands:

dt ec2 ls

Shows running EC2 instances with a range of parameters depending on the width of your terminal.

dt sg ls

Shows an overview of all security groups in the current region.

dt sg show

Shows detail of either all or one security group in the current region.

dt ssh update

Updates the SSH config with with the IP addresses of the running EC2 instances (matches them by name/tag).

When I'm travelling I like to use the following commands:

dt sg update -n <sg_name>  -m "22:jakub 4g"

Which updates the <sg_name> security group to allow SSH from the current IP address and port 22 and sets the description to "jakub 4g".

Hope this helps!

Development

This project includes a number of helpers in the Makefile to streamline common development tasks.

Environment Setup

The following demonstrates setting up and working with a development environment:

### create a virtualenv for development

$ make virtualenv

$ source env/bin/activate


### run dt cli application

$ dt --help


### run pytest / coverage

$ make test

Project details


Release history Release notifications | RSS feed

This version

1.0.6

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data-toolkit-1.0.6.tar.gz (40.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

data_toolkit-1.0.6-py3-none-any.whl (50.5 kB view details)

Uploaded Python 3

File details

Details for the file data-toolkit-1.0.6.tar.gz.

File metadata

  • Download URL: data-toolkit-1.0.6.tar.gz
  • Upload date:
  • Size: 40.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for data-toolkit-1.0.6.tar.gz
Algorithm Hash digest
SHA256 f62b88a3023090336bf6d6d37c0ee7dd9f41a708389cbdd4a0402d73642d6e73
MD5 e5086089117ccd6c30e70b32ac0e82be
BLAKE2b-256 3179977c6b18c8e5fff6b8f65649a7a94ce7116201fd020f4b0a98b3e206a981

See more details on using hashes here.

File details

Details for the file data_toolkit-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: data_toolkit-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 50.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for data_toolkit-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 8243b24c2e365ebf3f442acf83b65d234cd6aec544fa53f5dc49c56d71d9f58f
MD5 f5915b19164c0f0bc5d2de20a4167923
BLAKE2b-256 2a554763255d0d541d276f1fd49834507d1517f10e15373d8617fb0d4199482e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page