
A selection of tools for easier processing of data using Pandas and AWS

Project description

New Tools

Provides useful libraries for processing large data sets. They are developed by the team at www.dativa.com and published because we find them useful in our own projects.

The key libraries included here are:

  • S3Location

S3Location

A class that parses an S3 location out of the string passed to it. It is a subclass of str, so it supports most string operations.

It also provides the properties .bucket, .key, .path and .prefix, and the method .join().
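The str-subclass pattern described above can be sketched as follows. This is only an illustration under stated assumptions, not the newtools implementation: the class name `S3Path` is hypothetical, it accepts only strings that already start with 's3://', and it shows just two of the properties.

```python
class S3Path(str):
    """Hypothetical stand-in for illustration; not the newtools S3Location class."""

    _SCHEME = "s3://"

    def __new__(cls, s3_str):
        if not s3_str.startswith(cls._SCHEME):
            raise ValueError(f"expected a string starting with {cls._SCHEME!r}: {s3_str!r}")
        # str is immutable, so the value is fixed in __new__ rather than __init__
        return super().__new__(cls, s3_str)

    @property
    def bucket(self):
        # Text between the scheme and the first following slash
        return self[len(self._SCHEME):].split("/", 1)[0]

    @property
    def key(self):
        # Everything after the bucket; empty string when only a bucket is given
        parts = self[len(self._SCHEME):].split("/", 1)
        return parts[1] if len(parts) > 1 else ""


loc = S3Path("s3://bucket/folder/file.txt")
print(loc.bucket)             # bucket
print(loc.key)                # folder/file.txt
print(loc.endswith(".txt"))   # ordinary str methods still work: True
```

Because the object is itself a str, it can be passed to any function that expects a plain string, while the extra properties remain available to code that knows about them.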

  • param s3_str: string representation of an S3 location; accepts the most common formats,
    e.g.:
        - 's3://bucket/folder/file.txt'
        - 'bucket/folder'
        - 'http[s]://s3*.amazonaws.com/bucket-name/'
    Also accepts None if the `bucket` and `key` keywords are used instead.

  • param bucket: ignored if s3_str is not None. Can be given on its own, e.g. bucket='mybucket' gives 's3://mybucket/', or in conjunction with key.
  • param key: ignored if s3_str is not None. Requires bucket to be set, e.g. bucket='mybucket', key='path/to/file' gives 's3://mybucket/path/to/file'.
  • param ignore_double_slash: default False. If True, allows S3 locations containing '//'. These are valid S3 paths, but they typically result from mistaken joins.
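How the parameters above might interact can be sketched with a small standalone function. This is an assumption-laden illustration, not the library's code: the function name `normalize_s3`, the regular expressions, and the choice to raise ValueError on a double slash are all hypothetical, and the 'http[s]://s3*.amazonaws.com/' format is read here as a path-style URL with the bucket as the first path segment.

```python
import re

# Assumed reading of 'http[s]://s3*.amazonaws.com/bucket-name/':
# a path-style URL whose host starts with "s3" and ends with ".amazonaws.com".
_HTTP_PREFIX = re.compile(r"^https?://s3[^/]*\.amazonaws\.com/")
_S3_PREFIX = re.compile(r"^s3://")


def normalize_s3(s3_str=None, bucket=None, key=None, ignore_double_slash=False):
    """Return a canonical 's3://bucket/key' string (hypothetical helper)."""
    if s3_str is None:
        # Keyword form: bucket is required, key is optional
        if bucket is None:
            raise ValueError("bucket must be set when s3_str is None")
        path = bucket + "/" + (key or "")
    else:
        # bucket and key are ignored whenever s3_str is given
        path = _HTTP_PREFIX.sub("", s3_str)
        path = _S3_PREFIX.sub("", path)

    # '//' is legal in S3 but usually signals a mistaken join
    if not ignore_double_slash and "//" in path:
        raise ValueError(f"double slash in S3 location: {path!r}")
    return "s3://" + path


print(normalize_s3("bucket/folder"))                         # s3://bucket/folder
print(normalize_s3(bucket="mybucket", key="path/to/file"))   # s3://mybucket/path/to/file
print(normalize_s3("s3://bucket//file", ignore_double_slash=True))  # s3://bucket//file
```

Checking for '//' only after the scheme has been stripped keeps the 's3://' prefix itself from triggering the check.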


Download files


Source Distribution

newtools-1.0.9.9.tar.gz (4.2 kB)
