CLI to list and fetch objects from versioned S3 buckets. Plus get tmp url.
Project description
Tools to list and fetch objects from versioned AWS S3 bucket:
s3lsvers: List object versions, see versioning
s3getvers: Fetch specified object versions
s3tmpgen: Generate temporary url links to objects
Installation
Using pip:
$ pip install ttr.aws.utils.s3
Using pipx:
$ pipx install ttr.aws.utils.s3
Quick start
We want to fetch versions of feed in bucket mybucket named my/versioned/feed.xml
Configure AWSCLI credentials to allow access to your buckets and objects. E.g. using AWS_DEFAULT_PROFILE. See AWS_config.
create csv file for given feed and time period:
$ s3lsvers -from 2012-05-24T00:15 -to 2012-05-24T01:15 -list-file list.csv mybucket/my/versioned/feed.xml
You shall then find file list.csv on your disk.
Review records in list.csv and delete all lines with version, which are not of your interest.
Using list.csv, ask s3getvers to fetch all versions specified in the file. Be sure to run it on empty directory:
$ s3getvers mybucket list.csv
You will see, how is each version downloaded and saved to your current directory.
Finally, you can try generating temorary url to your feed (showing the latest existing):
$ s3tmpgen 2014-09-30T00:00:00Z mybucket my/versioned/feed.xml https://mybucket.s3.amazonaws.com/my/versioned/feed.xml?Signature=kOCwz%2FkanVWX8O15dlXhy4jrbwY%3D&Expires=1412031600&AWSAccessKeyId=AKIAxyzxyzxyzEQA
Note, that the url does not include VersionId, so it will always point to the most up todate version (in case the key happens to be on versioned bucket).
Provided commands
s3lsvers
List versions of some feed. Could output into CSV file (-list-file) and/or html chart (-html-file).:
$ s3lsvers -h usage: s3lsvers [-h] [-from None] [-to None] [-list-file None] [-html-file None] [-version-id None] [-profile-name None] [-aws-access-key-id None] [-aws-secret-access-key None] bucket_key List object versions stored on versioned S3 bucket, create CSV and/or HTML file. CSV file can be used e.g. by `s3getvers` command. HTML file allows showing feed size and update period in chart. Version can be limited by time range `from` - `to`. `version-id` allow starting from specific version (back to the past, excluding given version). Object key is defined either as {bucket_name}/{key_name} or as alias from .s3lsvers file. Times are expressed in RFC 3339 format using Zulu (UTC) timezone, possibly truncated. For truncated time strings, maximal time extent is used. Listing has records with structure: `{key_name};{version_id};{size};{last_modified};{age}` - key_name: name of the key (excluding bucket name). - version_id: unique identifier for given version on given bucket. - size: size of key object in bytes - last_modified: RFC 3339 formated object modification time - age: update interval [s] for given version Examples: Lists all versions of given `keyname` on `bucket`:: $ s3lsvers bucketname/keyname Lists all versions in period betwen `from` and `to` time:: $ s3lsvers -from 2010-01-01 -to 2011-07-19T12:00:00 bucket/key Lists all versions and writes them into csv file named `versions.csv`:: $ s3lsvers -list-file versions.csv bucketname/keyname Lists all versions and write them into html chart file `chart.html`:: $ s3lsvers -html-file chart.html bucketname/keyname Using bucket/key_name aliases in .s3lsvers file Aliases are specified in file .s3lsvers, which may be located in currect directory, home directory or /etc/s3lsvers" `.s3lsvers` example:: #.s3lsversrc - definition of some preconfigured bucket/key values [DEFAULT] pl-base: pl-base.dp.tamtamresearch.com cz-base: cz-base.dp.tamtamresearch.com # alias name must not contain "/" [aliases] plcsr: %(pl-base)s/region/pl/ConsumerServiceReady.xml czcsr: %(cz-base)s/region/cz/ConsumerServiceReady.xml The format follows SafeConfigParser rules, see http://docs.python.org/2/library/configparser.html#safeconfigparser-objects To list all versions of czcsr alias:: $ s3lsvers czcsr positional arguments: bucket_key {bucket_name}/{key_name} for the key to list optional arguments: -h, --help show this help message and exit -from None, --from-time None start of version modification time range (default: oldest version) -to None, --to-time None end of version modification time range (default: now) -list-file None Name of output CSV file. -html-file None Name of output HTML file. -version-id None version-id to start after -profile-name None AWSCLI profile name -aws-access-key-id None AWS Access Key ID -aws-secret-access-key None AWS Secret Access Key
s3getvers
$ s3getvers -h usage: s3getvers [-h] [-output-version-id-names] [-no-decompression] [-profile-name None] [-aws-access-key-id None] [-aws-secret-access-key None] bucket_name csv_version_file Fetch S3 object versions as listed in a csv file Typical csv file (as by default produced by s3lsvers) is: m/y.xml;OrUr6XO8KSKEHbd8mQ.MloGcGlsh7Sir;191;2012-05-23T20:45:10.000Z;39 m/y.xml;xhkVOy.dJfjSfUwse8tsieqjDicp0owq;192;2012-05-23T20:44:31.000Z;62 m/y.xml;oKneK.N2wS8pW8.EmLqjldYlgcFwxN3V;193;2012-05-23T20:43:29.000Z;58 for `s3getvers` only the first two columns are significant: :key_name: name of the object (not containing the bucket name itself) :version_id: string, identifying unique version. Typical use (assuming, above csv file is available under name verlist.csv):: $ s3getvers yourbucketname verlist.csv What will create following files in current directory: * f.2012-05-23T20_45_10.xml * f.2012-05-23T20_44_31.xml * f.2012-05-23T20_43_29.xml Files are (by default) saved decompressed (even if gzipped on the bucket) positional arguments: bucket_name bucket name (default: None) csv_version_file name of CSV file with version_id optional arguments: -h, --help show this help message and exit -output-version-id-names Resulting file names shall use version_id to become distinguished (default is to use timestamp of file creation) -no-decompression Keeps the files as they come, do not decompress, if they come compressed -profile-name None Name of AWSCLI profile to use for credentials -aws-access-key-id None Your AWS Access Key ID -aws-secret-access-key None Your AWS Secret Access Key
s3tmpgen
$ s3tmpgen -h usage: s3tmpgen [-h] [-profile-name None] [-aws-access-key-id None] [-aws-secret-access-key None] [-validate-bucket] [-validate-key] [-http] expire_dt bucket_name [key_names [key_names ...]] Generate temporary url for accessing content of AWS S3 key. Temporary url includes expiration time, after which it rejects serving the content. Urls are printed one per line to stdout. For missing key names empty line is printed and error goes to stderr. If the bucket is versioned, tmp url will serve the latest version at the moment of request (version_id is not part of generated url). By default, bucket and key name existnence is not verified. Url is using https, unless `-http` is used. positional arguments: expire_dt ISO formatted time of expiration, full seconds, 'Z' is obligatory, e.g. '2014-02-14T21:47:16Z' bucket_name name of bucket key_names key names to generate tmpurl for optional arguments: -h, --help show this help message and exit -profile-name None Name of AWSCLI profile to use for credentials -aws-access-key-id None Your AWS Access Key ID -aws-secret-access-key None Your AWS Secret Access Key -validate-bucket Make sure, the bucket really exists -validate-key Make sure, the key really exists -http Force the url to use http and not https
Configuring AWS S3 credentials
Configure the credentials as you would do for using AWS CLI.
If you configure profiles, you may use switch -profile when calling the commands.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ttr_aws_utils_s3-0.6.1.tar.gz
.
File metadata
- Download URL: ttr_aws_utils_s3-0.6.1.tar.gz
- Upload date:
- Size: 16.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.7.4 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | eecc606d32dc6a2d69b196f8175ed778db55bbdf15784db38b2e141f3de7c54e |
|
MD5 | df468426ae3e5097a79e1b23b7c48422 |
|
BLAKE2b-256 | 8403ad422886d350f57508b6d861b71e48432d9c2f630a25a1b6d4e6d6ca69c3 |
File details
Details for the file ttr_aws_utils_s3-0.6.1-py3-none-any.whl
.
File metadata
- Download URL: ttr_aws_utils_s3-0.6.1-py3-none-any.whl
- Upload date:
- Size: 16.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: pdm/2.7.4 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2d257b3054c89c8481ddce61a1b67831f62274684e4f4ed974f0a58276e86363 |
|
MD5 | f22fc9fb41ea8f09b4741d3608c0b857 |
|
BLAKE2b-256 | dd4b9af32608164879024d3d3eb4fc5dbbc5ae5e2622e4f77304f5330c14e186 |