CUREd+ metadata generator
The CUREd+ metadata generator tool generates a list of all the columns in every table in the database.
Installation
Ensure Python is installed.
Install the AWS command-line interface (CLI).
Configure your access key using the aws configure command.
Install this package using the Python package manager:
pip install curedcolumns
Usage
To run the tool, specify the AWS CLI profile to use and the S3 bucket you want to inspect. The --output option writes the result to a file instead of the screen.
curedcolumns --profile $AWS_PROFILE $AWS_BUCKET --output $OUTPUT_FILE
If the profile does not exist yet, create it with the aws configure command:
aws configure --profile $AWS_PROFILE
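Before running the tool, it can be useful to confirm that the profile name you intend to pass to --profile actually exists. The following is a minimal sketch, not part of curedcolumns: it assumes the profile was stored in an INI-style AWS credentials file (by default ~/.aws/credentials), and the function name list_profiles is hypothetical.

```python
import configparser
from pathlib import Path


def list_profiles(credentials_path: Path) -> list[str]:
    """Return the profile names found in an AWS credentials file.

    Assumption: the file is the INI-style file written by `aws configure`,
    where each section header (for example [default] or [clean]) is a profile.
    A missing file yields an empty list.
    """
    parser = configparser.ConfigParser()
    # read() silently skips files that do not exist
    parser.read(credentials_path)
    return parser.sections()


def default_credentials_path() -> Path:
    """Default location of the credentials file on most systems."""
    return Path.home() / ".aws" / "credentials"
```

For example, "clean" in list_profiles(default_credentials_path()) indicates whether a profile named clean has been configured in the default location.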
To view the command line options:
$ curedcolumns --help
usage: curedcolumns [-h] [-v] [--version] [-l LOGLEVEL] [--prefix PREFIX]
                    [--no-sign-request] [--profile PROFILE] [-d DELIMITER]
                    [-o OUTPUT] [-f]
                    bucket

List all the field names for all the data sets in a bucket on AWS S3 object
storage and display the metadata in CSV format. This assumes a folder
structure in this layout: <data_set_id>/<table_id>/data/*.parquet

positional arguments:
  bucket                S3 bucket location URI

options:
  -h, --help            show this help message and exit
  -v, --verbose
  --version             Show the version number of this tool
  -l LOGLEVEL, --loglevel LOGLEVEL
  --prefix PREFIX       Limits the response to keys that begin with the
                        specified prefix.
  --no-sign-request
  --profile PROFILE     AWS profile to use
  -d DELIMITER, --delimiter DELIMITER
                        Column separator character
  -o OUTPUT, --output OUTPUT
                        Output file path. Default: screen
  -f, --force           Overwrite output file if it already exists
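The help text states that the tool expects object keys laid out as <data_set_id>/<table_id>/data/*.parquet. The sketch below illustrates how a key in that layout could be split into its parts. It is an illustration only, not the tool's own code; the names TableKey and parse_key are hypothetical.

```python
from pathlib import PurePosixPath
from typing import NamedTuple, Optional


class TableKey(NamedTuple):
    """The components of one object key in the assumed layout."""
    data_set_id: str
    table_id: str
    file_name: str


def parse_key(key: str) -> Optional[TableKey]:
    """Split a key of the form <data_set_id>/<table_id>/data/<file>.parquet.

    Returns None when the key does not match that layout.
    """
    parts = PurePosixPath(key).parts
    # Exactly four components are expected: data set, table, "data", file
    if len(parts) != 4:
        return None
    data_set_id, table_id, folder, file_name = parts
    if folder != "data" or not file_name.endswith(".parquet"):
        return None
    return TableKey(data_set_id, table_id, file_name)
```

Keys that do not follow the layout, such as a file stored directly under the table folder, are rejected by returning None rather than raising an error, so a caller can skip them while iterating over a bucket listing.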
Example
Inspect a bucket using the AWS CLI profile named "clean":
curedcolumns --profile clean s3://my_bucket.aws.com
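The same command can be assembled from a Python script, for instance when the tool is run as part of a larger job. This is a minimal sketch that uses only the options listed in the help output above; the helper names build_command and run_curedcolumns are hypothetical, and running the command assumes curedcolumns is installed and on the PATH.

```python
import subprocess
from typing import Optional


def build_command(
    bucket: str,
    profile: Optional[str] = None,
    prefix: Optional[str] = None,
    output: Optional[str] = None,
    force: bool = False,
) -> list[str]:
    """Build the curedcolumns argument list from the documented options."""
    command = ["curedcolumns"]
    if profile:
        command += ["--profile", profile]
    if prefix:
        command += ["--prefix", prefix]
    if output:
        command += ["--output", output]
    if force:
        command.append("--force")
    # The bucket URI is the single positional argument
    command.append(bucket)
    return command


def run_curedcolumns(bucket: str, **options) -> None:
    """Run the tool; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(build_command(bucket, **options), check=True)
```

Passing the arguments as a list avoids shell quoting problems with bucket names or file paths. Calling run_curedcolumns("s3://my_bucket.aws.com", profile="clean") would be equivalent to the command shown above.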
Development
See CONTRIBUTING.md.