Easily load data from CSV to test out your DynamoDB table design

Project description

dynamodb-dev-importer (ddbimp)

When working with DynamoDB, it is common practice to minimise the number of tables used, ideally down to just one.

Techniques such as sparse indexes and GSI overloading allow a lot of flexibility and efficiency.

Designing a good schema that supports your query patterns can be challenging. Often it is nice to try things out with a small amount of data. I personally find it convenient to enter data into a spreadsheet and play around with it there.

This utility eases the process of populating a DynamoDB table from a CSV file, exported from a spreadsheet, that follows a specific format common to DynamoDB modelling patterns.

Install

You can install it with

$ pip3 install ddbimp

Run

The following assumes that a table named people, with keys (pk: S, sk: S), is provisioned in your default region:

$ ddbimp --table people --skip 1 example.csv

Expected input format

pk	sk	data
PERSON-1	sales-Q1-2019	Alex	jan: 12012	feb: 1927

Your spreadsheet (and exported CSV) should contain columns for:

pk
sk
data (optional)
any columns after those three can hold arbitrary attributes of the form attribute_name: value, e.g. city: Edinburgh

Example row:

PERSON-1,sales-Q1-2019,Alex,jan: 12012,feb: 1927

This yields an item like:

{
    pk: 'PERSON-1',
    sk: 'sales-Q1-2019',
    data: 'Alex',
    jan: 12012,
    feb: 1927
}
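
The mapping from a CSV row to an item can be sketched in Python as below. This is only an illustration of the format described above, not ddbimp's actual implementation: the function names (parse_value, row_to_item, rows_to_items) are hypothetical, and converting whole-number values to int while leaving everything else as a string is an assumption based on the example item.

```python
import csv
import io


def parse_value(text):
    """Return an int when the text is a whole number, else the stripped string.

    Assumption: only integers are converted; ddbimp itself may type values differently.
    """
    text = text.strip()
    try:
        return int(text)
    except ValueError:
        return text


def row_to_item(row):
    """Convert one CSV row (a list of cells) into an item dict."""
    item = {"pk": row[0].strip(), "sk": row[1].strip()}
    # The third column is the optional `data` attribute.
    if len(row) > 2 and row[2].strip():
        item["data"] = row[2].strip()
    # Every remaining cell holds one `attribute_name: value` pair.
    for cell in row[3:]:
        if not cell.strip():
            continue
        name, sep, value = cell.partition(":")
        if not sep:
            raise ValueError(f"expected 'name: value', got {cell!r}")
        item[name.strip()] = parse_value(value)
    return item


def rows_to_items(lines, skip=0):
    """Parse CSV lines into items, ignoring the first `skip` rows (e.g. a header)."""
    rows = list(csv.reader(lines))
    return [row_to_item(row) for row in rows[skip:] if row]


sample = io.StringIO(
    "pk,sk,data\n"
    "PERSON-1,sales-Q1-2019,Alex,jan: 12012,feb: 1927\n"
)
items = rows_to_items(sample, skip=1)
print(items[0])
```

Passing skip=1 here drops the header row of the sample, in the same spirit as the --skip 1 option in the Run section.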

For a full example CSV, take a look at example.csv.

Key ideas

The table consists of a partition key pk: S and a sort key sk: S; their meaning varies depending on the item
A secondary index swaps the sort and partition keys, so its partition key is sk: S and its sort key is pk: S
A final secondary index uses sk: S and data: S, where data is an arbitrary value you might want to search for; its meaning depends on the item it is part of
Group items through a shared partition key and store sub-items under distinct sort keys, e.g. pk: PERSON-1, sk: sales-Q1-2019, jan: 12012, feb: 1927
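
As a rough sketch, the table and the two secondary indexes described above could be expressed as keyword arguments for DynamoDB's CreateTable call. The index names (sk-pk-index, sk-data-index), the on-demand billing mode, and the ALL projection are assumptions made for illustration; the text above does not specify them. Both indexes are declared as global secondary indexes, since their partition key differs from the table's.

```python
def table_definition(table_name):
    """Build keyword arguments for a CreateTable request (names are illustrative)."""

    def key_schema(hash_attr, range_attr):
        # HASH is the partition key, RANGE is the sort key.
        return [
            {"AttributeName": hash_attr, "KeyType": "HASH"},
            {"AttributeName": range_attr, "KeyType": "RANGE"},
        ]

    def index(name, hash_attr, range_attr):
        return {
            "IndexName": name,  # hypothetical index name
            "KeySchema": key_schema(hash_attr, range_attr),
            "Projection": {"ProjectionType": "ALL"},  # assumption: project every attribute
        }

    return {
        "TableName": table_name,
        "BillingMode": "PAY_PER_REQUEST",  # assumption: on-demand capacity
        # Only attributes used in a key schema are declared here.
        "AttributeDefinitions": [
            {"AttributeName": name, "AttributeType": "S"}
            for name in ("pk", "sk", "data")
        ],
        "KeySchema": key_schema("pk", "sk"),
        "GlobalSecondaryIndexes": [
            index("sk-pk-index", "sk", "pk"),      # inverted index: sk becomes the partition key
            index("sk-data-index", "sk", "data"),  # search by sk and data
        ],
    }


definition = table_definition("people")
# With boto3 installed and credentials configured, this could be passed on as:
#   boto3.client("dynamodb").create_table(**definition)
```

Building the request as a plain dict keeps the sketch runnable without AWS access; the commented call shows where boto3 would come in.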

AWS recently released a preview build of a tool called NoSQL Workbench. It builds on the above ideas. I've tried it out and it seems pretty good, but I am a luddite and am faster working in a spreadsheet right now. I'd certainly recommend giving it a try.

Useful resources

Caveats, TODO

Uses your default AWS profile
Region needs to be set
Make it work directly with Google Sheets via the Sheets API

Project details

Release history

0.4 (this version): Jan 23, 2020
0.3: Jan 23, 2020
0.2: Jan 23, 2020
