a build tool for data
Project description
make for your data.
An automation tool for data manipulation.
Inspired by Open Refine.
The general principles in Databuild are:
- Low entry barrier
- Easy to install
- Easy to grasp
- Extensible
Databuild can be useful for scenarios such as:
- Documenting data transformations in your infoviz project
- Automate data processing in a declarative way
Installation
Install databuild:
$ pip install databuild
Quickstart
For more details, see the Extended Documentation.
$ data-build.py buildfile.json
buildfile.yaml contains a list of operations to be performed on data. Think of it as a script for a spreadsheet.
An example of build file could be:
- operation: sheets.import_data description: Importing data from csv file params: sheet: dataset1 format: csv filename: dataset1.csv skip_last_lines: 1 - operation: columns.add_column description: Calculate the gender ratio params: sheet: dataset1 name: Gender Ratio expression: language: python content: "return float(row['Totale Maschi']) / float(row['Totale Femmine'])" - operation: sheets.export_data description: save the data params: sheet: dataset1 format: csv filename: dataset2.csv
JSON buildfiles are also supported. databuild will guess the type based on the extension.
License
Licensed under BSD 3-clauses.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Filename, size | File type | Python version | Upload date | Hashes |
---|---|---|---|---|
Filename, size databuild-0.0.10.tar.gz (17.3 kB) | File type Source | Python version None | Upload date | Hashes View |