a build tool for data
Project description
make for your data.
An automation tool for data manipulation.
Inspired by Open Refine.
Installation
Install databuild:
$ pip install https://github.com/databuild/databuild/archive/master.tar.gz
Quickstart
For more details, see the Extended Documentation.
$ data-build.py buildfile.json
buildfile.json contains a list of operations to be performed on data. Think of it as a script for a spreadsheet.
An example of build file could be:
[
{
"function": "sheets.import_data",
"description": "Importing data from csv file",
"params": {
"sheet": "dataset1",
"format": "csv",
"filename": "dataset1.csv",
"skip_last_lines": 1
}
},
{
"function": "columns.add_column",
"description": "Calculate the gender ratio",
"params": {
"sheet": "dataset1",
"name": "Gender Ratio",
"expression": {
"language": "python",
"content": "return float(row['Male Total']) / float(row['Female Total'])"
}
}
},
{
"function": "sheets.export_data",
"description": "",
"params": {
"sheet": "dataset1",
"format": "csv",
"filename": "dataset2.csv"
}
}
]
YAML buildfiles are also supported. databuild will guess the type based on the extension.
License
Licensed under BSD 3-clauses.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
databuild-0.0.1.tar.gz
(13.8 kB
view details)
File details
Details for the file databuild-0.0.1.tar.gz.
File metadata
- Download URL: databuild-0.0.1.tar.gz
- Upload date:
- Size: 13.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2a2ee36a6b205a0c85cacb31c2b3b1456fdf5aed5637e5dfc2530ca4a320880f
|
|
| MD5 |
4397628109a34edd2775c323e869ff00
|
|
| BLAKE2b-256 |
148487471b94d80412ac17352c44cf66debc8aea0481da5b7cd4d3054d78696e
|