Skip to main content

a build tool for data

Project description

make for your data.

An automation tool for data manipulation.

Inspired by Open Refine.

Installation

Install databuild:

$ pip install https://github.com/databuild/databuild/archive/master.tar.gz

Quickstart

For more details, see the Extended Documentation.

$ data-build.py buildfile.json

buildfile.json contains a list of operations to be performed on data. Think of it as a script for a spreadsheet.

An example of build file could be:

[
  {
    "function": "sheets.import_data",
    "description": "Importing data from csv file",
    "params": {
      "sheet": "dataset1",
      "format": "csv",
      "filename": "dataset1.csv",
      "skip_last_lines": 1
    }
  },
  {
    "function": "columns.add_column",
    "description": "Calculate the gender ratio",
    "params": {
      "sheet": "dataset1",
      "name": "Gender Ratio",
      "expression": {
        "language": "python",
        "content": "return float(row['Male Total']) / float(row['Female Total'])"
      }
    }
  },
  {
    "function": "sheets.export_data",
    "description": "",
    "params": {
      "sheet": "dataset1",
      "format": "csv",
      "filename": "dataset2.csv"
    }
  }
]

YAML buildfiles are also supported. databuild will guess the type based on the extension.

License

Licensed under BSD 3-clauses.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

databuild-0.0.1.tar.gz (13.8 kB view details)

Uploaded Source

File details

Details for the file databuild-0.0.1.tar.gz.

File metadata

  • Download URL: databuild-0.0.1.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for databuild-0.0.1.tar.gz
Algorithm Hash digest
SHA256 2a2ee36a6b205a0c85cacb31c2b3b1456fdf5aed5637e5dfc2530ca4a320880f
MD5 4397628109a34edd2775c323e869ff00
BLAKE2b-256 148487471b94d80412ac17352c44cf66debc8aea0481da5b7cd4d3054d78696e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page