Skip to main content

a build tool for data

Project description

make for your data.

An automation tool for data manipulation.

Inspired by Open Refine.

The general principles in Databuild are:

  • Low entry barrier
  • Easy to install
  • Easy to grasp
  • Extensible

Databuild can be useful for scenarios such as:

  • Documenting data transformations in your infoviz project
  • Automate data processing in a declarative way


Install databuild:

$ pip install databuild


For more details, see the Extended Documentation.

$ buildfile.json

buildfile.yaml contains a list of operations to be performed on data. Think of it as a script for a spreadsheet.

An example of build file could be:

- operation: sheets.import_data
  description: Importing data from csv file
    sheet: dataset1
    format: csv
    filename: dataset1.csv
    skip_last_lines: 1
- operation: columns.add_column
  description: Calculate the gender ratio
    sheet: dataset1
    name: Gender Ratio
      language: python
      content: "return float(row['Totale Maschi']) / float(row['Totale Femmine'])"
- operation: sheets.export_data
  description: save the data
    sheet: dataset1
    format: csv
    filename: dataset2.csv

JSON buildfiles are also supported. databuild will guess the type based on the extension.


Licensed under BSD 3-clauses.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for databuild, version 0.0.10
Filename, size File type Python version Upload date Hashes
Filename, size databuild-0.0.10.tar.gz (17.3 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page