Skip to main content

Tabulates structured data into a mergeable CSV format

Project description

OpenTabulate

OpenTabulate is a Python package designed to organize, tabulate, and process structured data. It currently aims to be a data processing framework for the Linkable Open Data Environment, an exploratory project by the Data Exploration and Integration Lab (DEIL) within the Center for Special Business Projects (CSBP) at Statistics Canada. OpenTabulate offers

  • a systematic way of organizing data using sources files (inspired by OpenAddresses),
  • tabulation of data into a common format (CSV) and common schema that is suitable for merging and linkage,
  • basic data cleaning and formatting, data filtering and customizing output schema.

Requirements

This package is meant to be run on a Linux-based distribution. Using the package only requires

Installation

Install the opentabulate package from the Python Package Index using whatever Python package manager you choose. For example, with pip installed you may simply run

$ pip3 install opentabulate

with any Python environment.

After installing the package, copy the OpenTabulate configuration file using the built-in command line flag.

$ opentab --copy-config

This copies the file opentabulate.conf.example from the package installation to $HOME/.config/opentabulate.conf. Provide a directory for the root_directory variable in the configuration, which will specify where the datasets will be loaded from and tabulated to. Configure the remaining variables as needed. Next run

opentab --initialize

which creates the subdirectories in root_directory. Now OpenTabulate is ready to run.

Documentation

For more information, please read our documentation here.

Issues

You can post questions, enhancement requests, and bugs in Issues.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for opentabulate, version 2.1.1
Filename, size File type Python version Upload date Hashes
Filename, size opentabulate-2.1.1-py3-none-any.whl (34.8 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size opentabulate-2.1.1.tar.gz (40.1 kB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page