
A utility library which repairs and analyzes tabular data

Project description

A Python repository which repairs and analyzes tabular data


This module provides the capability to extract and repair blocks of data from 2D tables. These blocks can then be individually processed, stitched together, or filtered as needed by a particular program.
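To make the idea of a "block" concrete, here is a minimal sketch in plain Python. It does not use carpenter's API: `is_empty` and `split_blocks` are hypothetical helpers, and the rule that a fully empty row separates two blocks is an assumption made for illustration only.

```python
def is_empty(cell):
    """Treat None and whitespace-only strings as empty cells."""
    return cell is None or (isinstance(cell, str) and cell.strip() == "")


def split_blocks(table):
    """Split a 2D table (list of rows) into blocks of consecutive non-empty rows."""
    blocks, current = [], []
    for row in table:
        if all(is_empty(cell) for cell in row):
            # A fully empty row closes the block collected so far.
            if current:
                blocks.append(current)
                current = []
        else:
            current.append(row)
    if current:
        blocks.append(current)
    return blocks


table = [
    ["Year", "Revenue"],
    [2012, 100],
    [2013, 120],
    ["", ""],
    ["Year", "Expenses"],
    [2012, 80],
]
blocks = split_blocks(table)
```

Each entry of `blocks` can then be processed, filtered, or stitched back together on its own.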

Automatic cell conversions, combined with a multi-tier flagging system that records the magnitude of each change, allow for a wide variety of error handling. Additionally, missing titles can be repaired from surrounding cells in order to generate complete blocks from implied headings.
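The following sketch illustrates both ideas in plain Python. It is not carpenter's implementation: the `Flag` levels, `convert_cell`, and `repair_titles` are hypothetical names, and the choice to copy a missing title from the nearest title to its left is an assumption.

```python
from enum import IntEnum


class Flag(IntEnum):
    """Hypothetical flag tiers, ordered by how much the cell was changed."""
    NONE = 0   # value was already numeric
    MINOR = 1  # text was cleaned up and parsed as a number
    MAJOR = 2  # value could not be converted


def convert_cell(cell):
    """Return (value, flag) for a cell that is expected to hold a number."""
    if isinstance(cell, (int, float)):
        return cell, Flag.NONE
    # Strip surrounding whitespace and thousands separators before parsing.
    text = str(cell).strip().replace(",", "")
    try:
        return float(text), Flag.MINOR
    except ValueError:
        return None, Flag.MAJOR


def repair_titles(titles):
    """Fill each missing title with the nearest title to its left (assumed rule)."""
    repaired, last = [], None
    for title in titles:
        if title is None or str(title).strip() == "":
            repaired.append(last)
        else:
            last = title
            repaired.append(title)
    return repaired
```

A caller could then decide how strict to be, for example accepting `Flag.MINOR` conversions silently while logging or rejecting `Flag.MAJOR` ones.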


Dependencies

  • allset
  • pydatawrap



Installation

From source:

python setup.py install

From pip:

pip install carpenter


Features

  • Block detection
  • Title repairing
  • Tunable cell conversions
  • Column re-orienting

Language Preferences

  • Google Style Guide
  • Object Oriented (with a few exceptions)


TODO

  • Refactor top-level functionality
  • Add new usable functions
  • Separate some flagging from block iteration code


Author(s): Matthew Seal

© Copyright 2013, OpenGov
