A table data processor

TabPro - Table Data Processor

TabPro is a Python-based tool for efficient processing of tabular data.

Features

Data Format Support

  • CSV
  • TSV
  • Excel
  • JSON
  • JSON Lines
  • Bidirectional conversion between all supported formats
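
To make the idea of format conversion concrete, here is a minimal standard-library sketch that turns CSV text into JSON Lines. It is only an illustration of the concept, not TabPro's implementation or API; the function name `csv_to_jsonl` is hypothetical.

```python
import csv
import io
import json


def csv_to_jsonl(csv_text: str) -> str:
    """Convert CSV text (with a header row) to JSON Lines text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # One JSON object per input row; values stay strings, as the csv module reads them.
    return "".join(json.dumps(row, ensure_ascii=False) + "\n" for row in reader)


if __name__ == "__main__":
    sample = "id,name\n1,Alice\n2,Bob\n"
    print(csv_to_jsonl(sample), end="")
```

Going the other way (JSON Lines to CSV) would read one object per line with `json.loads` and write the rows with `csv.DictWriter`.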

Table Operations

  1. Table Conversion

    • Convert between different formats
    • Customize output format settings
    • Filter and transform data
  2. Table Merging

    • Merge tables based on common columns
    • Handle multiple table merging
    • Support for staging and version control
  3. Table Aggregation

    • Data aggregation based on grouping
    • Statistical calculations
    • Duplicate detection
  4. Table Sorting

    • Sort by multiple columns
    • Custom sort order
  5. Table Comparison

    • Detect differences between tables
    • Data consistency checking
    • Detailed comparison reports

Installation

Prerequisites

  • Python 3.10 or higher
  • pip (Python package installer)

Install from PyPI

pip install tabpro
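
After installing, one way to confirm that the package is visible to the current Python environment is to query its metadata. This is a small sketch using the standard library's `importlib.metadata`; the helper name `installed_version` is hypothetical, and the distribution name `tabpro` is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("tabpro")
    print(f"tabpro version: {found}" if found else "tabpro is not installed")
```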

CLI Usage

Basic Command

tabpro [command] [options]

Available Commands

Table Conversion (convert)

tabpro convert [options] <input_file> <output_file>
# or
convert-tables [options] <input_file> <output_file>

Options:

  • --output-file, --output, -O: Path to the output file
  • --output-file-filtered-out, --output-filtered-out, -f: Path to the output file for filtered-out rows
  • --config, -c: Path to the configuration file
  • --pick-columns, --pick: Pick specific columns
  • --do-actions, --actions, --do: Actions to perform on the data
  • --ignore-file-rows, --ignore-rows, --ignore: Ignore specific rows
  • --no-header: Treat CSV/TSV data as having no header row
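
For scripted use, the command line can be assembled in Python and passed to `subprocess`. The sketch below uses only the `--config` and `--no-header` flags and the positional arguments shown above; the helper name `build_convert_command` and the file names are hypothetical, and the value formats accepted by the other options are not covered here.

```python
import shlex


def build_convert_command(input_file, output_file, no_header=False, config=None):
    """Assemble an argv list for `tabpro convert` from flags listed in this README."""
    command = ["tabpro", "convert"]
    if config is not None:
        command += ["--config", config]
    if no_header:
        command.append("--no-header")
    # Positional arguments follow the options, as in the usage line.
    command += [input_file, output_file]
    return command


if __name__ == "__main__":
    argv = build_convert_command("input.csv", "output.json", no_header=True)
    print(shlex.join(argv))
    # To actually run it (requires tabpro to be installed and on PATH):
    # import subprocess; subprocess.run(argv, check=True)
```

Building an argv list rather than a single shell string avoids quoting problems with file names that contain spaces.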

Table Merging (merge)

tabpro merge [options] <input_file1> <input_file2> [<input_file3> ...]
# or
merge-tables [options] <input_file1> <input_file2> [<input_file3> ...]

Options:

  • --previous-files, --previous, --old, -P: Previous files to merge
  • --modification-files, --modification, --new, -M: Modification files to merge
  • --keys, -K: Primary keys for merging
  • --allow-duplicate-conventional-keys: Allow duplicate keys in previous files
  • --allow-duplicate-modification-keys: Allow duplicate keys in modification files
  • --output-base-data-file: Path to output base data file
  • --output-modified-data-file: Path to output modified data file
  • --output-remaining-data-file: Path to output remaining data file
  • --merge-fields: Fields to merge
  • --merge-staging: Merge staging fields from modification files
  • --use-staging: Use staging fields

Table Aggregation (aggregate)

tabpro aggregate [options] <input_file>
# or
aggregate-tables [options] <input_file>

Options:

  • --output-file, --output, -O: Path to output file
  • --keys-to-show-duplicates: Keys whose duplicate values are shown
  • --keys-to-show-all-count: Keys for which all counts are shown
  • --keys-to-expand: Keys to expand
  • --show-count-threshold, --count-threshold, -C: Threshold for showing counts (default: 50)
  • --show-count-max-length, --count-max-length, -L: Maximum length for shown counts (default: 100)
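
Counting values per key and spotting duplicates are the core of this kind of aggregation. The following sketch shows both with `collections.Counter`; it illustrates the concept only and makes no claim about TabPro's output format. The function names are hypothetical.

```python
from collections import Counter


def count_values(rows, key):
    """Count how often each value of `key` occurs across the rows."""
    return Counter(row.get(key) for row in rows)


def duplicated_values(rows, key):
    """Return the values of `key` that occur more than once, in first-seen order."""
    counts = count_values(rows, key)
    return [value for value, count in counts.items() if count > 1]


if __name__ == "__main__":
    rows = [{"city": "Tokyo"}, {"city": "Osaka"}, {"city": "Tokyo"}]
    print(count_values(rows, "city"))
    print(duplicated_values(rows, "city"))
```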

Table Sorting (sort)

tabpro sort [options] <input_file>
# or
sort-tables [options] <input_file>
# or
tabpro-diff [options] <input_file>

Options:

  • --sort-keys, --sort-key, -K: Keys to sort by
  • --output-file, --output, -O: Path to output file
  • --reverse, -R: Reverse the sort order
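
Sorting by several keys, with an optional reversed order, maps directly onto Python's built-in `sorted`. The sketch below assumes rows are dictionaries that all contain the sort keys and that at least one key is given; `sort_rows` is a hypothetical name, not part of TabPro.

```python
from operator import itemgetter


def sort_rows(rows, sort_keys, reverse=False):
    """Sort rows by several keys at once, with earlier keys taking precedence."""
    # itemgetter builds a tuple of the key values, which sorted() compares in order.
    return sorted(rows, key=itemgetter(*sort_keys), reverse=reverse)


if __name__ == "__main__":
    rows = [
        {"group": "b", "rank": 2},
        {"group": "a", "rank": 2},
        {"group": "a", "rank": 1},
    ]
    for row in sort_rows(rows, ["group", "rank"]):
        print(row)
```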

Table Comparison (compare)

tabpro compare [options] <input_file1> <input_file2>
# or
compare-tables [options] <input_file1> <input_file2>

Options:

  • --output-path, --output-file, --output, -O: Path to the output table
  • --query-keys, --query, -Q: Primary keys for query
  • --compare-keys, --compare, -C: Keys for comparison
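
The distinction between query keys (which identify a row) and comparison keys (whose values are checked) can be illustrated with a small diff routine. This is a conceptual sketch, not TabPro's comparison logic or report format; `compare_tables` and the report field names are hypothetical.

```python
def compare_tables(left, right, query_keys, compare_keys):
    """Report rows missing on either side and rows whose compared fields differ."""
    def index(rows):
        return {tuple(row.get(k) for k in query_keys): row for row in rows}

    left_index, right_index = index(left), index(right)
    report = {"only_in_left": [], "only_in_right": [], "changed": []}
    for key, left_row in left_index.items():
        right_row = right_index.get(key)
        if right_row is None:
            report["only_in_left"].append(key)
            continue
        # Collect (left value, right value) pairs for fields that differ.
        diffs = {
            field: (left_row.get(field), right_row.get(field))
            for field in compare_keys
            if left_row.get(field) != right_row.get(field)
        }
        if diffs:
            report["changed"].append((key, diffs))
    report["only_in_right"] = [key for key in right_index if key not in left_index]
    return report


if __name__ == "__main__":
    left = [{"id": 1, "price": 10}, {"id": 2, "price": 20}]
    right = [{"id": 2, "price": 25}, {"id": 3, "price": 30}]
    print(compare_tables(left, right, query_keys=["id"], compare_keys=["price"]))
```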

Common Options

  • --verbose, -v: Enable verbose logging
  • --version, -V: Show version information

Features

  • Simple and user-friendly command-line interface
  • Flexible data processing options
  • Handles large datasets efficiently
  • Extensible design
