A tabular data format that doesn't make you want to pull your hair out

The Tablo File Format

tablo is a plain text interchange format for tabular data. It is more expressive than CSV while remaining easy for people to read and write.

It adds explicit headers, datatypes, and cell formatting to address shortcomings of delimiter-separated formats like CSV or TSV. tablo solves delimiter collision issues by having well-defined quoting and escaping rules that are familiar to users of formats like JSON or common programming languages like Python or JavaScript.

What's Wrong with CSV?

the Microsoft version of CSV is a textbook example of how not to design a textual file format

—Eric S. Raymond, The Art of Unix Programming

Stated simply, there is no single CSV standard. It exists as a myriad of informal variants whose implementations vary from vendor to vendor. Character encodings and escape sequences differ from one application to the next, and ambiguities in various edge cases mean that the output of one application may not be readable by another.

tablo is designed to solve a number of ambiguities and shortcomings in CSV.

The most visible difference is that header rows are optional but well-defined. A document may or may not contain a header, but whether it does is always unambiguous.

A crucial aspect of the tablo format is that it doesn't make assumptions about the type of data in each cell. If a value is surrounded by quotes, it is always a string. If a value is a number without quotes, it is always a number. If a value is an ISO-8601 formatted date preceded by a hash mark, it is always a datetime.
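
As a rough illustration of these typing rules, the sketch below classifies a single raw cell token. It is not the tablo parser: classify_cell is a hypothetical helper, json.loads stands in for tablo's own string escaping (an assumption based on the JSON-like quoting described earlier), and any token that is neither quoted nor hash-prefixed is assumed to be a number.

```python
import json
from datetime import datetime


def classify_cell(token: str):
    """Turn one raw cell token into a Python value using the three rules above.

    Hypothetical helper for illustration only; not part of the tablo package.
    """
    token = token.strip()
    if len(token) >= 2 and token.startswith('"') and token.endswith('"'):
        # Quoted values are always strings. JSON decoding is an assumed
        # stand-in for tablo's escaping rules.
        return json.loads(token)
    if token.startswith("#"):
        # A hash mark followed by an ISO-8601 date is always a datetime.
        return datetime.fromisoformat(token[1:])
    # Everything else is treated as a number: int when possible, else float.
    # Tokens that are not numeric raise ValueError.
    try:
        return int(token)
    except ValueError:
        return float(token)
```

With this sketch, classify_cell('"24"') yields the string '24' while classify_cell('24') yields the integer 24, a distinction that a typical CSV reader cannot make because it returns the same text for both.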

Installation

Install with pip:

python -m pip install tablo-fyi

Usage

Parse a document with the parse function:

import tablo

data = tablo.parse('"name", "age"\n=0.1\n"Tom", 24\n"Jerry", 27\n')

name = data['A0']  # Retrieves the value in column A, row 0 => 'Tom'
age = data['B1']   # Retrieves the value in column B, row 1 => 27
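
The cell references used above pair a column letter with a zero-based row number. The sketch below shows one way such a reference could be split into numeric indices. split_cell_ref is a hypothetical helper, not part of tablo, and the spreadsheet-style handling of multi-letter columns (such as 'AA') is an assumption that the source does not confirm.

```python
import re
from typing import Tuple


def split_cell_ref(ref: str) -> Tuple[int, int]:
    """Split a reference such as 'B1' into (column_index, row_index).

    Hypothetical helper; both indices are zero-based, matching the
    'A0' => first column, first data row convention shown above.
    """
    match = re.fullmatch(r"([A-Z]+)(\d+)", ref)
    if match is None:
        raise ValueError(f"not a cell reference: {ref!r}")
    letters, digits = match.groups()
    column = 0
    for ch in letters:
        # Assumed spreadsheet-style base-26 letters: A..Z, then AA, AB, ...
        column = column * 26 + (ord(ch) - ord("A") + 1)
    return column - 1, int(digits)
```

Under these assumptions, split_cell_ref('A0') gives (0, 0) and split_cell_ref('B1') gives (1, 1).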

More Information

More information can be found in the tablo specification, and a set of example files can be found in the project repository.

Download files

Source Distribution

tablo-fyi-0.4.5.tar.gz (8.7 kB)

Uploaded Source

Built Distribution

tablo_fyi-0.4.5-py3-none-any.whl (8.3 kB)

Uploaded Python 3
