tabbed

An iterative reader of irregular text files

These details have not been verified by PyPI

Project links

Homepage

Project description

A Python package for reading variably structured text files at scale

python license

Tabbed is a Python library for reading variably structured text files. It automatically deduces data start locations, data types and performs iterative and value-based conditional reading of data rows.

Key Features

Structural Inference:
A common variant of the standard text file is one that contains metadata prior to a header or data section. Tabbed can locate the metadata, header and data locations in a file.
Type inference:
Tabbed can parse int, float, complex, time, date and datetime instances at high-speed via a polling strategy.
Conditional Reading:
Tabbed can filter rows during reading with equality, membership, rich comparison, regular expression matching and custom callables via simple keyword arguments.
Partial and Iterative Reading:
Tabbed supports reading of large text files that consumes only as much memory as you choose.

Usage

Below is a sample file with a Metadata section and Header using the tab character as the delimiter.

annotations.txt

Experiment ID Experiment
Animal ID Animal
Researcher Test
Directory path 

Number Start Time End Time Time From Start Channel Annotation
0 02/09/22 09:17:38.948 02/09/22 09:17:38.948 0.0000 ALL Started Recording
1 02/09/22 09:37:00.000 02/09/22 09:37:00.000 1161.0520 ALL start
2 02/09/22 09:37:00.000 02/09/22 09:37:08.784 1161.0520 ALL exploring
3 02/09/22 09:37:08.784 02/09/22 09:37:13.897 1169.8360 ALL grooming
4 02/09/22 09:37:13.897 02/09/22 09:38:01.262 1174.9490 ALL exploring
5 02/09/22 09:38:01.262 02/09/22 09:38:07.909 1222.3140 ALL grooming
6 02/09/22 09:38:07.909 02/09/22 09:38:20.258 1228.9610 ALL exploring
7 02/09/22 09:38:20.258 02/09/22 09:38:25.435 1241.3100 ALL grooming
8 02/09/22 09:38:25.435 02/09/22 09:40:07.055 1246.4870 ALL exploring
9 02/09/22 09:40:07.055 02/09/22 09:40:22.334 1348.1070 ALL grooming
10 02/09/22 09:40:22.334 02/09/22 09:41:36.664 1363.3860 ALL exploring

Dialect and Type Inference

Tabbed can detect the dialect via clevercsv and infer the data types.

from tabbed.reading import Reader
from tabbed.samples import paths

infile = open(paths.annotations, 'r')
reader = Reader(infile)
dialect = reader.sniffer.dialect
types, _ = reader.sniffer.types(poll=10)
    
print(dialect) # a clevercsv SimpleDialect
print('---')
print(types)

Output

SimpleDialect('\t', '"', None)
---
[<class 'int'>, <class 'datetime.datetime'>, <class 'datetime.datetime'>, <class 'float'>, <class 'str'>, <class 'str'>]

Metadata and Header detection

Tabbed can automatically locate the metadata, header and data rows.

print(reader.header)
print('---')
print(reader.metadata())

Output

Header(line=6,
       names=['Number', 'Start_Time', 'End_Time', 'Time_From_Start', 'Channel', 'Annotation'],
       string='Number\tStart Time\tEnd Time\tTime From Start\tChannel\tAnnotation')
---
MetaData(lines=(0, 6),
         string='Experiment ID\tExperiment\nAnimal ID\tAnimal\nResearcher\tTest\nDirectory path\t\n\n')

Filtered Reading with Tabs

Tabbed supports row and column filtering with equality, membership, rich comparison and regular expression matching. Its also fully iterative allowing users to choose the amount of memory to consume during file reading.

from itertools import chain

# tab rows whose Start_Time is between 9:38 and 9:40 and set reader to read
# only the Number and Start_Time columns
reader.tab(
    Start_Time='>= 2/09/2022 9:38:00 and <2/09/2022 9:40:00',
    columns=['Number', 'Start_Time']
)

# read the data to an iterator reading only 2 rows at a time
gen = reader.read(chunksize=2)

# convert to an in-memory list
data = list(chain.from_iterable(gen))
print(data)

# close the reader when done or open under context-management
reader.close()

Output

{'Number': 5, 'Start_Time': datetime.datetime(2022, 2, 9, 9, 38, 1, 262000)}
{'Number': 6, 'Start_Time': datetime.datetime(2022, 2, 9, 9, 38, 7, 909000)}
{'Number': 7, 'Start_Time': datetime.datetime(2022, 2, 9, 9, 38, 20, 258000)}
{'Number': 8, 'Start_Time': datetime.datetime(2022, 2, 9, 9, 38, 25, 435000)}

Documentation

The official documentation is hosted on github.io.

Dependencies

Tabbed depends on the excellent clevercsv package for dialect detection. The rest is pure Python.

Installation

Tabbed is hosted on pypi and can be installed with pip into a virtual environment.

pip install tabbed

To get a development version of Tabbed from source start by cloning the repository

git clone git@github.com:mscaudill/tabbed.git

Go to the directory you just cloned and create an editable install with pip.

pip install -e .[dev]

Contributing

We're excited you want to contribute! Please check out our Contribution guide.

Acknowledgements

We are grateful for the support of the Ting Tsung and Wei Fong Chao Foundation and the Jan and Dan Duncan Neurological Research Institute at Texas Children's that generously supports Tabbed.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.2.1

Dec 2, 2025

1.2.0

Nov 7, 2025

1.1.1

Sep 8, 2025

1.1.0

Aug 21, 2025

1.0.1

Jul 8, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tabbed-1.2.1.tar.gz (45.1 kB view details)

Uploaded Dec 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tabbed-1.2.1-py3-none-any.whl (31.7 kB view details)

Uploaded Dec 2, 2025 Python 3

File details

Details for the file tabbed-1.2.1.tar.gz.

File metadata

Download URL: tabbed-1.2.1.tar.gz
Upload date: Dec 2, 2025
Size: 45.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for tabbed-1.2.1.tar.gz
Algorithm	Hash digest
SHA256	`7be321097cecccf894bf2be9affce3752cda11b3bb67ce17fa43508fd24161e4`
MD5	`d3c4a74a279f7a222f9208c5f7ffe585`
BLAKE2b-256	`f83d074ca356d9264f992db92e19a8f98abe5e06e20b8f741de8e05a4d197e65`

See more details on using hashes here.

File details

Details for the file tabbed-1.2.1-py3-none-any.whl.

File metadata

Download URL: tabbed-1.2.1-py3-none-any.whl
Upload date: Dec 2, 2025
Size: 31.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for tabbed-1.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5d28ebf00b1910d038ddb1cd4ef05cc2cb2e17ba8d478a63f5eaa2cd3089b0e4`
MD5	`c341b3931d0bb01be1f462751e01eedb`
BLAKE2b-256	`62f501c12220884f78dc90697a22c5aaaa50563792ecd375af52fd7f1ca636c0`

See more details on using hashes here.

tabbed 1.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

A Python package for reading variably structured text files at scale

Key Features

Usage

Documentation

Dependencies

Installation

Contributing

Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes