Pure Python data table functions.
Project description
TinyTim
A pure Python package for processing tabular data stored in dictionaries.
Description
TinyTim is a lightweight alternative to Pandas for processing tabular data. It's perfect for when you want to work with structured data but don't want to install Pandas and its many dependencies. TinyTim has zero dependencies outside of Python's standard library.
Data is stored in a simple dictionary format: {column_name: column_values}, making it easy to understand and work with.
Features
- Zero dependencies - Pure Python implementation
- Lightweight - Minimal footprint, fast installation
- Familiar API - Intuitive functions for data manipulation
- Type hints - Full type annotations for better IDE support
- Well tested - Comprehensive test suite
Installation
pip install tinytim
Requirements
- Python >= 3.8
Quick Start
Basic Data Operations
from tinytim.data import column_count, row_count, shape, head, tail
# Data is stored as {column_name: column_values}
data = {'x': [1, 2, 3, 4], 'y': [10, 20, 30, 40]}
# Get basic information
column_count(data)
# 2
row_count(data)
# 4
shape(data)
# (4, 2)
# Preview data
head(data, 2)
# {'x': [1, 2], 'y': [10, 20]}
tail(data, 2)
# {'x': [3, 4], 'y': [30, 40]}
Working with Rows
from tinytim.rows import row_dict, iterrows, row_dicts_to_data
data = {'x': [1, 2, 3], 'y': [6, 7, 8]}
# Get a single row
row_dict(data, 1)
# {'x': 2, 'y': 7}
# Iterate over rows
for index, row in iterrows(data):
print(f"Row {index}: {row}")
# Row 0: {'x': 1, 'y': 6}
# Row 1: {'x': 2, 'y': 7}
# Row 2: {'x': 3, 'y': 8}
# Convert row dictionaries to data
rows = [{'x': 1, 'y': 20}, {'x': 2, 'y': 21}, {'x': 3, 'y': 22}]
result = row_dicts_to_data(rows)
# {'x': [1, 2, 3], 'y': [20, 21, 22]}
Working with Columns
from tinytim.edit import add_to_column, multiply_column
data = {'x': [1, 2, 3, 4], 'y': [5, 6, 7, 8]}
# Add to a column
result = add_to_column(data, 'x', 10)
# {'x': [11, 12, 13, 14], 'y': [5, 6, 7, 8]}
# Multiply a column
result = multiply_column(data, 'x', 2)
# {'x': [2, 4, 6, 8], 'y': [5, 6, 7, 8]}
# Column-wise operations with sequences
result = add_to_column(data, 'x', [10, 20, 30, 40])
# {'x': [11, 22, 33, 44], 'y': [5, 6, 7, 8]}
Filtering and Selection
from tinytim.filter import filter_by_column_gt, filter_by_column_isin
data = {'x': [1, 2, 3, 4, 5], 'y': [10, 20, 30, 40, 50]}
# Filter rows where column value is greater than threshold
filtered = filter_by_column_gt(data, 'x', 2)
# {'x': [3, 4, 5], 'y': [30, 40, 50]}
# Filter rows where column value is in a list
filtered = filter_by_column_isin(data, 'x', [1, 3, 5])
# {'x': [1, 3, 5], 'y': [10, 30, 50]}
Handling Missing Values
from tinytim.na import isna, dropna, fillna
data = {'x': [1, None, 3], 'y': [10, 20, None]}
# Check for missing values
isna(data)
# {'x': [False, True, False], 'y': [False, False, True]}
# Drop rows with missing values
dropna(data)
# {'x': [1], 'y': [10]}
# Fill missing values
fillna(data, 0)
# {'x': [1, 0, 3], 'y': [10, 20, 0]}
# Forward fill
data2 = {'x': [None, 2, None, 4], 'y': [10, None, None, 40]}
fillna(data2, method='ffill')
# {'x': [None, 2, 2, 4], 'y': [10, 10, 10, 40]}
Grouping and Aggregation
from tinytim.group import groupby, sum_groups
data = {'category': ['A', 'B', 'A', 'B'], 'value': [1, 2, 3, 4]}
# Group by column
groups = groupby(data, 'category')
# [('A', {'category': ['A', 'A'], 'value': [1, 3]}),
# ('B', {'category': ['B', 'B'], 'value': [2, 4]})]
# Sum grouped data
labels, sums = sum_groups(groups)
# labels: ['A', 'B']
# sums: {'value': [4, 6]}
Joining Data
from tinytim.join import inner_join, left_join
left = {'id': [1, 2, 3], 'value': ['a', 'b', 'c']}
right = {'id': [2, 3, 4], 'score': [10, 20, 30]}
# Inner join on 'id' column
result = inner_join(left, right, 'id')
# {'id': [2, 3], 'value': ['b', 'c'], 'score': [10, 20]}
# Left join keeps all rows from left table
result = left_join(left, right, 'id')
# {'id': [1, 2, 3], 'value': ['a', 'b', 'c'], 'score': [None, 10, 20]}
More Examples
Editing Data
from tinytim.edit import edit_row_items, drop_row, drop_column
data = {'name': ['Alice', 'Bob', 'Charlie'],
'age': [25, 30, 35],
'city': ['NYC', 'LA', 'SF']}
# Edit specific row values
result = edit_row_items(data, 1, {'age': 31, 'city': 'Seattle'})
# {'name': ['Alice', 'Bob', 'Charlie'],
# 'age': [25, 31, 35],
# 'city': ['NYC', 'Seattle', 'SF']}
# Drop a row
result = drop_row(data, 0)
# {'name': ['Bob', 'Charlie'], 'age': [30, 35], 'city': ['LA', 'SF']}
# Drop a column
result = drop_column(data, 'city')
# {'name': ['Alice', 'Bob', 'Charlie'], 'age': [25, 30, 35]}
Inserting Data
from tinytim.insert import insert_row, insert_rows
data = {'x': [1, 2], 'y': [10, 20]}
# Insert a single row
result = insert_row(data, {'x': 3, 'y': 30})
# {'x': [1, 2, 3], 'y': [10, 20, 30]}
# Insert multiple rows
rows = [{'x': 4, 'y': 40}, {'x': 5, 'y': 50}]
result = insert_rows(data, rows)
# {'x': [1, 2, 4, 5], 'y': [10, 20, 40, 50]}
Advanced Filtering
from tinytim.filter import (
filter_by_column_eq,
filter_by_column_ne,
filter_by_column_le,
sample
)
data = {'product': ['A', 'B', 'A', 'C', 'B'],
'price': [10, 20, 15, 30, 25]}
# Filter by equality
result = filter_by_column_eq(data, 'product', 'A')
# {'product': ['A', 'A'], 'price': [10, 15]}
# Filter by inequality
result = filter_by_column_ne(data, 'product', 'A')
# {'product': ['B', 'C', 'B'], 'price': [20, 30, 25]}
# Filter by less than or equal
result = filter_by_column_le(data, 'price', 20)
# {'product': ['A', 'B', 'A'], 'price': [10, 20, 15]}
# Random sample
result = sample(data, 2, random_state=42)
# Returns 2 random rows (deterministic with random_state)
Copying Data
from tinytim.copy import copy_table, deepcopy_table
data = {'x': [1, 2, 3], 'y': [[10], [20], [30]]}
# Shallow copy (copies dict and lists)
copy1 = copy_table(data)
# Deep copy (copies everything recursively)
copy2 = deepcopy_table(data)
copy2['y'][0][0] = 999
# Original data unchanged: data['y'][0][0] == 10
Development
Setting up a development environment
# Clone the repository
git clone https://github.com/eddiethedean/tinytim.git
cd tinytim
# Install development dependencies
pip install -e ".[dev]"
# Or use the requirements file
pip install -r requirements_dev.txt
Running Tests
# Run tests with pytest
pytest
# Run tests with coverage
pytest --cov=tinytim
# Run tests across multiple Python versions
tox
Linting and Type Checking
# Lint with ruff
ruff check src tests
# Format code with ruff
ruff format src tests
# Type check with mypy
mypy src
Pre-commit Hooks
# Install pre-commit hooks
pip install pre-commit
pre-commit install
# Run hooks manually
pre-commit run --all-files
Examples
See the examples directory for Jupyter notebooks demonstrating various features:
- data.ipynb - Basic data operations
- rows.ipynb - Working with rows
- columns.ipynb - Working with columns
- join.ipynb - Joining datasets
- group.ipynb - Grouping and aggregation
- fillna.ipynb - Handling missing values
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Author
Odos Matthews - odosmatthews@gmail.com
License
This project is licensed under the MIT License - see the LICENSE.md file for details.
Why TinyTim?
TinyTim is ideal for:
- Embedded systems or environments with limited resources
- Lambda functions or serverless applications where package size matters
- Learning data manipulation concepts without the complexity of Pandas
- Scripts where you need basic data operations without heavy dependencies
- Distribution of tools where you want to minimize user installation burden
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tinytim-1.12.0.tar.gz.
File metadata
- Download URL: tinytim-1.12.0.tar.gz
- Upload date:
- Size: 44.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6bc53b5e33cd19445107ab62f5997f5f621e9360c82287852f4f39f11162a9ee
|
|
| MD5 |
0a013413c67d8e8fcd2dcb4bb3f00cb8
|
|
| BLAKE2b-256 |
4143ec22b6621f10b1f35a8f50b92d08abd861e35ebe4ef89c8c0d2547bb813f
|
File details
Details for the file tinytim-1.12.0-py3-none-any.whl.
File metadata
- Download URL: tinytim-1.12.0-py3-none-any.whl
- Upload date:
- Size: 33.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fec2bd5054a439121547982b836c708eaabe6f7f5e4128af82007989134ad0f9
|
|
| MD5 |
9e8d56daa1ec96b9befd5ef50005ca4d
|
|
| BLAKE2b-256 |
d86130cd1335d30aeafca31858ab501ccb2f5c829bff197d957be7ca3634b6bd
|