Skip to main content

Batch processing with progress tracking and error handling.

Project description

philiprehberger-batch-iter

Tests PyPI version Last updated

Batch processing with progress tracking and error handling.

Installation

pip install philiprehberger-batch-iter

Usage

from philiprehberger_batch_iter import batch, batch_map, collect_errors

# Split any iterable into fixed-size batches
for chunk in batch(range(10), size=3):
    print(chunk)
# [0, 1, 2]
# [3, 4, 5]
# [6, 7, 8]
# [9]

# Enable progress output to stderr
for chunk in batch(range(100), size=25, progress=True):
    process(chunk)
# batch 1: 25 items
# batch 2: 25 items
# ...

Batch map

from philiprehberger_batch_iter import batch_map

# Process items in batches and collect flattened results
results = batch_map(range(10), size=3, fn=lambda chunk: [x * 2 for x in chunk])
print(results)
# [0, 2, 4, 6, 8, 10, 12, 14, 16, 18]

Error collection

from philiprehberger_batch_iter import collect_errors

def process_batch(items):
    for item in items:
        if item < 0:
            raise ValueError(f"negative value: {item}")

result = collect_errors([1, 2, -3, 4, 5, -6], size=2, fn=process_batch)
print(result.processed)     # 6
print(len(result.errors))   # 2
print(result.duration_ms)   # 0.12

Async batching

from philiprehberger_batch_iter import batch_async

async def process():
    async for chunk in batch_async(async_data_source(), size=50):
        await handle(chunk)

API

Function / Class Description
batch(iterable, size, progress=False) Yield fixed-size batches from an iterable
batch_async(async_iterable, size) Async generator yielding fixed-size batches
collect_errors(iterable, size, fn) Process batches and collect errors into a result
BatchResult Dataclass with processed, errors, duration_ms

Development

pip install -e .
python -m pytest tests/ -v

Support

If you find this project useful:

Star the repo

🐛 Report issues

💡 Suggest features

❤️ Sponsor development

🌐 All Open Source Projects

💻 GitHub Profile

🔗 LinkedIn Profile

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

philiprehberger_batch_iter-0.2.0.tar.gz (6.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

philiprehberger_batch_iter-0.2.0-py3-none-any.whl (5.0 kB view details)

Uploaded Python 3

File details

Details for the file philiprehberger_batch_iter-0.2.0.tar.gz.

File metadata

File hashes

Hashes for philiprehberger_batch_iter-0.2.0.tar.gz
Algorithm Hash digest
SHA256 d47ea2ef8e2def07c60a9f6dec16c856dfb49d758c626e6c95c659d11c3f956e
MD5 98d81d7efca98c739af6b92e8a01d832
BLAKE2b-256 36372ae70646e00155246c58ace0ecd98053979d7934a054763118fc5cb3fac3

See more details on using hashes here.

File details

Details for the file philiprehberger_batch_iter-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for philiprehberger_batch_iter-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1fa83477c0dcb8052f1164ec3035b455e57d97c1fe2bd05ad47e110a9b7eeae1
MD5 2025e3c00211a1aa6c7217f91b85cc2b
BLAKE2b-256 3a2768595d36cf983308378953abf0e1369e358e7decbec3ebd40743b15564fa

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page