A tool used for processing data in batches, such as in parallel processing.
Project description
English description follows Japanese.
リストを指定サイズのバッチに分割するシンプルなPythonユーティリティです。
概要
batch-div はリストを指定サイズの部分リスト(バッチ)に分割します。並列処理やバッチ処理など、データをまとめて処理したい場面で便利です。
itertools.batched(Python 3.12以降)と同様の機能を、古いバージョンのPythonでも利用できます。結果はすべてリストとして返されるため、外側・内側ともに len() が使え、進捗表示に適しています。ただし、すべてリストとしてメモリに展開するため、メモリに載りきらないほど巨大なデータには不向きです。
インストール
pip install batch-div
使い方
import batch_div
tasks = [0, 1, 2, 3, 4, 5, 6]
for batch in batch_div(tasks, 3):
print(batch)
出力:
[0, 1, 2]
[3, 4, 5]
[6]
第1引数に分割したいリスト、第2引数にバッチサイズを指定します。リストは先頭から順にバッチサイズごとに分割され、割り切れない場合は最後のバッチに残りの要素が入ります。
活用例
import batch_div
items = list(range(100))
batches = batch_div(items, 10)
print(f"全バッチ数: {len(batches)}") # -> 10
for i, batch in enumerate(batches):
print(f"バッチ {i+1}/{len(batches)} を処理中 ({len(batch)} 件)")
# ここに並列処理などを記述
ライセンス
CC0 1.0 Universal(パブリックドメイン)
A simple Python utility for splitting a list into batches of a specified size.
Overview
batch-div divides a list into sublists (batches) of a given size. It is useful when you want to process data in chunks — for example, in parallel or batch processing workflows.
Similar to itertools.batched (Python 3.12+), but works on older Python versions. All results are returned as lists, so len() is supported on both the outer and inner collections — handy for progress tracking. Note that because everything is materialized as lists, this tool is not suitable for extremely large datasets that don't fit in memory.
Installation
pip install batch-div
Usage
import batch_div
tasks = [0, 1, 2, 3, 4, 5, 6]
for batch in batch_div(tasks, 3):
print(batch)
Output:
[0, 1, 2]
[3, 4, 5]
[6]
The first argument is the list to split, and the second is the batch size. The input is divided into consecutive chunks of that size, with the last batch containing the remaining elements if the list doesn't divide evenly.
Use Case Example
import batch_div
items = list(range(100))
batches = batch_div(items, 10)
print(f"Total batches: {len(batches)}") # -> 10
for i, batch in enumerate(batches):
print(f"Processing batch {i+1}/{len(batches)} ({len(batch)} items)")
# Insert your parallel or batch processing here
License
CC0 1.0 Universal (Public Domain Dedication)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file batch_div-0.0.1-py3-none-any.whl.
File metadata
- Download URL: batch_div-0.0.1-py3-none-any.whl
- Upload date:
- Size: 3.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
50f0eb433b3d5fd83a2b18e50f3d346c619f55e6973a9229ddd5d032279a1749
|
|
| MD5 |
b73277bebe0185e89dc47e6d633d02bc
|
|
| BLAKE2b-256 |
6be3959758e55096fe5c55e476216fa0e781889cdc5df6e569ffc453836ca0d8
|