pandas-streaming

Streaming DataFrame: streaming over pandas.

Project description

pandas-streaming processes files with pandas when they are too big to hold in memory but too small for parallelization to bring a significant gain. The module replicates a subset of the pandas API and adds other functionality for machine learning.

from pandas_streaming.df import StreamingDataFrame
sdf = StreamingDataFrame.read_csv("filename", sep="\t", encoding="utf-8")

for df in sdf:
    # process this chunk of data
    # df is a dataframe
    print(df)
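
Because each iteration sees only one chunk, any statistic over the whole file has to be accumulated across chunks. The following is a minimal sketch of that pattern. It uses plain pandas `read_csv` with `chunksize` rather than pandas-streaming itself, and the in-memory data and the column names `x` and `y` are hypothetical stand-ins for a large file.

```python
import io

import pandas

# Hypothetical tab-separated data standing in for a file too big for memory.
raw = "x\ty\n" + "\n".join(f"{i}\t{i * 2}" for i in range(10))

total = 0.0
count = 0

# With chunksize set, read_csv returns an iterator of DataFrames
# instead of loading everything at once.
for chunk in pandas.read_csv(io.StringIO(raw), sep="\t", chunksize=4):
    total += chunk["y"].sum()  # partial sum for this chunk
    count += len(chunk)        # number of rows seen so far

mean_y = total / count
print(mean_y)
```

Only the running totals stay in memory between iterations, so peak memory is bounded by the chunk size rather than by the file size.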

The module can also stream an existing dataframe.

import pandas
df = pandas.DataFrame([dict(cf=0, cint=0, cstr="0"),
                       dict(cf=1, cint=1, cstr="1"),
                       dict(cf=3, cint=3, cstr="3")])

from pandas_streaming.df import StreamingDataFrame
sdf = StreamingDataFrame.read_df(df)

for chunk in sdf:
    # process this chunk of data
    # chunk is a dataframe; a distinct name avoids shadowing df above
    print(chunk)

It also contains helpers to split datasets into train and test sets under unusual constraints.
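
The constraints are not spelled out here. One plausible example, assumed purely for illustration, is that all rows sharing a group key must land on the same side of the split. The sketch below implements that idea with plain pandas; `split_by_group` and the `user` column are hypothetical names and this is not the module's own implementation.

```python
import random

import pandas


def split_by_group(df, group, test_size=0.25, seed=0):
    """Hypothetical helper: split so that no group value is on both sides."""
    keys = sorted(df[group].unique())
    rng = random.Random(seed)
    rng.shuffle(keys)
    # Pick whole groups for the test set, at least one.
    n_test = max(1, int(round(len(keys) * test_size)))
    test_keys = set(keys[:n_test])
    in_test = df[group].isin(test_keys)
    return df[~in_test], df[in_test]


df = pandas.DataFrame(
    {"user": ["a", "a", "b", "b", "c", "c", "d", "d"], "value": range(8)}
)
train, test = split_by_group(df, "user")
print(sorted(train["user"].unique()), sorted(test["user"].unique()))
```

Splitting on group keys instead of rows keeps correlated rows together, which avoids leaking information about a group from the train set into the test set.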

Release history

0.5.2 (this version): Dec 4, 2025
0.5.1: Sep 14, 2024
0.5.0: Jan 13, 2024
0.3.239: May 1, 2023
0.3.218: Oct 26, 2021
0.2.175: Aug 6, 2020
0.1.87: May 17, 2018
0.1.66: Feb 5, 2018

Download files

Source Distribution

pandas_streaming-0.5.2.tar.gz (35.6 kB)

File metadata

Download URL: pandas_streaming-0.5.2.tar.gz
Upload date: Dec 4, 2025
Size: 35.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for pandas_streaming-0.5.2.tar.gz
Algorithm	Hash digest
SHA256	`18022bed8adcc55db17ed4851be26de3d6fca5888152c674f82df78c5199f9fd`
MD5	`5f3c4c50c1a0047f6ddea6fb8e2a59d6`
BLAKE2b-256	`c755e0096d6755c779a5e9be9f16d2e9b683dac7d2f5d0fbb5934562274ba312`
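
A downloaded archive can be checked against the SHA256 digest listed above. The sketch below uses only the standard library; the helper names `sha256_of` and `matches_published_hash` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest published above for pandas_streaming-0.5.2.tar.gz.
EXPECTED_SHA256 = (
    "18022bed8adcc55db17ed4851be26de3d6fca5888152c674f82df78c5199f9fd"
)


def sha256_of(path, block_size=65536):
    """Return the hex SHA256 digest of a file, read block by block."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()


def matches_published_hash(path):
    """Hypothetical helper: True if the file matches the digest above."""
    return sha256_of(path) == EXPECTED_SHA256
```

After downloading, `matches_published_hash("pandas_streaming-0.5.2.tar.gz")` should return `True` for an intact copy of the archive.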
