Skip to main content

Rolling window statistics for numpy arrays. No pandas needed.

Project description

rollit logo

rollit

Rolling window statistics for numpy arrays. No pandas needed.

rollit is a lightweight Python library for calculating fast rolling-window statistics (like moving averages, standard deviations, and sums) on NumPy arrays. Instead of writing slow Python loops or importing a heavy 35 MB library like pandas, it uses memory-efficient NumPy stride tricks to calculate moving windows instantly with zero extra dependencies.

CI PyPI Downloads License Python


The Problem

When working with raw numpy arrays (such as sensor data, stock prices, fitness metrics, or machine learning features), developers constantly need rolling window statistics. Since NumPy has no built-in function for this, developers typically resort to writing slow Python for-loops, converting arrays to pandas DataFrames (adding a 35 MB dependency), or copying complex stride_tricks snippets from the web.

Why not just write raw stride tricks?

While NumPy's as_strided is powerful, using it directly has significant drawbacks:

  • Danger of Segfaults: A small mistake in calculating strides can point to invalid memory, causing segmentation faults and crashing the Python interpreter. rollit handles the memory layout safely and locks the returned views as read-only.
  • No Out-of-the-Box Math: Stride tricks only group your data into windows. You still have to write the reduction math for functions like standard deviation, z-scores, or normalization. rollit provides these pre-optimized.
  • No Input Validation: Passing invalid window sizes or bad dimensions raises cryptic errors or causes memory issues. rollit validates all array parameters and raises clear Python exceptions.
  • No Support for Missing Data: rollit has built-in support for min_periods to handle and mask incomplete windows or missing data.

rollit solves this. Clean interface, fast execution, and zero dependencies beyond numpy.


Installation

pip install rollit

Quick Start

import numpy as np
import rollit

arr = np.array([1.0, 2.0, 3.0, 4.0, 5.0])

rollit.mean(arr, window=3)   # array([2., 3., 4.])
rollit.sum(arr, window=3)    # array([6., 9., 12.])
rollit.std(arr, window=3)    # array([1., 1., 1.])
rollit.min(arr, window=3)    # array([1., 2., 3.])
rollit.max(arr, window=3)    # array([3., 4., 5.])

Before & After

Before (pandas: 35 MB dependency)

import numpy as np
import pandas as pd

arr = np.random.randn(1_000_000)
df = pd.DataFrame(arr)
rolling_mean = df.rolling(window=30).mean().values.flatten()[29:]

Before (manual loop: ~500 ms)

import numpy as np

arr = np.random.randn(1_000_000)
rolling_mean = []
for i in range(len(arr) - 29):
    rolling_mean.append(np.mean(arr[i:i+30]))
rolling_mean = np.array(rolling_mean)

After (rollit: ~5 ms, zero extra dependencies)

import numpy as np
import rollit

arr = np.random.randn(1_000_000)
rolling_mean = rollit.mean(arr, window=30)

Reference

All functions share a unified signature:

rollit.<function>(arr, window, min_periods=None)
Parameter Type Description
arr np.ndarray Input 1D numpy array of numeric values
window int Size of the moving window (positive integer)
min_periods int, optional Minimum number of non-NaN observations required to produce a value. Defaults to None

Core Functions

Function Description Example Output
rollit.mean(arr, 3) Rolling average array([2., 3., 4.])
rollit.std(arr, 3) Rolling sample standard deviation (ddof=1) array([1., 1., 1.])
rollit.sum(arr, 3) Rolling sum array([6., 9., 12.])
rollit.min(arr, 3) Rolling minimum array([1., 2., 3.])
rollit.max(arr, 3) Rolling maximum array([3., 4., 5.])

For arr = [1.0, 2.0, 3.0, 4.0, 5.0]

Anomaly Detection: zscore()

Computes the z-score of the last value in each window: (last - mean) / std. Values above +2 or below -2 are statistical outliers.

# Flat signal with one spike
signal = np.ones(20)
signal[10] = 100.0

scores = rollit.zscore(signal, window=5)
# The window containing the spike will have a very high z-score

Note: Returns NaN when the standard deviation of a window is 0 (constant values).

Normalization: normalize()

Computes rolling min-max normalization of the last value in each window: (last - min) / (max - min). Scales each value to [0, 1] relative to its local window.

arr = np.array([1.0, 5.0, 2.0, 9.0, 1.0])
rollit.normalize(arr, window=3)
# array([0.25, 1.  , 0.  ])

Note: Returns NaN when all values in a window are equal (max == min).

Custom Function: apply()

Escape hatch for any function that cannot be vectorized. Runs a Python loop internally, so it is slower than the built-in functions.

# Rolling median
rollit.apply(arr, window=3, fn=np.median)
# array([2., 3., 4.])

# Rolling range
rollit.apply(arr, window=3, fn=lambda w: w.max() - w.min())
# array([2., 3., 3.])

Output Length

rollit only returns values for complete windows with no NaN padding:

input length  = L
window size   = W
output length = L - W + 1

For an array of length 10 with window 3:

  • Output length = 8
  • Output index 0 corresponds to arr[0:3]
  • Output index 7 corresponds to arr[7:10]

Performance

All functions (except apply) use numpy.lib.stride_tricks.as_strided to create read-only memory views, meaning no data is copied.

Array Size Window Function Time
10,000 7 mean < 1 ms
1,000,000 30 mean < 10 ms
1,000,000 30 apply (custom fn) ~800 ms

apply() uses a Python loop by design. Use vectorized functions whenever possible.


Important Notes

  • Input must be a 1D numpy array. Passing a Python list raises TypeError. Passing a 2D array raises ValueError.
  • std() uses ddof=1 (sample standard deviation), matching the pandas default.
  • Avoid from rollit import *: sum, min, and max will shadow Python's built-in functions. Use import rollit and call rollit.sum(), rollit.min(), and rollit.max() instead.
  • min_periods in v1.0: Counts non-NaN values per window and masks positions below the threshold with NaN. Note that the underlying numpy operations (such as np.mean) are not NaN-aware; a window containing any NaN will produce NaN regardless of min_periods. Full NaN-aware support (nanmean, etc.) is planned for v2.0.

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for setup instructions, guidelines, and how to submit a PR.

Changelog

See CHANGELOG.md for version history.

License

MIT: see LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rollit-1.0.0.tar.gz (13.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rollit-1.0.0-py3-none-any.whl (8.3 kB view details)

Uploaded Python 3

File details

Details for the file rollit-1.0.0.tar.gz.

File metadata

  • Download URL: rollit-1.0.0.tar.gz
  • Upload date:
  • Size: 13.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rollit-1.0.0.tar.gz
Algorithm Hash digest
SHA256 5fa5899cff124f46264d8f82364c6a60966076c1abce5cdc96c4d211632b63ed
MD5 23a553526345647a4638bafba75229d7
BLAKE2b-256 5e0c1eaaf23b71e6139f8a73613bc691c7e78455ab0fc56732ad6f7ef1fb9002

See more details on using hashes here.

Provenance

The following attestation bundles were made for rollit-1.0.0.tar.gz:

Publisher: publish.yml on Ameya79/rollit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file rollit-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: rollit-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 8.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rollit-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 365b4bd01dabfb624485a951e7d201d802921decc87ffe8fbd7df6e9be0d9a5f
MD5 b49c11bbe16020c9ba7bc9be5ddb1e0d
BLAKE2b-256 7401106dd210128bda9b14849fd886df66e1850789ed64ac7eaf604e0ccd8d16

See more details on using hashes here.

Provenance

The following attestation bundles were made for rollit-1.0.0-py3-none-any.whl:

Publisher: publish.yml on Ameya79/rollit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page