Simple, light-weight data frames for Python
Project description
Simple, Light-Weight Data Frames for Python
Dataiter's DataFrame is a class for tabular data similar to R's
data.frame, implementing all common operations to manipulate data. It
is under the hood a dictionary of NumPy arrays and thus capable of fast
vectorized operations. You can consider it to be a light-weight
alternative to Pandas with a simple and consistent API. Performance-wise
Dataiter relies on NumPy and Numba and is likely to be at best
comparable to Pandas.
Installation
# Latest stable version
pip install -U dataiter
# Latest development version
pip install -U git+https://github.com/otsaloma/dataiter
# Numba (optional)
pip install -U numba
Dataiter optionally uses Numba to speed up certain operations. If you have Numba installed, Dataiter will use it automatically. It's currently not a hard dependency, so you need to install it separately.
Quick Start
>>> import dataiter as di
>>> data = di.read_csv("data/listings.csv")
>>> data.filter(hood="Manhattan", guests=2).sort(price=1).head()
.
id hood zipcode guests sqft price
int64 string string int64 float64 int64
──────── ───────── ─────── ────── ─────── ─────
0 42279170 Manhattan 10013 2 nan 0
1 42384530 Manhattan 10036 2 nan 0
2 18835820 Manhattan 10021 2 nan 10
3 20171179 Manhattan 10027 2 nan 10
4 14858544 Manhattan 2 nan 15
5 31397084 Manhattan 10002 2 nan 19
6 22289683 Manhattan 10031 2 nan 20
7 7760204 Manhattan 10040 2 nan 22
8 43292527 Manhattan 10033 2 nan 22
9 43268040 Manhattan 10033 2 nan 23
.
Documentation
https://dataiter.readthedocs.io/
If you're familiar with either dplyr (R) or Pandas (Python), the comparison table in the documentation will give you a quick overview of the differences and similarities in common operations.
https://dataiter.readthedocs.io/en/stable/comparison.html
Development
To install a virtualenv for development, use
make venv
or, for a specific Python version
make PYTHON=python3.X venv
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dataiter-0.9999.tar.gz.
File metadata
- Download URL: dataiter-0.9999.tar.gz
- Upload date:
- Size: 52.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c407c890cbb19f0663f80484a8ccddd502f5b7d3cc761b8b315557842e3dfde4
|
|
| MD5 |
9229f9979ab27d0546c780529f237b33
|
|
| BLAKE2b-256 |
5b2e92c13a98d757db6bdc85b4116532208946dfa5b86cdcfe3f3a41eba0a554
|
File details
Details for the file dataiter-0.9999-py3-none-any.whl.
File metadata
- Download URL: dataiter-0.9999-py3-none-any.whl
- Upload date:
- Size: 73.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6a8c4655e4f83cf4bb578e957a314a58b79d85b4adc7246641c48fc2d8d4e8b7
|
|
| MD5 |
8b15163deacbd2d7bff207f101453ff1
|
|
| BLAKE2b-256 |
e169147672e286b43a499be1cc88b22897f422bc30d4e8b1e5733c7848fdc724
|