Classes for data manipulation
Python Classes for Data Manipulation
Dataiter currently includes the following classes.
DataFrame is a class for tabular data, similar to R's data.frame or pandas.DataFrame. Under the hood it is a dictionary of NumPy arrays and is thus capable of fast vectorized operations. You can consider it a lightweight alternative to Pandas with a simple and consistent API. Performance-wise, Dataiter relies on NumPy and Numba and is likely to be at best comparable to Pandas.
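To make the "dictionary of NumPy arrays" idea concrete, here is a minimal sketch that uses plain NumPy rather than Dataiter itself. The column names and the `filter_rows` helper are hypothetical and are not Dataiter's API; refer to the documentation for the actual methods.

```python
import numpy as np

# A table stored column-wise: each key is a column name,
# each value is a NumPy array of equal length.
columns = {
    "name": np.array(["a", "b", "c", "d"]),
    "price": np.array([10.0, 25.0, 5.0, 40.0]),
}

def filter_rows(columns, mask):
    """Return a new dict with the boolean mask applied to every column."""
    return {name: values[mask] for name, values in columns.items()}

# The comparison is evaluated over the whole array at once,
# without a Python-level loop over rows.
expensive = filter_rows(columns, columns["price"] > 20)
print(expensive["name"].tolist())
```

Because every column is an array, a row filter is a single boolean mask applied per column, which is roughly why this kind of layout lends itself to vectorized operations.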
ListOfDicts is a class useful for manipulating data from JSON APIs. It provides functionality similar to libraries such as Underscore.js, with manipulation functions that iterate over the data and return a shallow modified copy of the original. attd.AttributeDict is used to provide convenient access to dictionary keys.
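One way to read "shallow modified copy" is sketched below with plain lists and dicts. The `modify` function here is a hypothetical stand-in written for illustration, not Dataiter's implementation: it assumes that each item is copied one level deep, so the original items are left untouched while nested values are still shared.

```python
def modify(records, **key_function_pairs):
    """Return a new list of new dicts, with keys set from functions of each item."""
    result = []
    for record in records:
        new = dict(record)  # shallow copy: nested values are shared, not duplicated
        for key, function in key_function_pairs.items():
            new[key] = function(new)
        result.append(new)
    return result

data = [
    {"name": "a", "tags": ["x", "y"]},
    {"name": "b", "tags": []},
]
result = modify(data, n_tags=lambda item: len(item["tags"]))
print(result[0]["n_tags"], "n_tags" in data[0])
```

The practical consequence of such a design is that chaining manipulation calls never changes the top-level keys of the input, but mutating a nested list or dict in the result would also affect the original.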
GeoJSON is a simple wrapper class that allows reading a GeoJSON file into a DataFrame and writing a data frame to a GeoJSON file. Any operations on the data are thus done with methods provided by the data frame class. Geometry is read as-is into the "geometry" column, but no special geometric operations are currently supported.
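The mapping from GeoJSON features to columns can be illustrated with the standard library alone. This is a sketch under assumptions, not Dataiter's reader: the `features_to_columns` helper and the sample data are made up for illustration, and feature properties are assumed to become ordinary columns while each geometry object is kept unchanged.

```python
import json

text = """
{"type": "FeatureCollection", "features": [
  {"type": "Feature", "properties": {"name": "A"},
   "geometry": {"type": "Point", "coordinates": [24.94, 60.17]}},
  {"type": "Feature", "properties": {"name": "B"},
   "geometry": {"type": "Point", "coordinates": [23.76, 61.5]}}
]}
"""

def features_to_columns(collection):
    """Turn a FeatureCollection into a dict of column lists."""
    features = collection["features"]
    # Collect property names in first-seen order.
    keys = []
    for feature in features:
        for key in feature["properties"]:
            if key not in keys:
                keys.append(key)
    columns = {key: [f["properties"].get(key) for f in features] for key in keys}
    # Geometry is stored as-is, one raw geometry object per row.
    columns["geometry"] = [f["geometry"] for f in features]
    return columns

columns = features_to_columns(json.loads(text))
print(list(columns))
```

With this layout, filtering or sorting by a property works like on any other table, while the geometry column is simply carried along.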
Installation
# Latest stable version
pip install -U dataiter
# Latest development version
pip install -U git+https://github.com/otsaloma/dataiter#egg=dataiter
# Numba (optional)
pip install -U numba
Dataiter optionally uses Numba to speed up certain operations. If you have Numba installed and importing it succeeds, Dataiter will use it automatically. It's currently not a hard dependency, so you need to install it separately.
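The general pattern for such an optional dependency can be sketched as follows. This is not Dataiter's actual code: `USE_NUMBA`, `optional_jit` and `sum_to` are hypothetical names, and the only library call assumed is `numba.njit`.

```python
try:
    import numba
    USE_NUMBA = True
except Exception:
    # Covers both a missing package and an import that fails for other reasons.
    numba = None
    USE_NUMBA = False

def optional_jit(function):
    """Compile with Numba when available, otherwise return the function unchanged."""
    if USE_NUMBA:
        return numba.njit(function)
    return function

@optional_jit
def sum_to(n):
    # A plain loop: fast when compiled, still correct when interpreted.
    total = 0
    for i in range(n):
        total += i
    return total

print(sum_to(5), USE_NUMBA)
```

The function gives the same result either way; only the speed differs, which is what makes it safe to treat the compiler as optional.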
Documentation
https://dataiter.readthedocs.io/
If you're familiar with either R's dplyr or Python's Pandas, the comparison table in the documentation will give you a quick overview of the differences and similarities.
Hashes for dataiter-0.29.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | ed5c1cbe4e74f0fc79a41430ac8d7da4d7854335307820591f1e41e8a244a6c8
MD5 | 393556bbf574e5dc3d288b153e489bff
BLAKE2b-256 | 0100534e433e0e31659d4fd847fd50165b11a11337a43ece1fa172e5bdba0a99