An implementation of the DataFrame specification in Python
Project description
Pydf: An Implementation of the DataFrame specification in Python
This is the official implementation of the DataFrame specification provided by Raven Computing.
Getting Started
This library is available on PyPI.
Install via:
pip install raven-pydf
For more information see pypi.org.
After installation you can use the entire DataFrame API by importing one class:
from raven.struct.dataframe import DataFrame
# read a DataFrame file into memory
df = DataFrame.read("/path/to/myFile.df")
# show the first 10 rows on stdout
print(df.head(10))
Alternatively, you can import all concrete Column types directly, for example:
from raven.struct.dataframe import (DefaultDataFrame,
IntColumn,
FloatColumn,
StringColumn)
# create a DataFrame with 3 columns and 3 rows
df = DefaultDataFrame(
IntColumn("A", [1, 2, 3]),
FloatColumn("B", [4.4, 5.5, 6.6])
StringColumn("C", ["cat", "dog", "horse"]))
print(df)
Compatibility
This library requires Python3.7 or higher.
Internally, this library uses Numpy for array operations. The minimum required version is v1.19.0
Documentation
The official unified documentation will be made available shortly.
Development
If you want to change code of this library or if you want to include it manually as a dependency without installing via PIP, you can do so by cloning this repository.
Setup
We are using virtual environments and the virtualenvwrapper utilities for all of our Python projects. If you are running on Linux then you can set up your development environment by sourcing the setup.sh script. This will create a virtual environment pydf for you and install all dependencies:
source setup.sh
Running Tests
Execute all unit tests via:
python -m unittest
Linting
Run pylint to perform static code analysis of the source code via:
pylint raven
License
This library is licensed under the Apache License Version 2 - see the LICENSE for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file raven-pydf-1.0.0.tar.gz
.
File metadata
- Download URL: raven-pydf-1.0.0.tar.gz
- Upload date:
- Size: 61.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.1 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8b2257d11c0d2fad10a528ec1cd415b7d03c305bde56748b9a237eb24c67caa2 |
|
MD5 | 3450517e8770ae64deb5872ff5db1cfd |
|
BLAKE2b-256 | 3f7d8f4f2b8228c90aeaf18ac475d1fe5f74e27ef65d10034dc2b9a92d750872 |
File details
Details for the file raven_pydf-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: raven_pydf-1.0.0-py3-none-any.whl
- Upload date:
- Size: 88.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.1 setuptools/47.1.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2e013156a9ee9adbff3deeb43a42343e05f61519a2c102c2f3b41ccadc73f86c |
|
MD5 | 3a5651978baced7faa9bea1fb17ffd4e |
|
BLAKE2b-256 | 8a8162b841d51e64db7ac3cc5616017ed196597d696c26713bf4d0d7773990e6 |