Python DataFrames powered by Julia
Project description
grizzlys: User-friendly Python DataFrames powered by Julia
grizzlys is a Python package that provides a native interface on top of Julia's popular DataFrames.jl package.
As a user-friendly alternative to existing Python packages such as pandas and polars, it is designed to be a convenient & easy to use DataFrames tool for data analysts, data engineers and data scientists alike, while still providing high performance and abstractions, thanks to Julia's high-performance computing capabilities.
Why you might consider using grizzlys
:white_check_mark: You are transitioning into Python from a Julia or R programming background
:white_check_mark: You are accustomed to working with Jupyter notebooks (or a REPL) and performing exploratory data analysis (EDA) on-the-fly
:white_check_mark: You need a quick-and-dirty data wrangling tool that provides readymade macros and convenience functions out of the box
:white_check_mark: You work with statistics or linear algebra often and require a wide range of statistical/algebraic functions to be well-integrated with your DataFrames
What is grizzlys (currently) NOT well-suited for
:x: Larger-than-memory datasets - grizzlys' current implementation relies on data being stored in-memory, and therefore it is not a good choice if you work with datasets that don't fit in your machine's RAM.
For such cases, using Polars or Dask DataFrames would be a much better choice as of now.
:x: Lazy Evaluation - Similar to the above, grizzlys is currently designed to be fully eager, which means it always immediately executes your code, as opposed to building a task/computation graph or thereabout and delaying execution until it's needed.
:x: Backwards compatibility - grizzlys is based on a relatively new programming language in Julia, and is developed using an advanced version of Python, with little regard to end-of-life versions or any compatibility with Python 2.7, for example.
You should therefore not rely on grizzlys for integrations with very old code or any other legacy/deprecated tools and implementations.
:x: Best-in-class Performance - Though Julia is widely considered a very high-performance language (it is actually a major reason why it's used under the hood here), grizzlys is still a work-in-progress (WIP) and therefore does not currently aim to compete with, or outperform, other high-performance DataFrame libraries, such as Polars (written in Rust) or Modin (Multi-threaded pandas).
This, of course, might no longer be a limitation in the future, as grizzlys will have undergone optimizations and maturation.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file grizzlys-0.0.1.tar.gz.
File metadata
- Download URL: grizzlys-0.0.1.tar.gz
- Upload date:
- Size: 10.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
33f0512edb881a33efec6339fb2da275a34df0945b5e7d56c86214921e91a134
|
|
| MD5 |
ec28f9885153b92f6d610d3ef713f6e2
|
|
| BLAKE2b-256 |
2f261c0b14ad1aabf3eeb02ae3298164ad0f30e730523c16da1ca6ca40736b2c
|
File details
Details for the file grizzlys-0.0.1-py3-none-any.whl.
File metadata
- Download URL: grizzlys-0.0.1-py3-none-any.whl
- Upload date:
- Size: 10.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6157f3bd6effbf4751e8d8047e81448f558dfeba254374e8b0b93d61fd54709e
|
|
| MD5 |
444252123b91516d89bf5d5893d99fe0
|
|
| BLAKE2b-256 |
d71dfd8eab3c428570e76bd123bc9dd300c2fc1f52d7c6e6428affa893bb0e3c
|