Skip to main content

Python DataFrames powered by Julia

Project description

grizzlys


Code style: Ruff Linting: Ruff pre-commit

grizzlys: User-friendly Python DataFrames powered by Julia

grizzlys is a Python package that provides a native interface on top of Julia's popular DataFrames.jl package.

As a user-friendly alternative to existing Python packages such as pandas and polars, it is designed to be a convenient & easy to use DataFrames tool for data analysts, data engineers and data scientists alike, while still providing high performance and abstractions, thanks to Julia's high-performance computing capabilities.

Why you might consider using grizzlys

:white_check_mark: You are transitioning into Python from a Julia or R programming background

:white_check_mark: You are accustomed to working with Jupyter notebooks (or a REPL) and performing exploratory data analysis (EDA) on-the-fly

:white_check_mark: You need a quick-and-dirty data wrangling tool that provides readymade macros and convenience functions out of the box

:white_check_mark: You work with statistics or linear algebra often and require a wide range of statistical/algebraic functions to be well-integrated with your DataFrames

What is grizzlys (currently) NOT well-suited for

:x: Larger-than-memory datasets - grizzlys' current implementation relies on data being stored in-memory, and therefore it is not a good choice if you work with datasets that don't fit in your machine's RAM.

For such cases, using Polars or Dask DataFrames would be a much better choice as of now.

:x: Lazy Evaluation - Similar to the above, grizzlys is currently designed to be fully eager, which means it always immediately executes your code, as opposed to building a task/computation graph or thereabout and delaying execution until it's needed.

:x: Backwards compatibility - grizzlys is based on a relatively new programming language in Julia, and is developed using an advanced version of Python, with little regard to end-of-life versions or any compatibility with Python 2.7, for example.

You should therefore not rely on grizzlys for integrations with very old code or any other legacy/deprecated tools and implementations.

:x: Best-in-class Performance - Though Julia is widely considered a very high-performance language (it is actually a major reason why it's used under the hood here), grizzlys is still a work-in-progress (WIP) and therefore does not currently aim to compete with, or outperform, other high-performance DataFrame libraries, such as Polars (written in Rust) or Modin (Multi-threaded pandas).

This, of course, might no longer be a limitation in the future, as grizzlys will have undergone optimizations and maturation.


Go to Top

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grizzlys-0.0.1.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

grizzlys-0.0.1-py3-none-any.whl (10.1 kB view details)

Uploaded Python 3

File details

Details for the file grizzlys-0.0.1.tar.gz.

File metadata

  • Download URL: grizzlys-0.0.1.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for grizzlys-0.0.1.tar.gz
Algorithm Hash digest
SHA256 33f0512edb881a33efec6339fb2da275a34df0945b5e7d56c86214921e91a134
MD5 ec28f9885153b92f6d610d3ef713f6e2
BLAKE2b-256 2f261c0b14ad1aabf3eeb02ae3298164ad0f30e730523c16da1ca6ca40736b2c

See more details on using hashes here.

File details

Details for the file grizzlys-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: grizzlys-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 10.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for grizzlys-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 6157f3bd6effbf4751e8d8047e81448f558dfeba254374e8b0b93d61fd54709e
MD5 444252123b91516d89bf5d5893d99fe0
BLAKE2b-256 d71dfd8eab3c428570e76bd123bc9dd300c2fc1f52d7c6e6428affa893bb0e3c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page