Skip to main content

Tidyverse-like functions for python polars with extended functionalities for science.

Project description

TidyPolars $^{4sci}$

tidypolars4sci provides functions that match as closely as possible to R's Tidyverse functions for manipulating data frames and conducting data analysis in Python using the blazingly fast Polars as backend.

The name tidypolars4sci reflects the module's main features:

  1. Matches the function names and functionalities of R's Tidyverse.
  2. Leverages the performance and efficiency of Polars under the hood.
  3. Tailored for scientific research, extending the default functionalities of both Polars and Tidyverse.

Details

tidypolars4sci is an extended API for Polars. One of the main advantages of using Polars as a data manipulation engine is its exceptional speed when compared to other alternatives (see here).

The primary distinction between tidypolars4sci and Polars lies in user interaction. The frontend functions are designed to closely resemble those available in R's Tidyverse, making it easier for users familiar with that ecosystem to transition to this library.

Another useful feature of tidypolars4sci is its extensive functionality aimed at facilitating data analysis and reporting for scientific research and academic publications. This includes the creation of LaTeX tables, which enhances the presentation of results.

Note: Due to the additional functionalities provided, tidypolars4sci may operate slightly slower than using Polars directly.

Documentation

Available here.

Installation

You can install tidypolars4sci with pip:

$ pip3 install tidypolars4sci

Or through conda:

$ conda install -c conda-forge tidypolars4sci

Basic usage

tidypolars4sci methods are designed to work like tidyverse functions:

import tidypolars4sci as tp

# create tibble data frame
df = tp.tibble(x = range(3),
               y = range(3, 6),
               z = ['a', 'a', 'b'])

(
    df
    .select('x', 'y', 'z')
    .filter(tp.col('x') < 4, tp.col('y') > 1)
    .arrange(tp.desc('z'), 'x')
    .mutate(double_x = tp.col('x') * 2,
            x_plus_y = tp.col('x') + tp.col('y')
            )
)
┌─────┬─────┬─────┬──────────┬──────────┐
 x    y    z    double_x  x_plus_y 
 ---  ---  ---  ---       ---      
 i64  i64  str  i64       i64      
╞═════╪═════╪═════╪══════════╪══════════╡
 2    5    b    4         7        
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
 0    3    a    0         3        
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
 1    4    a    2         5        
└─────┴─────┴─────┴──────────┴──────────┘

Converting to/from pandas data frames

If you need to use a package that requires pandas or polars data frames, you can convert from a tidypolars4sci tibble to either of those DataFrame formats.

# convert to pandas...
df = df.to_pandas()
# ... or convert to polars
df = df.to_polars()

To convert from a pandas or polars DataFrame to a tidypolars tibble:

# convert from pandas...
df = tp.from_pandas(df)
# or covert from polars
df = tp.from_polars(df)

Similar projects

  • tidypolars: tidypolars was the starting point of tidypolars4sci

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tidypolars4sci-0.0.1.23.tar.gz (4.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tidypolars4sci-0.0.1.23-py3-none-any.whl (714.9 kB view details)

Uploaded Python 3

File details

Details for the file tidypolars4sci-0.0.1.23.tar.gz.

File metadata

  • Download URL: tidypolars4sci-0.0.1.23.tar.gz
  • Upload date:
  • Size: 4.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for tidypolars4sci-0.0.1.23.tar.gz
Algorithm Hash digest
SHA256 7921b800293f088d0dde83d432ddad0b2bb5f56889db995e8b535827c2d45a88
MD5 69b5b9dc75467c4360e2e092f0020bf5
BLAKE2b-256 0757b705ca53f318eab6504a1ca0fcfa603492fbd6e3a40275857aab47ae3b69

See more details on using hashes here.

File details

Details for the file tidypolars4sci-0.0.1.23-py3-none-any.whl.

File metadata

File hashes

Hashes for tidypolars4sci-0.0.1.23-py3-none-any.whl
Algorithm Hash digest
SHA256 713b496f362cd83ddc7b0f4edf3ad93989448d82e0883d4b874ec138909a2cfb
MD5 1e9b4de126e587c2949bc8143084fb3b
BLAKE2b-256 d8de370593e1d352f2d65d8c5682fb48e75c039362f26b75097183279a14fac7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page