Tidyverse-like functions for python polars with extended functionalities for science.
Project description
TidyPolars $^{4sci}$
tidypolars4sci provides functions that match as closely as possible to R's Tidyverse functions for manipulating data frames and conducting data analysis in Python using the blazingly fast Polars as backend.
The name tidypolars4sci reflects the module's main features:
- Matches the function names and functionalities of R's Tidyverse.
- Leverages the performance and efficiency of Polars under the hood.
- Tailored for scientific research, extending the default functionalities of both Polars and Tidyverse.
Details
tidypolars4sci is an extended API for Polars. One of the main advantages of using Polars as a data manipulation engine is its exceptional speed when compared to other alternatives (see here).
The primary distinction between tidypolars4sci and Polars lies in user interaction. The frontend functions are designed to closely resemble those available in R's Tidyverse, making it easier for users familiar with that ecosystem to transition to this library.
Another useful feature of tidypolars4sci is its extensive functionality aimed at facilitating data analysis and reporting for scientific research and academic publications. This includes the creation of LaTeX tables, which enhances the presentation of results.
Note: Due to the additional functionalities provided, tidypolars4sci may operate slightly slower than using Polars directly.
Documentation
Available here.
Installation
You can install tidypolars4sci with pip:
$ pip3 install tidypolars4sci
Or through conda:
$ conda install -c conda-forge tidypolars4sci
Basic usage
tidypolars4sci methods are designed to work like tidyverse functions:
import tidypolars4sci as tp
# create tibble data frame
df = tp.tibble(x = range(3),
y = range(3, 6),
z = ['a', 'a', 'b'])
(
df
.select('x', 'y', 'z')
.filter(tp.col('x') < 4, tp.col('y') > 1)
.arrange(tp.desc('z'), 'x')
.mutate(double_x = tp.col('x') * 2,
x_plus_y = tp.col('x') + tp.col('y')
)
)
┌─────┬─────┬─────┬──────────┬──────────┐
│ x ┆ y ┆ z ┆ double_x ┆ x_plus_y │
│ --- ┆ --- ┆ --- ┆ --- ┆ --- │
│ i64 ┆ i64 ┆ str ┆ i64 ┆ i64 │
╞═════╪═════╪═════╪══════════╪══════════╡
│ 2 ┆ 5 ┆ b ┆ 4 ┆ 7 │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ 0 ┆ 3 ┆ a ┆ 0 ┆ 3 │
├╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌┤
│ 1 ┆ 4 ┆ a ┆ 2 ┆ 5 │
└─────┴─────┴─────┴──────────┴──────────┘
Converting to/from pandas data frames
If you need to use a package that requires pandas or polars data frames, you can convert from a tidypolars4sci tibble to either of those DataFrame formats.
# convert to pandas...
df = df.to_pandas()
# ... or convert to polars
df = df.to_polars()
To convert from a pandas or polars DataFrame to a tidypolars tibble:
# convert from pandas...
df = tp.from_pandas(df)
# or covert from polars
df = tp.from_polars(df)
Similar projects
- tidypolars: tidypolars was the starting point of tidypolars4sci
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tidypolars4sci-0.0.1.23.tar.gz.
File metadata
- Download URL: tidypolars4sci-0.0.1.23.tar.gz
- Upload date:
- Size: 4.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7921b800293f088d0dde83d432ddad0b2bb5f56889db995e8b535827c2d45a88
|
|
| MD5 |
69b5b9dc75467c4360e2e092f0020bf5
|
|
| BLAKE2b-256 |
0757b705ca53f318eab6504a1ca0fcfa603492fbd6e3a40275857aab47ae3b69
|
File details
Details for the file tidypolars4sci-0.0.1.23-py3-none-any.whl.
File metadata
- Download URL: tidypolars4sci-0.0.1.23-py3-none-any.whl
- Upload date:
- Size: 714.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
713b496f362cd83ddc7b0f4edf3ad93989448d82e0883d4b874ec138909a2cfb
|
|
| MD5 |
1e9b4de126e587c2949bc8143084fb3b
|
|
| BLAKE2b-256 |
d8de370593e1d352f2d65d8c5682fb48e75c039362f26b75097183279a14fac7
|