Skip to main content

data_algebra is a data manipulation language that can both generate SQL queries and work on Pandas DataFrames.

Project description

data_algebra is a piped data wrangling system based on Codd's relational algebra and experience working with data manipulation languages at scale.
The primary purpose of the package is to support an easy to compose and maintain grammar of data processing steps that in turn can be used to generate database specific SQL. The package also implements the same transforms for Pandas and Polars DataFrames.

Currently the system is primarily adapted and testing for Pandas, Polars, Google BigQuery, PostgreSQL, SQLite, Spark, and MySQL.

R versions of the system are available as the rquery and rqdatatable packages.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data_algebra-1.7.1.tar.gz (185.3 kB view hashes)

Uploaded Source

Built Distribution

data_algebra-1.7.1-py3-none-any.whl (128.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page