Alpha version of rasgoQL open-source package.
Project description
RasgoQL
RasgoQL is a light-weight data transformation package to bridge the gap between dbt and pandas. It allow users to construct, print & run SQL queries using a familiar python syntax. Under the covers it sends all processing to your data warehouse, allowing efficient transformation of massive datasets.
RasgoQL does these things well:
- Pulls existing DataWarehouse tables into pandas DataFrames for analysis
- Constructs SQL queries using a syntax that feels like pandas
- Creates views in your DataWarehouse to save transformed data
- Exports runnable sql in .sql files or dbt-compliant yml files
- Offers dozens of free SQL transforms to use
- Coming Soon: allows users to create & add custom transforms
Documentation is available at: https://docs.rasgoql.com
Ways RasgoQL can help
-
If you use pandas to build features, but you are working on a massive set of data that won't fit in your machine's memory. RasgoQL can help!
-
If your organization uses dbt of another SQL tool to run production data flows, but you prefer to build features in pandas. RasgoQL can help!
-
If you know pandas, but not SQL and want to learn how queries will translate. RasgoQL can help!
Package Dependencies
- jinja2
- pandas
- pyyaml
- python-dotenv
- rasgotransforms
[snowflake]
- snowflake-connector-python
- snowflake-connector-python[pandas]
[bigquery]
- google-auth-oauthlib
- google-cloud-bigquery
[postgres]
- psycopg2
- SQLAlchemy
- psycopg2-binary # Linux
[redshift]
- redshift_connector
- SQLAlchemy
- sqlalchemy-redshift
Major Version Releases
-
v1.0.0 (Feb 23, 2022)
- Added support for BigQuery
- Added support for Snowflake
- Added import dataset from pandas workflow
- Added export to dbt workflow
-
v1.1.0 (Mar 8, 2022)
- Added support for Postgres
-
v1.2.0 (Mar 24, 2022)
- Added a parameter to allow batch returning of Pandas DataFrames from
to_df()
andquery_into_df()
methods
- Added a parameter to allow batch returning of Pandas DataFrames from
-
v1.3.0 (Mar 25, 2022)
- Added support for MySQL
-
v1.4.0 (April 8, 2022)
- Added failure and execution telemetry for methods in Transform and Dataset classes
-
v1.5.0 (April 9, 2022)
- Added support for Amazon Redshift
See Changelog for full minor version release notes
About Us
RasgoQL is maintained by Rasgo. Rasgo's enterprise feature store integrates with your data warehouse to help users build features faster, collaborate with team members, and serve features to models in production.
Built for Data Scientists, by Data Scientists
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file rasgoql-1.6.4.tar.gz
.
File metadata
- Download URL: rasgoql-1.6.4.tar.gz
- Upload date:
- Size: 32.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.7.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f70421fa7c4fa06c8d8185ad5b0c84ee2aabce3b7bc2bf5f6bc8018bb577405d |
|
MD5 | f738021ffdcf8d05cdeba31376d9c712 |
|
BLAKE2b-256 | dcb761f755adbfdc05ceb4833ba830bc42e39ab6e5e3c7d9487c659cff53ef95 |
File details
Details for the file rasgoql-1.6.4-py3-none-any.whl
.
File metadata
- Download URL: rasgoql-1.6.4-py3-none-any.whl
- Upload date:
- Size: 49.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.7.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8f187b8e0b292eafe21ea7a9d7017c613c0c6ae6702443febda7f2b6a6c667f0 |
|
MD5 | 0e470463268ef5267ba6fef461f50edf |
|
BLAKE2b-256 | 5d256a51a9979b2e13d09cbf030d27cb91e2402d807725139bff78f8affa32cc |