GDMO native classes for standardized interaction with data objects within Azure Databricks
Reason this release was yanked:
bugged
Project description
gdmo
GDMO native classes for standardized interaction with data objects within Azure Databricks
This custom library allows our engineering team to use standardized packages that strip away a load of administrative and repetitive tasks from their daily object interactions. The current classes supported (V0.1.0) are:
Installation
Install this library using pip
:
pip install gdmo
Usage
Forecast - Forecast
Standardized way of forecasting a dataset. Input a dataframe with a Series, a Time, and a Value column, and see the function automatically select the right forecasting model and generate an output.
Example usage:
forecaster = Forecast(spark, 'Invoiced Revenue').set_columns('InvoiceDate', 'ProductCategory', 'RevenueUSD')\
.set_forecast_length(forecast_length)\
.set_last_data_point(lastdatamonth)\
.set_input(df)\
.set_growth_cap(0.02)\
.set_use_cap_growth(True)\
.set_modelselection_breakpoints(12, 24)\
.set_track_outcome(False)\
.build_forecast()
Future expansions
API - APIRequest
Class to perform a standard API Request using the request library, which allows a user to just add their endpoint / authentication / method data, and get the data returned without the need of writing error handling or need to understand how to properly build a request.
Tables - Landing
Class to land a dataframe or csv file to the databricks landing zone, and optionally convert this to the bronze layer data. Just say where to store it, and the class will take care of it with error handling associated and a normalized routine is followed.
Tables - Delta
No longer one needs to write a twelve-command notebook to create a table. Call this class once and see it happen.
Development
To contribute to this library, first checkout the code. Then create a new virtual environment:
cd gdmo
python -m venv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
python -m pip install -e '.[test]'
To run the tests:
python -m pytest
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file gdmo-0.0.10.tar.gz
.
File metadata
- Download URL: gdmo-0.0.10.tar.gz
- Upload date:
- Size: 20.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5b9b5386b6f874f2b99062a542f94d797abd76a689ce1e41edb9fe26d9c57b3b |
|
MD5 | 20280e892a5da8b688772d087aaf7544 |
|
BLAKE2b-256 | 09445d0617055dbbaccb3b326096dc15335bd613eb7898b90c7876b4b8d9c374 |
File details
Details for the file gdmo-0.0.10-py3-none-any.whl
.
File metadata
- Download URL: gdmo-0.0.10-py3-none-any.whl
- Upload date:
- Size: 19.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f9c311b1a39629734f6c5e109158e6b6ad1959c94ef1f7bca03bed836f01ccf0 |
|
MD5 | 9f299ce680032f5824bba28a02cceff8 |
|
BLAKE2b-256 | 0879f9ed642a995df7fe42c9b79658c2593551cd7fb1030999544eb392e79911 |