Skip to main content

GDMO native classes for standardized interaction with data objects within Azure Databricks

Reason this release was yanked:

bugged

Project description

gdmo

PyPI Tests Changelog License

GDMO native classes for standardized interaction with data objects within Azure Databricks

This custom library allows our engineering team to use standardized packages that strip away a load of administrative and repetitive tasks from their daily object interactions. The current classes supported (V0.1.0) are:

Installation

Install this library using pip:

pip install gdmo

Usage

Forecast - Forecast

Standardized way of forecasting a dataset. Input a dataframe with a Series, a Time, and a Value column, and see the function automatically select the right forecasting model and generate an output.

Example usage:

forecaster = Forecast(spark, 'Invoiced Revenue').set_columns('InvoiceDate', 'ProductCategory', 'RevenueUSD')\
                                               .set_forecast_length(forecast_length)\
                                               .set_last_data_point(lastdatamonth)\
                                               .set_input(df)\
                                               .set_growth_cap(0.02)\
                                               .set_use_cap_growth(True)\
                                               .set_modelselection_breakpoints(12, 24)\
                                               .set_track_outcome(False)\
                                               .build_forecast()

Future expansions

API - APIRequest

Class to perform a standard API Request using the request library, which allows a user to just add their endpoint / authentication / method data, and get the data returned without the need of writing error handling or need to understand how to properly build a request.

Tables - Landing

Class to land a dataframe or csv file to the databricks landing zone, and optionally convert this to the bronze layer data. Just say where to store it, and the class will take care of it with error handling associated and a normalized routine is followed.

Tables - Delta

No longer one needs to write a twelve-command notebook to create a table. Call this class once and see it happen.

Development

To contribute to this library, first checkout the code. Then create a new virtual environment:

cd gdmo
python -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

python -m pip install -e '.[test]'

To run the tests:

python -m pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gdmo-0.0.10.tar.gz (20.3 kB view details)

Uploaded Source

Built Distribution

gdmo-0.0.10-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file gdmo-0.0.10.tar.gz.

File metadata

  • Download URL: gdmo-0.0.10.tar.gz
  • Upload date:
  • Size: 20.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for gdmo-0.0.10.tar.gz
Algorithm Hash digest
SHA256 5b9b5386b6f874f2b99062a542f94d797abd76a689ce1e41edb9fe26d9c57b3b
MD5 20280e892a5da8b688772d087aaf7544
BLAKE2b-256 09445d0617055dbbaccb3b326096dc15335bd613eb7898b90c7876b4b8d9c374

See more details on using hashes here.

File details

Details for the file gdmo-0.0.10-py3-none-any.whl.

File metadata

  • Download URL: gdmo-0.0.10-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for gdmo-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 f9c311b1a39629734f6c5e109158e6b6ad1959c94ef1f7bca03bed836f01ccf0
MD5 9f299ce680032f5824bba28a02cceff8
BLAKE2b-256 0879f9ed642a995df7fe42c9b79658c2593551cd7fb1030999544eb392e79911

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page