Skip to main content

GDMO native classes for standardized interaction with data objects within Azure Databricks

Reason this release was yanked:

bugged

Project description

gdmo

PyPI Tests Changelog License

GDMO native classes for standardized interaction with data objects within Azure Databricks

This custom library allows our engineering team to use standardized packages that strip away a load of administrative and repetitive tasks from their daily object interactions. The current classes supported (V0.1.0) are:

Installation

Install this library using pip:

pip install gdmo

Usage

Forecast - Forecast

Standardized way of forecasting a dataset. Input a dataframe with a Series, a Time, and a Value column, and see the function automatically select the right forecasting model and generate an output.

Example usage:

forecaster = Forecast(spark, 'Invoiced Revenue').set_columns('InvoiceDate', 'ProductCategory', 'RevenueUSD')\
                                               .set_forecast_length(forecast_length)\
                                               .set_last_data_point(lastdatamonth)\
                                               .set_input(df)\
                                               .set_growth_cap(0.02)\
                                               .set_use_cap_growth(True)\
                                               .set_modelselection_breakpoints(12, 24)\
                                               .set_track_outcome(False)\
                                               .build_forecast()

Future expansions

API - APIRequest

Class to perform a standard API Request using the request library, which allows a user to just add their endpoint / authentication / method data, and get the data returned without the need of writing error handling or need to understand how to properly build a request.

Tables - Landing

Class to land a dataframe or csv file to the databricks landing zone, and optionally convert this to the bronze layer data. Just say where to store it, and the class will take care of it with error handling associated and a normalized routine is followed.

Tables - Delta

No longer one needs to write a twelve-command notebook to create a table. Call this class once and see it happen.

Development

To contribute to this library, first checkout the code. Then create a new virtual environment:

cd gdmo
python -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

python -m pip install -e '.[test]'

To run the tests:

python -m pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gdmo-0.0.9.tar.gz (20.3 kB view details)

Uploaded Source

Built Distribution

gdmo-0.0.9-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file gdmo-0.0.9.tar.gz.

File metadata

  • Download URL: gdmo-0.0.9.tar.gz
  • Upload date:
  • Size: 20.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for gdmo-0.0.9.tar.gz
Algorithm Hash digest
SHA256 862e1ea0b60fff8d62f7f94286c845b17288286f07c9632f430f4238a6b61477
MD5 a1b1f7d5b38de3f0bb6897c30e2ced8c
BLAKE2b-256 23b4be5bbafa17747fded79bd50a23b77dd6967d40cbb063046c0911c43658c6

See more details on using hashes here.

File details

Details for the file gdmo-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: gdmo-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for gdmo-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 872946642829b3d4d6ef61da6402d872a153e74687982dd46150d080e6520fc7
MD5 6e473808706a075408fe2ecf6cbb804f
BLAKE2b-256 dd91a41516727aada986e3c104f51c77dfa1cbce03b3e2eadc3282865e6f2783

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page