Skip to main content

GDMO native classes for standardized interaction with data objects within Azure Databricks

Reason this release was yanked:

bugged

Project description

gdmo

PyPI Tests Changelog License

GDMO native classes for standardized interaction with data objects within Azure Databricks

This custom library allows our engineering team to use standardized packages that strip away a load of administrative and repetitive tasks from their daily object interactions. The current classes supported (V0.1.0) are:

Installation

Install this library using pip:

pip install gdmo

Usage

Forecast - Forecast

Standardized way of forecasting a dataset. Input a dataframe with a Series, a Time, and a Value column, and see the function automatically select the right forecasting model and generate an output.

Example usage:

forecaster = Forecast(spark, 'Invoiced Revenue').set_columns('InvoiceDate', 'ProductCategory', 'RevenueUSD')\
                                               .set_forecast_length(forecast_length)\
                                               .set_last_data_point(lastdatamonth)\
                                               .set_input(df)\
                                               .set_growth_cap(0.02)\
                                               .set_use_cap_growth(True)\
                                               .set_modelselection_breakpoints(12, 24)\
                                               .set_track_outcome(False)\
                                               .build_forecast()

Future expansions

API - APIRequest

Class to perform a standard API Request using the request library, which allows a user to just add their endpoint / authentication / method data, and get the data returned without the need of writing error handling or need to understand how to properly build a request.

Tables - Landing

Class to land a dataframe or csv file to the databricks landing zone, and optionally convert this to the bronze layer data. Just say where to store it, and the class will take care of it with error handling associated and a normalized routine is followed.

Tables - Delta

No longer one needs to write a twelve-command notebook to create a table. Call this class once and see it happen.

Development

To contribute to this library, first checkout the code. Then create a new virtual environment:

cd gdmo
python -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

python -m pip install -e '.[test]'

To run the tests:

python -m pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gdmo-0.0.11.tar.gz (20.3 kB view details)

Uploaded Source

Built Distribution

gdmo-0.0.11-py3-none-any.whl (19.2 kB view details)

Uploaded Python 3

File details

Details for the file gdmo-0.0.11.tar.gz.

File metadata

  • Download URL: gdmo-0.0.11.tar.gz
  • Upload date:
  • Size: 20.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for gdmo-0.0.11.tar.gz
Algorithm Hash digest
SHA256 3ab4c77848d446717f1ce0c0c9ed636fca16c28ddca0699f6552914d5b9a7ebc
MD5 81c21c1fa5c04ab955c1bf62a1078ce8
BLAKE2b-256 1b1a1de85ec04f9473fb6c3499f45a75c669a5859c4f3826ee61794736c9350d

See more details on using hashes here.

File details

Details for the file gdmo-0.0.11-py3-none-any.whl.

File metadata

  • Download URL: gdmo-0.0.11-py3-none-any.whl
  • Upload date:
  • Size: 19.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for gdmo-0.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 023f7fcb2871933092f4cb16e4326caec49858d2c65830021fe68dff422d59dd
MD5 2602b748b75d6e5195e80b0f9fec4633
BLAKE2b-256 1d8a57afb14a34a197fe5897c066cf0f388216ea98dc82e3f7e5ff078cc7bb1b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page