
Common Python tools and utilities for ML work

data-ml-utils

A Python utility package covering the libraries we commonly use.

Installation

This is an open-source library hosted on PyPI. Run the following command to install it.

pip install data-ml-utils --upgrade

Documentation

Head over to https://data-ml-utils.readthedocs.io/en/latest/index.html# to read our library documentation

Features

Pyathena client initialisation

Almost a one-liner

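import os
from data_ml_utils.pyathena_client.client import PyAthenaClient

# Placeholder values; replace "xxx" with your own credentials and bucket name.
os.environ["AWS_ACCESS_KEY_ID"] = "xxx"
os.environ["AWS_SECRET_ACCESS_KEY"] = "xxx" # pragma: allowlist secret
os.environ["S3_BUCKET"] = "xxx"

# With the environment set, the client is created without arguments.
pyathena_client = PyAthenaClient()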
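
Because the client in the example above is created without arguments, it appears to rely on the three environment variables being set beforehand. The following is a minimal sketch of a pre-flight check, assuming exactly those three names are required; the helper `missing_env_vars` is hypothetical and not part of data-ml-utils.

```python
import os

# Names taken from the example above; that all three are mandatory is an assumption.
REQUIRED_ENV_VARS = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "S3_BUCKET")


def missing_env_vars(required=REQUIRED_ENV_VARS, environ=None):
    """Return the names in `required` that are unset or empty in `environ`.

    Hypothetical helper for illustration; `environ` defaults to os.environ.
    """
    env = os.environ if environ is None else environ
    return [name for name in required if not env.get(name)]


# Usage: report what is missing before constructing the client.
missing = missing_env_vars()
if missing:
    print("Set these variables first:", ", ".join(missing))
```

Running the check first turns a possibly opaque connection failure into a message that names the variable to set.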

Pyathena query

Almost a one-liner

query = """
    SELECT
        *
    FROM
        dev.example_pyathena_client_table
    LIMIT 10
"""

df_raw = pyathena_client.query_as_pandas(final_query=query)
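
When the table name or row limit varies, the query string can be built by a small function instead of being edited by hand. This is a sketch only: `build_preview_query` is a hypothetical helper, not part of data-ml-utils, and the identifier check is a deliberately strict assumption about what a table name looks like.

```python
import re

# Accepts `table` or `schema.table` made of letters, digits and underscores.
# This is stricter than Athena itself; it is an assumption for illustration.
_IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*)?")


def build_preview_query(table, limit=10):
    """Return a SELECT * query for the first `limit` rows of `table`."""
    if not _IDENTIFIER.fullmatch(table):
        raise ValueError(f"unexpected table name: {table!r}")
    if isinstance(limit, bool) or not isinstance(limit, int) or limit <= 0:
        raise ValueError(f"limit must be a positive integer, got {limit!r}")
    return f"SELECT *\nFROM {table}\nLIMIT {limit}"


query = build_preview_query("dev.example_pyathena_client_table", limit=10)
```

The resulting string can then be passed as `final_query` to `query_as_pandas`, as in the example above.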

MLflow utils

Visit link

More to Come

  • Have a suggestion? Raise a feature request issue and we will review it!

Tutorials

Pyathena

A Jupyter notebook shows how to use the package's pyathena utilities: notebook

MLflow utils

A Jupyter notebook shows how to use the package's mlflow_databricks utilities: notebook
