Skip to main content

Common Python tools and utilities for ML work

Project description

data-ml-utils

A utility python package that covers the common libraries we use

Installation

This is an open source library hosted on pypi. Run the following command to install the library.

pip install data-ml-utils --upgrade

Documentation

Head over to https://data-ml-utils.readthedocs.io/en/latest/index.html# to read our library documentation

Feature

Pyathena client initialisation

Almost one liner

import os
from data_ml_utils.pyathena_client.client import PyAthenaClient

os.environ["AWS_ACCESS_KEY_ID"] = "xxx"
os.environ["AWS_SECRET_ACCESS_KEY"] = "xxx" # pragma: allowlist secret
os.environ["S3_BUCKET"] = "xxx"

pyathena_client = PyAthenaClient()

Pyathena client initialisation

Pyathena query

Almost one liner

query = """
    SELECT
        *
    FROM
        dev.example_pyathena_client_table
    LIMIT 10
"""

df_raw = pyathena_client.query_as_pandas(final_query=query)

Pyathena query

MLflow utils

Visit link

More to Come

  • You suggest, raise a feature request issue and we will review!

Tutorials

Pyathena

There is a jupyter notebook to show how to use the package utility package for pyathena: notebook

MLflow utils

There is a jupyter notebook to show how to use the package utility package for mlflow_databricks: notebook

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sm_data_ml_utils-1.0.4.tar.gz (15.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sm_data_ml_utils-1.0.4-py3-none-any.whl (17.7 kB view details)

Uploaded Python 3

File details

Details for the file sm_data_ml_utils-1.0.4.tar.gz.

File metadata

  • Download URL: sm_data_ml_utils-1.0.4.tar.gz
  • Upload date:
  • Size: 15.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.13.5 Darwin/24.5.0

File hashes

Hashes for sm_data_ml_utils-1.0.4.tar.gz
Algorithm Hash digest
SHA256 4bf20013c20f53c2341a61b52a814f320e12729eb35c1d8df35e7a476739d1ba
MD5 6948f92e9b9dee7d63eded541695347b
BLAKE2b-256 86b0cb26fff5dbaffcab7245ede66dd9ab5faa7fe01e271780ad631c53c4c970

See more details on using hashes here.

File details

Details for the file sm_data_ml_utils-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: sm_data_ml_utils-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 17.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.13.5 Darwin/24.5.0

File hashes

Hashes for sm_data_ml_utils-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 720a76a28ee6cba735f1297f2023759a6ba5c640ff4be80b069f8c8274f0f37c
MD5 374d0ad464887d7b66f64bab10af939d
BLAKE2b-256 a022da80e00e92d5448703c4b5fe79bd09ae56788c57f4aff79f902ba6c49cdb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page