Pre-processing logic used for both model-training and model-service.

Project description

lib-ml

Contains the pre-processing logic for data used in training and in queries. The package is automatically versioned and uploaded to PyPI. It uses a tokenizer hosted on Google Drive, but it can also load a tokenizer locally, or create a new tokenizer and save it.
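
The load-or-create behaviour described above can be illustrated with a small standalone sketch. This is not the lib-ml implementation: `build_vocab`, `get_vocab`, `encode`, and the `fetch_remote` callback are hypothetical names, the "tokenizer" is a plain character-to-index mapping stored as JSON, and the lookup order (local copy first, then remote, then create) is a choice made for this sketch only.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Iterable, List, Optional

Vocab = Dict[str, int]


def build_vocab(texts: Iterable[str]) -> Vocab:
    """Build a character-level vocabulary; index 0 is reserved for unknown characters."""
    chars = sorted({ch for text in texts for ch in text})
    return {ch: i + 1 for i, ch in enumerate(chars)}


def get_vocab(
    local_path: Path,
    fetch_remote: Optional[Callable[[], str]] = None,  # hypothetical download hook
    texts: Optional[Iterable[str]] = None,
) -> Vocab:
    """Return a vocabulary from disk, from a remote source, or freshly built."""
    # 1. Use the local copy if one already exists.
    if local_path.exists():
        return json.loads(local_path.read_text(encoding="utf-8"))

    # 2. Otherwise download it (e.g. from shared storage) and cache it locally.
    if fetch_remote is not None:
        raw = fetch_remote()
        local_path.write_text(raw, encoding="utf-8")
        return json.loads(raw)

    # 3. Otherwise build a new vocabulary from the given texts and save it.
    if texts is None:
        raise ValueError("no local file, no remote source, and no texts to build from")
    vocab = build_vocab(texts)
    local_path.write_text(json.dumps(vocab), encoding="utf-8")
    return vocab


def encode(text: str, vocab: Vocab) -> List[int]:
    """Map each character to its index, using 0 for characters not in the vocabulary."""
    return [vocab.get(ch, 0) for ch in text]
```

Passing the download step in as a callback keeps the sketch free of network code, so the same function can be exercised offline.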

Usage

Install with

pip install remla24-team8-lib-ml

Use as

from lib_ml import DataProcessor 

data_processor = DataProcessor()

Publish a release

To publish a new release, create a new release on GitHub with the appropriate tag name. Make sure the tag name matches the version in pyproject.toml. GitHub Actions will then build and publish the package automatically. If the release fails, delete the tag from the remote by running git push origin --delete <tag name> inside the Git repo; you can then re-publish the (now draft) release on GitHub.

When using Poetry, you can also depend on the latest commit using the following dependency:

remla24-team8-lib-ml = { git = "https://github.com/remla24-team8/lib-ml.git", branch = "main" }

PyPI organization

For now, the package is owned by tmtenbrink on PyPI. We hope to get an organization account so that it can be moved there.

Download files

Source Distribution

remla24_team8_lib_ml-0.1.5.tar.gz (3.0 kB, source)

Built Distribution

remla24_team8_lib_ml-0.1.5-py3-none-any.whl (3.4 kB, Python 3)
