Skip to main content

Recommends config to use for model tuning.

Project description

Tuning Config Recommender

⚡️ Supercharge your fine-tuning users with completely automated tuning configurations for their choice of model, and dataset enabling instant one-click tuning deployment 🚀.

Installation

pip install -e .

or from PyPI

pip install tuning_config_recommender

Library Usage

An example is given in lib_usage.py file which generates tuning configuration for fms-hf-tuning stack given model ibm-granite/granite-4.0-h-350m and dataset with HF ID ought/raft.

python lib_usage.py

CLI usage

python src/recommender/cli.py --tuning-data-config ./artifacts/test/data_config.yaml --accelerate-config ./artifacts/test/accelerate_config.yaml --tuning-config ./artifacts/test/tuning_config.yaml --compute-config ./artifacts/test/compute_config.yaml --output-dir ./output

Custom rules-dir usage

python src/recommender/cli.py --tuning-data-config ./artifacts/test/data_config.yaml --accelerate-config ./artifacts/test/accelerate_config.yaml --tuning-config ./artifacts/test/tuning_config.yaml --compute-config ./artifacts/test/compute_config.yaml --output-dir ./output --rules-dir custom_rules_dir

Writing custom action rules for custom modification would require following the below

  1. Should start with name "Custom_"
  2. Should subclass from Action class

An example can be found at custom_rules_dir.

API Usage

After installing it as a module you can start an API as

uvicorn tuning_config_recommender.api:app --reload

/docs endpoint provides details on the endpoint to make requests.

Architecture

Concepts

Intermediate Representation (IR)

IR is the standard format that all input formats have to be converted to so that rule engine can work on the IR without worrying about the exact input or output formats which could differ from one stack to another. There is a caveat with IR, not all arguments in the IR are strictly defined in the sense that the actions currently written may add some new arguments and IR or rule engine are restrictive about it. Its up to do the adapter if it wishes to consume it or not for the target format.

Action

An action takes IR as input at its current state and performs some heuristics and constructs a new IR object which is used as a JSON Merge patch by the rule-engine. Addtionally, the returned new IR object can also hold various information about the patch such as severity, type and natural language comments. As shown in the architecture, an action would be called multiple times by the rule engine until it explicitly calls out skip. When to skip is the responsibility of the action which could be a heuristic based on the state of the IR when its called. Some example actions can be seen here.

Rule Engine

Rule engine passes the IR across actions in the sequence they are defined and collects all JSON merge patches. These JSON merge patches are then applied over the IR. This process is again iterated until all actions call out for a skip. Finally, JSON patches (is different from the merge patch) with respect to the orginal IR provided to the rule engine is prepared while preserving all the metadata (comments etc) for each of the patch along with the final IR to adapters.

Adapter

Adapter converts source format to required IR format and consumes final IR and json patches as needed to deliver the target format. Adapters can be found here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tuning_config_recommender-0.1.2-py3-none-any.whl (73.5 kB view details)

Uploaded Python 3

File details

Details for the file tuning_config_recommender-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for tuning_config_recommender-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 26d5f533b48a29561e4ae4a90d680ae71143f031e7d1e93637c1af9d8c1aea97
MD5 9ce27eaeabf814fa027988ac952e0969
BLAKE2b-256 ce7fb9b9f915e596b5ad6857c859ec706bfb2bde15891b16582f031cfe3f43fc

See more details on using hashes here.

Provenance

The following attestation bundles were made for tuning_config_recommender-0.1.2-py3-none-any.whl:

Publisher: publish.yml on foundation-model-stack/tuning-config-recommender

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page