FMS Acceleration Plugin Framework


FMS Acceleration Framework Library

This contains the library code that implements the acceleration plugin framework, in particular the classes:

  • AccelerationFramework
  • AccelerationPlugin

The library is envisioned to:

  • Provide a single integration point into Hugging Face.
  • Manage AccelerationPlugin instances in a flexible manner.
  • Load plugins from a single configuration YAML, while enforcing compatibility rules on how plugins can be combined.

See the following resources:

Using AccelerationFramework with HF Trainer

Begin by instantiating an AccelerationFramework object, passing a YAML configuration (say, via a path path_to_config):

from fms_acceleration import AccelerationFramework
framework = AccelerationFramework(path_to_config)

Plugins are automatically configured based on the configuration; for more details on how plugins are configured, see below.

Some plugins may require custom model loaders (in place of the typical AutoModel.from_pretrained). In this case, call framework.model_loader:

model = framework.model_loader(model_name_or_path, ...)

E.g., in the GPTQ case (see the sample GPTQ QLoRA configuration), we require model_name_or_path to be custom-loaded from a quantized checkpoint.

We provide a flag framework.requires_custom_loading to check if plugins require custom loading.

Also, some plugins will require the model to be augmented, e.g., by replacing layers with plugin-compliant PEFT adapters. In this case:

# will also take in some other configs that may affect augmentation
# some of these args may be modified due to the augmentation
# e.g., peft_config will be consumed in augmentation, and returned as None 
#       to prevent SFTTrainer from doing extraneous PEFT logic
model, (peft_config,) = framework.augmentation(
    model, 
    train_args, modifiable_args=(peft_config,),
)

We also provide framework.requires_augmentation to check if augmentation is required by the plugins.

Finally, pass the model to the trainer:

# e.g. using transformers.Trainer. Pass in the model (with training enhancements)
trainer = Trainer(model, ...)

# call train
trainer.train()

That's all! The model will now reap all acceleration speedups based on the plugins that were installed.

Configuration of Plugins

Each package in this monorepo:

  • can be independently installed. Install only the libraries you need:

    pip install fms-acceleration/plugins/accelerated-peft
    pip install fms-acceleration/plugins/fused-ops-and-kernels
    
  • can be independently configured. Each plugin is registered under a particular configuration path. E.g., the autogptq plugin is registered under the config path peft.quantization.auto_gptq.

    AccelerationPlugin.register_plugin(
        AutoGPTQAccelerationPlugin,
        configuration_and_paths=["peft.quantization.auto_gptq"], 
    )
    

    This means that it will be configured under that exact stanza:

    plugins:
        peft:
            quantization:
                auto_gptq:
                    # everything under here will be passed to plugin 
                    # when instantiating
                    ...
    
  • When instantiating fms_acceleration.AccelerationFramework, it internally parses the configuration stanzas. For plugins that are installed, it will instantiate them; stanzas for plugins that are not installed are simply passed through.

  • AccelerationFramework will manage plugins transparently for the user. The user only needs to call AccelerationFramework.model_loader and AccelerationFramework.augmentation.
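Conceptually, a registered path like peft.quantization.auto_gptq is just a key path into the nested plugins: stanza. A minimal, self-contained sketch of that lookup (dig is a hypothetical helper, not the library's actual parsing code, and the kernel field is illustrative):

```python
# Hypothetical helper illustrating how a dotted path such as
# "peft.quantization.auto_gptq" selects a stanza from the parsed YAML.
# This is NOT the library's internal implementation, just the idea.
def dig(config: dict, dotted_path: str):
    node = config
    for key in dotted_path.split("."):
        if not isinstance(node, dict) or key not in node:
            return None  # stanza absent -> plugin simply not activated
        node = node[key]
    return node

# nested dict as it would come out of parsing the YAML stanza above
raw = {
    "plugins": {
        "peft": {"quantization": {"auto_gptq": {"kernel": "triton_v2"}}},
    }
}

print(dig(raw["plugins"], "peft.quantization.auto_gptq"))
# -> {'kernel': 'triton_v2'}
```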

Adding New Plugins

To add new plugins:

  1. Create an appropriately pip-packaged plugin in plugins; the package needs to be named like fms-acceleration-<postfix>.

  2. For the framework to properly load and manage the plugin, add the package <postfix> to constants.py:

    PLUGINS = [
        "peft",
        "foak",
        "<postfix>",
    ]
    
  3. Create a sample template YAML file inside <PLUGIN_DIR>/configs to demonstrate how to configure the plugin. As an example, reference the sample config for accelerated-peft.
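    For illustration only, such a template might look like the stanza below; the path and field names are placeholders that must match whatever configuration path your plugin registers:

    ```yaml
    # <PLUGIN_DIR>/configs/<your_plugin>.yaml (illustrative placeholders only)
    plugins:
      <config path of your plugin>:
        # everything under the registered path is passed to the plugin
        <some_setting>: <some_value>
    ```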

  4. Update generate_sample_configurations.py and run tox -e gen-configs on the top level directory to generate the sample configurations.

    KEY_AUTO_GPTQ = "auto_gptq"
    KEY_BNB_NF4 = "bnb-nf4"
    PLUGIN_A = "<NEW PLUGIN NAME>"
    
    CONFIGURATIONS = {
        KEY_AUTO_GPTQ: "plugins/accelerated-peft/configs/autogptq.yaml",
        KEY_BNB_NF4: (
            "plugins/accelerated-peft/configs/bnb.yaml",
            [("peft.quantization.bitsandbytes.quant_type", "nf4")],
        ),
        PLUGIN_A: (
            "plugins/<plugin>/configs/plugin_config.yaml",
            [
                (<1st field in plugin_config.yaml>, <value>),
                (<2nd field in plugin_config.yaml>, <value>),
            ]
        )
    }
    
    # Passing a tuple of configuration keys will combine the templates together
    COMBINATIONS = [
        ("accelerated-peft-autogptq", (KEY_AUTO_GPTQ,)),
        ("accelerated-peft-bnb-nf4", (KEY_BNB_NF4,)),
        (<"combined name with your plugin">, (KEY_AUTO_GPTQ, PLUGIN_A)),
        (<"combined name with your plugin">, (KEY_BNB_NF4, PLUGIN_A)),
    ]
    
    
  5. After the sample configurations are generated by tox -e gen-configs, update CONTENTS.yaml with the shortname and the configuration fullpath.
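    As a rough illustration (the field names below are placeholders; follow the schema of the existing entries in CONTENTS.yaml):

    ```yaml
    # illustrative only; mirror the existing entries in CONTENTS.yaml
    - shortname: <your-plugin-shortname>
      fullpath: sample-configurations/<generated-config-name>.yaml
    ```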

  6. Update the scenarios YAML to configure benchmark test scenarios that will be triggered when running tox -e run-benches on the top-level directory.

  7. Update the top-level tox.ini to install the plugin for the run-benches.



