Swift: Scalable lightWeight Infrastructure for Fine-Tuning

These details have not been verified by PyPI

Project links

Homepage

Project description

SWIFT(Scalable lightWeight Infrastructure for Fine-Tuning)

Introduction

SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) is an extensible framwork designed to faciliate lightweight model fine-tuning and inference. It integrates implementations for various efficient fine-tuning methods, by embracing approaches that is parameter-efficient, memory-efficient, and time-efficient. SWIFT integrates seamlessly into ModelScope ecosystem and offers the capabilities to finetune various models, with a primary emphasis on LLMs and vision models. Additionally, SWIFT is fully compatible with PEFT, enabling users to leverage the familiar Peft interface to finetune ModelScope models.

Currently supported approches (and counting):

LoRA: LORA: LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS
Adapter: Parameter-Efficient Transfer Learning for NLP
Prompt Tuning: Visual Prompt Tuning
Side: Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks
ResTuning-Bypass
All tuners offered on PEFT

Key features:

By integrating the ModelScope library, models can be readily obatined via a model-id.
Tuners provided by SWIFT can be combined together to allow exploration of multiple tuners on a model for best result.
Support calling activate_adapter or deactivate_adapter or set_active_adapters to activate/deactivate tuners. User can inference with one model and multiple tuners in different threads independently.

Users can check the documentation of Swift to get detail tutorials.

LLM SFT Example

code link

supported SFT methods: LoRA, QLoRA, full(full parameter fine-tuning)
supported models:
1. qwen series: qwen-7b, qwen-7b-chat
2. qwen-vl series: qwen-vl, qwen-vl-chat
3. baichuan series: baichuan-7b, baichuan-13b, baichuan-13b-chat, baichuan2-7b, baichuan2-7b-chat, baichuan2-13b, baichuan2-13b-chat
4. chatglm2 series: chatglm2-6b, chatglm2-6b-32k
5. llama series: llama2-7b, llama2-7b-chat, llama2-13b, llama2-13b-chat, llama2-70b, llama2-70b-chat
6. openbuddy-llama series: openbuddy-llama2-13b, openbuddy-llama-65b, openbuddy-llama2-70b
7. internlm series: internlm-7b, internlm-7b-chat, internlm-7b-chat-8k, internlm-20b, internlm-20b-chat
8. other: polylm-13b, seqgpt-560m
supported features: quantization, DDP, model parallelism(device map), gradient checkpointing, gradient accumulation, pushing to modelscope hub, custom datasets, multimodal and agent SFT, mutli-round chat, ...
supported datasets:
1. NLP: alpaca-en(gpt4), alpaca-zh(gpt4), finance-en, multi-alpaca-all, code-en, instinwild-en, instinwild-zh, cot-en, cot-zh, firefly-all-zh, poetry-zh, instruct-en, gpt4all-en, cmnli-zh, jd-zh, dureader-robust-zh, medical-en, medical-zh, medical-mini-zh, sharegpt-en, sharegpt-zh, code-python-zh, advertise-gen
2. agent: damo-agent-zh, damo-agent-mini-zh
3. multi-modal: coco-en
4. other: cls-fudan-news-zh, ner-jave-zh
supported templates: chatml(qwen), baichuan, chatglm2, llama, openbuddy-llama, default, default-generation

Installation

SWIFT is running in Python environment. Please make sure your python version is higher than 3.8.

Install SWIFT by the pip command:

pip install ms-swift -U

Install SWIFT by source code(for running sft/infer examples), please run:

git clone https://github.com/modelscope/swift.git
cd swift
pip install -e .

SWIFT requires torch>=1.13.

Use SWIFT in our docker image:

docker pull registry.cn-hangzhou.aliyuncs.com/modelscope-repo/modelscope:ubuntu20.04-cuda11.8.0-py38-torch2.0.1-tf2.13.0-1.9.1

Getting Started

SWIFT supports multiple tuners, as well as tuners provided by PEFT. To use these tuners, simply call:

from swift import Swift, LoRAConfig
config = LoRAConfig(...)
model = Swift.prepare_model(model, config, extra_state_keys=['...'])

The code snippet above initialized the tuner randomly. The input model is an instance of torch.nn.Module, the config is a subclass instance of SwiftConfig or PeftConfig. extra_state_keys is the extra module weights(like the linear head) to be trained and stored in the output dir.

You may combine multiple tuners by:

from swift import Swift, LoRAConfig, PromptConfig
model = Swift.prepare_model(model, {'lora': LoRAConfig(...), 'prompt': PromptConfig(...)})

Call save_pretrained and push_to_hub after finetuning:

from swift import push_to_hub
model.save_pretrained('some-output-folder')
push_to_hub('my-group/some-repo-id-modelscope', 'some-output-folder', token='some-ms-token')

Assume my-group/some-repo-id-modelscope is the model-id in the hub, and some-ms-token is the token for uploading.

Using the model-id to do later inference:

from swift import Swift
model = Swift.from_pretrained(model, 'my-group/some-repo-id-modelscope')

Here shows a runnable example:

import os
import tempfile

# Please install modelscope by `pip install modelscope`
from modelscope import Model

from swift import LoRAConfig, SwiftModel, Swift, push_to_hub

tmp_dir = tempfile.TemporaryDirectory().name
if not os.path.exists(tmp_dir):
    os.makedirs(tmp_dir)


model = Model.from_pretrained('modelscope/Llama-2-7b-ms', device_map='auto')
lora_config = LoRAConfig(target_modules=['q_proj', 'k_proj', 'v_proj'])
model: SwiftModel = Swift.prepare_model(model, lora_config)
# Do some finetuning here
model.save_pretrained(tmp_dir)

push_to_hub('my-group/swift_llama2', output_dir=tmp_dir)
model = Model.from_pretrained('modelscope/Llama-2-7b-ms', device_map='auto')
model = SwiftModel.from_pretrained(model, 'my-group/swift_llama2', device_map='auto')

This is a example that uses transformers for model creation uses SWIFT for efficient tuning.

from swift import Swift, LoRAConfig, AdapterConfig, PromptConfig
from transformers import AutoModelForImageClassification

# init vit model
model = AutoModelForImageClassification.from_pretrained("google/vit-base-patch16-224")

# init lora tuner config
lora_config = LoRAConfig(
    r=10,  # the rank of the LoRA module
    target_modules=['query', 'key', 'value'],  # the modules to be replaced with the end of the module name
    merge_weights=False  # whether to merge weights
)

# init adapter tuner config
adapter_config = AdapterConfig(
    dim=768,  # the dimension of the hidden states
    hidden_pos=0,  # the position of the hidden state to passed into the adapter
    target_modules=r'.*attention.output.dense$',  # the modules to be replaced with regular expression
    adapter_length=10  # the length of the adapter length
)

# init prompt tuner config
prompt_config = PromptConfig(
    dim=768,  # the dimension of the hidden states
    target_modules=r'.*layer\.\d+$',  # the modules to be replaced with regular expression
    embedding_pos=0,    # the position of the embedding tensor
    prompt_length=10,   # the length of the prompt tokens
    attach_front=False  # Whether prompt is attached in front of the embedding
)

# create model with swift. In practice, you can use any of these tuners or a combination of them.
model = Swift.prepare_model(model, {"lora_tuner": lora_config, "adapter_tuner": adapter_config, "prompt_tuner": prompt_config})

# get the trainable parameters of model
model.get_trainable_parameters()
# 'trainable params: 838,776 || all params: 87,406,432 || trainable%: 0.9596273189597764'

You can use the features offered by Peft in SWIFT:

from swift import LoraConfig, Swift
from peft import TaskType
lora_config = LoraConfig(target_modules=['query', 'key', 'value'], task_type=TaskType.CAUSAL_LM)
model_wrapped = Swift.prepare_model(model, lora_config)

# or call from_pretrained to load weights in the modelhub
model_wrapped = Swift.from_pretrained(model, 'some-id-in-the-modelscope-modelhub')

The saving strategy between Swift tuners and Peft tuners are slightly different. You can name a tuner by:

model = Swift.prepare_model(model, {'default': LoRAConfig(...)})
model.save_pretrained('./output')

In the output dir, you will have a dir structure like this:

output
    |-- default
        |-- adapter_config.json
        |-- adapter_model.bin
    |-- adapter_config.json
    |-- adapter_model.bin

The config/weights stored in the output dir is the config of extra_state_keys and the weights of it. This is different from PEFT, which stores the weights and config of the default tuner.

Learn More

ModelScope library

ModelScope Library is the model library of ModelScope project, which contains a large number of popular models.
Contribute your own model to ModelScope

License

This project is licensed under the Apache License (Version 2.0).

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

4.0.1

Mar 8, 2026

4.0.0

Mar 3, 2026

3.12.6

Feb 28, 2026

3.12.5

Feb 14, 2026

3.12.4

Feb 3, 2026

3.12.3

Jan 24, 2026

3.12.2

Jan 17, 2026

3.12.1

Jan 8, 2026

3.12.0

Dec 30, 2025

3.11.3

Dec 28, 2025

3.11.2

Dec 21, 2025

3.11.1

Dec 15, 2025

3.11.0

Dec 9, 2025

3.10.3

Nov 30, 2025

3.10.2

Nov 23, 2025

3.10.1

Nov 16, 2025

3.10.0

Nov 11, 2025

3.9.3

Nov 4, 2025

3.9.2

Oct 26, 2025

3.9.1

Oct 19, 2025

3.9.0

Oct 13, 2025

3.8.3

Oct 1, 2025

3.8.2

Sep 23, 2025

3.8.1

Sep 15, 2025

3.8.0

Sep 9, 2025

3.7.3

Aug 30, 2025

3.7.2

Aug 21, 2025

3.7.1

Aug 16, 2025

3.7.0

Aug 7, 2025

3.6.4

Aug 2, 2025

3.6.3

Jul 29, 2025

3.6.2

Jul 18, 2025

3.6.1

Jul 11, 2025

3.6.0

Jul 8, 2025

3.5.3

Jun 27, 2025

3.5.2

Jun 20, 2025

3.5.1

Jun 13, 2025

3.5.0

Jun 8, 2025

3.4.1.post1

May 18, 2025

3.4.1

May 13, 2025

3.4.0

Apr 30, 2025

3.3.1

Apr 26, 2025

3.3.0.post1

Apr 12, 2025

3.3.0

Apr 11, 2025

3.2.2

Mar 25, 2025

3.2.1

Mar 14, 2025

3.2.0.post2

Mar 6, 2025

3.2.0

Mar 4, 2025

3.1.1.post1

Feb 24, 2025

3.1.1

Feb 20, 2025

3.1.0

Feb 7, 2025

3.0.3

Jan 22, 2025

3.0.2.post1

Jan 10, 2025

3.0.2

Jan 7, 2025

3.0.1.post1

Dec 29, 2024

3.0.1

Dec 27, 2024

3.0.0

Dec 20, 2024

2.6.1

Nov 29, 2024

2.6.0.post2

Nov 23, 2024

2.6.0.post1

Nov 19, 2024

2.6.0

Nov 13, 2024

2.5.2.post1

Nov 8, 2024

2.5.2

Nov 2, 2024

2.5.1.post1

Oct 26, 2024

2.5.1

Oct 21, 2024

2.5.0.post1

Oct 14, 2024

2.4.2.post2

Sep 24, 2024

2.4.2.post1

Sep 23, 2024

2.4.2

Sep 18, 2024

2.4.1

Sep 9, 2024

2.4.0.post1

Sep 2, 2024

2.4.0

Sep 2, 2024

2.3.2.post1

Aug 28, 2024

2.3.2

Aug 24, 2024

2.3.1

Aug 19, 2024

2.3.0.post1

Aug 11, 2024

2.3.0

Aug 9, 2024

2.2.5

Aug 2, 2024

2.2.4

Jul 26, 2024

2.2.3

Jul 20, 2024

2.2.2

Jul 13, 2024

2.2.1

Jul 8, 2024

2.2.0

Jul 5, 2024

2.1.1.post2

Jun 18, 2024

2.1.1.post1

Jun 17, 2024

2.1.1

Jun 17, 2024

2.1.0

Jun 7, 2024

2.0.5.post1

May 28, 2024

2.0.5

May 22, 2024

2.0.4

May 1, 2024

2.0.3.post1

Apr 24, 2024

2.0.3

Apr 23, 2024

2.0.2

Apr 17, 2024

2.0.1

Apr 17, 2024

2.0.0

Apr 15, 2024

1.7.3

Mar 17, 2024

1.7.2

Mar 11, 2024

1.7.1

Mar 10, 2024

1.7.0

Mar 9, 2024

1.6.3

Feb 29, 2024

1.6.2

Feb 28, 2024

1.6.1

Feb 21, 2024

1.6.0

Feb 7, 2024

1.5.4

Jan 31, 2024

1.5.3

Jan 22, 2024

1.5.2

Jan 9, 2024

1.5.1

Jan 7, 2024

1.5.0

Jan 1, 2024

1.4.0

Dec 7, 2023

1.3.0

Nov 7, 2023

1.2.1

Oct 18, 2023

1.2.0

Oct 10, 2023

This version

1.1.0

Sep 22, 2023

1.0.0

Aug 2, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ms-swift-1.1.0.tar.gz (63.6 kB view details)

Uploaded Sep 22, 2023 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ms_swift-1.1.0-py3-none-any.whl (82.2 kB view details)

Uploaded Sep 22, 2023 Python 3

File details

Details for the file ms-swift-1.1.0.tar.gz.

File metadata

Download URL: ms-swift-1.1.0.tar.gz
Upload date: Sep 22, 2023
Size: 63.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.17

File hashes

Hashes for ms-swift-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`478aa617b839f8a079b07074e8ece3c16a059143c08b0199d370e77b79c8c759`
MD5	`e03ea9557aecdb2b15f8476c7d6279d4`
BLAKE2b-256	`8ffc104884c8c096e603937364751baeaffe4d4a327d141bb88fa4a57ab41c70`

See more details on using hashes here.

File details

Details for the file ms_swift-1.1.0-py3-none-any.whl.

File metadata

Download URL: ms_swift-1.1.0-py3-none-any.whl
Upload date: Sep 22, 2023
Size: 82.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.7.17

File hashes

Hashes for ms_swift-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ed108a66fc25e05a4fd8fd71669140ec16a33d27d953e0b62ee91fc82c0fe20c`
MD5	`a60ebc375579d20118a9441f3c9111e7`
BLAKE2b-256	`4eea625a61ec5d3006b6f1d6c4ecdbc1cb6c28d16c4e1234edaa46d7d63406f8`

See more details on using hashes here.

ms-swift 1.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

SWIFT(Scalable lightWeight Infrastructure for Fine-Tuning)

Introduction

LLM SFT Example

Installation

Getting Started

Learn More

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes