Model defuser helper for HF Transformers.
Project description
Defuser
Model defuser helper for HF Transformers >= 5.0. In HF Transformers 5.x releases, many MoE modules became auto-stacked or auto-fused by new modeling code which has benefits but also downsides.
- Goal is to provide naive module/layer forwarding code for all models supported by HF transformers where run-time weight and structure level optimizations such weight merging, stacking, fusing are reversed so the model is operating in a simple naive state.
- There are cases, quantization libraries, where we need to run inference where module input/output needs to be individually captured and this pkg can help complete this task.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
defuser-0.0.2.tar.gz
(6.6 kB
view details)
File details
Details for the file defuser-0.0.2.tar.gz.
File metadata
- Download URL: defuser-0.0.2.tar.gz
- Upload date:
- Size: 6.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
09b6dde641ee5bf5d8507898535c8cf9e3680350b66a618059ac60a527fd73a9
|
|
| MD5 |
41ad10bc819ddcaae1d85d4b64fd516a
|
|
| BLAKE2b-256 |
ffe32e1318bd7f77b16fd476c58e22ffb58ebc8ef3f5fe56dda70c555e7fd549
|