
Model defuser helper for HF Transformers.

Project description


Defuser

Defuser is a model defusing helper for HF Transformers >= 5.0. In the Transformers 5.x releases, the new modeling code automatically stacks or fuses many MoE modules, which has benefits but also downsides.

  • The goal is to provide naive module/layer forwarding code for all models supported by HF Transformers, in which run-time weight and structure-level optimizations such as weight merging, stacking, and fusing are reversed so that the model operates in a simple, naive state.
  • Some use cases, such as quantization libraries, need to run inference while capturing the input and output of each module individually, and this package can help with that task.
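
To make the two points above concrete, here is a minimal, framework-free sketch of the idea. It is not Defuser's API: every name in it (`FusedExperts`, `NaiveLinear`, `defuse`, the hook signature) is hypothetical and chosen only for illustration. It assumes a fused MoE block keeps all expert weights in one stacked array, splits that array back into independent per-expert modules, and attaches a hook to each so its input and output can be recorded separately. In PyTorch the capture step would typically rely on `torch.nn.Module.register_forward_hook`.

```python
# Toy illustration only; none of these names come from the defuser package.

def matvec(weight, x):
    """Multiply a row-major matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]


class FusedExperts:
    """Experts stored as one stacked weight: [num_experts][out][in]."""

    def __init__(self, stacked_weight):
        self.stacked_weight = stacked_weight

    def forward(self, x, expert_idx):
        # Individual experts are not separate modules, so there is
        # nothing to hook on a per-expert basis.
        return matvec(self.stacked_weight[expert_idx], x)


class NaiveLinear:
    """A single expert with its own weight and an optional capture hook."""

    def __init__(self, weight):
        self.weight = weight
        self.hook = None

    def forward(self, x):
        y = matvec(self.weight, x)
        if self.hook is not None:
            self.hook(x, y)
        return y


def defuse(fused):
    """Split the stacked weight into independent per-expert modules."""
    return [NaiveLinear([row[:] for row in w]) for w in fused.stacked_weight]


fused = FusedExperts([
    [[1.0, 0.0], [0.0, 1.0]],  # expert 0
    [[2.0, 0.0], [0.0, 3.0]],  # expert 1
])
experts = defuse(fused)

# Record (input, output) pairs per expert, as a calibration pass might.
captured = {}

def make_hook(idx):
    def hook(inp, out):
        captured.setdefault(idx, []).append((list(inp), list(out)))
    return hook

for idx, expert in enumerate(experts):
    expert.hook = make_hook(idx)

x = [1.0, 2.0]
y = experts[1].forward(x)

# The defused expert computes the same result as the fused block.
assert y == fused.forward(x, 1)
```

After the call, `captured` holds the input and output seen by expert 1 only, which is the kind of per-module record a quantization pass needs and which the fused block cannot expose directly.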

Download files


Source Distribution

defuser-0.0.2.tar.gz (6.6 kB, source)

File details

Details for the file defuser-0.0.2.tar.gz.

File metadata

  • Download URL: defuser-0.0.2.tar.gz
  • Upload date:
  • Size: 6.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for defuser-0.0.2.tar.gz:

  • SHA256: 09b6dde641ee5bf5d8507898535c8cf9e3680350b66a618059ac60a527fd73a9
  • MD5: 41ad10bc819ddcaae1d85d4b64fd516a
  • BLAKE2b-256: ffe32e1318bd7f77b16fd476c58e22ffb58ebc8ef3f5fe56dda70c555e7fd549
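
A downloaded archive can be checked against the SHA256 digest listed above with Python's standard `hashlib` module. The sketch below is one way to do it; the helper names and the local file path are assumptions for illustration, not something this page specifies.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the hash list for defuser-0.0.2.tar.gz.
EXPECTED_SHA256 = "09b6dde641ee5bf5d8507898535c8cf9e3680350b66a618059ac60a527fd73a9"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Compare a file's SHA256 digest with the published one."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed location: the archive sits in the current directory.
    archive = Path("defuser-0.0.2.tar.gz")
    if archive.exists():
        print("hash OK" if matches_published_hash(archive) else "hash MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

pip can enforce the same check at install time when a requirements file pins the digest with `--hash=sha256:...`.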

