Femtosense Model Optimization Toolkit
Project description
FemtoFlow
femtoflow
is a Python package that enables pruning and quantization of TensorFlow models for optimized deployment to Femtosense's SPU (Sparse Processing Unit). The package provides a straightforward and flexible interface for reducing model size and complexity while maintaining high-performance inference on the SPU.
Features
- Pruning: Reduce model size by removing unimportant connections and neurons from your TensorFlow model.
- Quantization: Lower the precision of weights and activations to reduce memory requirements and computational costs while preserving model accuracy.
Getting Started
You can install femtoflow
using pip
:
pip install femtoflow
Documentation
Detailed documentation for femtoflow
, including tutorials, API reference, and examples, can be found on the official website: https://femtoflow.femtosense.ai
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
femtoflow-0.1.7-py3-none-any.whl
(25.9 kB
view hashes)
Close
Hashes for femtoflow-0.1.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cf16038f3566c390c759a48b438aeecdf5852561e7b0ee146fd070e37dcdf39a |
|
MD5 | be735eed3d7bbe8698bf141b03a296a9 |
|
BLAKE2b-256 | 34146b8e1c93161f2f49b449e2905ba7f4656752cbf213688f6bbe50bf148609 |