Femtosense Model Optimization Toolkit
Project description
FemtoFlow
femtoflow is a Python package that enables pruning and quantization of TensorFlow models for optimized deployment to Femtosense's SPU (Sparse Processing Unit). The package provides a straightforward and flexible interface for reducing model size and complexity while maintaining high-performance inference on the SPU.
Features
- Pruning: Reduce model size by removing unimportant connections and neurons from your TensorFlow model.
- Quantization: Lower the precision of weights and activations to reduce memory requirements and computational costs while preserving model accuracy.
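To make the two features concrete, here is a minimal, library-independent sketch of what pruning and quantization do to a flat list of weights. It does not use the femtoflow API; the function names (`magnitude_prune`, `quantize_int8`, `dequantize`) and the choice of magnitude-based pruning with symmetric 8-bit quantization are assumptions made for illustration only, and the package may implement these steps differently.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the smallest-magnitude fraction `sparsity` of the weights."""
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    n_prune = int(len(weights) * sparsity)
    # Indices sorted by ascending magnitude; the first n_prune are zeroed.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def quantize_int8(weights):
    """Symmetric per-tensor quantization to integers in [-127, 127]."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Fall back to a scale of 1.0 when every weight is zero.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Map quantized integers back to approximate float values."""
    return [v * scale for v in q]


# Illustrative weights, not taken from any real model.
weights = [0.02, -0.75, 0.10, 0.50, -0.01, 0.33]
pruned = magnitude_prune(weights, sparsity=0.5)
q, scale = quantize_int8(pruned)
restored = dequantize(q, scale)
print(pruned)
print(q)
```

In this sketch, half of the weights become exact zeros, and each surviving weight is stored as a small integer plus one shared scale factor, so the restored values differ from the pruned ones by at most half a quantization step.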
Getting Started
You can install femtoflow using pip:
pip install femtoflow
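One way to confirm that the installation succeeded is to ask Python's standard `importlib.metadata` module for the installed version. The helper name `installed_version` below is hypothetical and not part of femtoflow; the sketch only assumes that the distribution is registered under the name `femtoflow`, as in the pip command above.

```python
from importlib import metadata


def installed_version(package_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("femtoflow")
if version is None:
    print("femtoflow is not installed in this environment")
else:
    print(f"femtoflow {version} is installed")
```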
Documentation
Detailed documentation for femtoflow, including tutorials, API reference, and examples, can be found on the official website: https://femtoflow.femtosense.ai
Download files
Source Distributions
No source distribution files are available for this release.
Built Distribution
femtoflow-0.1.8-py3-none-any.whl (76.6 kB)
File details
Details for the file femtoflow-0.1.8-py3-none-any.whl.
File metadata
- Download URL: femtoflow-0.1.8-py3-none-any.whl
- Upload date:
- Size: 76.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | 4b095f8b463680429a40b66957647d47f6602e8806e470761e6100fb19684f49
MD5 | 9a084c94c05573b99b893d63d9fe41af
BLAKE2b-256 | 5d2fe9ee1b92464c835167e55124a0747596571b1bcde77aa88e8239b7b4b19a
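
The published digests can be used to check a downloaded wheel before installing it. The following is a minimal sketch using only Python's standard `hashlib` module; the helper names and the local path `WHEEL_PATH` are assumptions for illustration, and the check is skipped if no file exists at that path.

```python
import hashlib
import hmac
import os


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


# SHA256 digest listed in the table above.
EXPECTED_SHA256 = "4b095f8b463680429a40b66957647d47f6602e8806e470761e6100fb19684f49"
# Assumed location of the downloaded wheel; adjust to wherever it was saved.
WHEEL_PATH = "femtoflow-0.1.8-py3-none-any.whl"

if os.path.exists(WHEEL_PATH):
    ok = matches_published_sha256(WHEEL_PATH, EXPECTED_SHA256)
    print("SHA256 matches" if ok else "SHA256 MISMATCH: do not install this file")
```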