Skip to main content

Femtosense Model Optimization Toolkit

Project description

Build Status

FemtoFlow

femtoflow is a Python package that enables pruning and quantization of TensorFlow models for optimized deployment to Femtosense's SPU (Sparse Processing Unit). The package provides a straightforward and flexible interface for reducing model size and complexity while maintaining high-performance inference on the SPU.

Features

  • Pruning: Reduce model size by removing unimportant connections and neurons from your TensorFlow model.
  • Quantization: Lower the precision of weights and activations to reduce memory requirements and computational costs while preserving model accuracy.

Getting Started

You can install femtoflow using pip:

pip install femtoflow

Documentation

Detailed documentation for femtoflow, including tutorials, API reference, and examples, can be found on the official website: https://femtoflow.femtosense.ai

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

femtoflow-0.1.8-py3-none-any.whl (76.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page