Python bindings for encoderfile.

Project description

🚀 Overview

Encoderfile packages transformer encoders—optionally with classification heads—into a single, self-contained executable. No Python runtime, no dependencies, no network calls. Just a fast, portable binary that runs anywhere.

While Llamafile focuses on generative models, Encoderfile is purpose-built for encoder architectures with optional classification heads. It supports embedding, sequence classification, and token classification models—covering most encoder-based NLP tasks, from text similarity to classification and tagging—all within one compact binary.

Under the hood, Encoderfile uses ONNX Runtime for inference, ensuring compatibility with a wide range of transformer architectures.

Why?

  • Smaller footprint: a single binary measured in tens to hundreds of megabytes, not gigabytes of runtime and packages
  • Compliance-friendly: deterministic, offline, security-boundary-safe
  • Integration-ready: drop into existing systems as a CLI, microservice, or API without refactoring your stack

Encoderfiles can run as:

  • REST API
  • gRPC microservice
  • CLI for batch processing
  • MCP server (Model Context Protocol)

Architecture Diagram

Supported Architectures

Encoderfile supports the following Hugging Face model classes (and their ONNX-exported equivalents):

  • Embeddings / Feature Extraction: AutoModel, AutoModelForMaskedLM (e.g., bert-base-uncased, distilbert-base-uncased)
  • Sequence Classification: AutoModelForSequenceClassification (e.g., distilbert-base-uncased-finetuned-sst-2-english, roberta-large-mnli)
  • Token Classification: AutoModelForTokenClassification (e.g., dslim/bert-base-NER, bert-base-cased-finetuned-conll03-english)
  • ✅ All architectures must be encoder-only transformers — no decoders, no encoder–decoder hybrids (so no T5, no BART).
  • ⚙️ Models must have ONNX-exported weights (path/to/your/model/model.onnx).
  • 🧠 The ONNX graph inputs must include input_ids and may optionally include attention_mask (see the check after this list).
  • 🚫 Models relying on generation heads (AutoModelForSeq2SeqLM, AutoModelForCausalLM, etc.) are not supported.
  • XLNet, Transformer-XL, and derivative architectures are not yet supported.
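
You can verify the input requirement before building by inspecting the exported graph with the onnx Python package. A minimal sketch (assumes the package is installed and the path points at your own export):

import onnx

# Load the exported model and list the graph's declared inputs
model = onnx.load("path/to/your/model/model.onnx")
input_names = [i.name for i in model.graph.input]
print(input_names)

# input_ids is required; attention_mask is optional
assert "input_ids" in input_names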

📦 Installation

Option 1: Download Pre-built CLI Tool (Recommended)

Download the encoderfile CLI tool to build your own model binaries:

curl -fsSL https://raw.githubusercontent.com/mozilla-ai/encoderfile/main/install.sh | sh

Note for Windows users: Pre-built binaries are not available for Windows. Please see our guide on building from source for instructions.

Move the binary to a location in your PATH:

# Linux/macOS
sudo mv encoderfile /usr/local/bin/

# Or add to your user bin
mkdir -p ~/.local/bin
mv encoderfile ~/.local/bin/

Option 2: Build CLI Tool from Source

See our guide on building from source for detailed instructions.

Quick build:

cargo build --bin encoderfile --release
./target/release/encoderfile --help

🚀 Quick Start

Step 1: Prepare Your Model

First, you need an ONNX-exported model. Export a compatible Hugging Face model:

Requires Python 3.13+ for ONNX export

# Install optimum for ONNX export
pip install optimum[onnx]

# Export a sentiment analysis model
optimum-cli export onnx \
  --model distilbert-base-uncased-finetuned-sst-2-english \
  --task text-classification \
  ./sentiment-model
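
Before building, it can help to sanity-check the export by loading it with onnxruntime. A minimal sketch (assumes onnxruntime is installed, e.g. via pip install onnxruntime, and that the export above produced ./sentiment-model/model.onnx):

import onnxruntime as ort

# Open a CPU inference session over the exported graph
session = ort.InferenceSession(
    "./sentiment-model/model.onnx",
    providers=["CPUExecutionProvider"],
)

print([i.name for i in session.get_inputs()])   # e.g. ['input_ids', 'attention_mask']
print([o.name for o in session.get_outputs()])  # e.g. ['logits']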

Step 2: Create Configuration File

Create sentiment-config.yml:

encoderfile:
  name: sentiment-analyzer
  path: ./sentiment-model
  model_type: sequence_classification
  output_path: ./build/sentiment-analyzer.encoderfile

Step 3: Build Your Encoderfile

Use the downloaded encoderfile CLI tool:

encoderfile build -f sentiment-config.yml

This creates a self-contained binary at ./build/sentiment-analyzer.encoderfile.

Step 4: Run Your Model

Start the server:

./build/sentiment-analyzer.encoderfile serve

The server will start on http://localhost:8080 by default.

Making Predictions

Sentiment Analysis:

curl -X POST http://localhost:8080/predict \
  -H "Content-Type: application/json" \
  -d '{
    "inputs": [
      "This is the cutest cat ever!",
      "Boring video, waste of time",
      "These cats are so funny!"
    ]
  }'

Response:

{
  "results": [
    {
      "logits": [0.00021549065, 0.9997845],
      "scores": [0.00021549074, 0.9997845],
      "predicted_index": 1,
      "predicted_label": "POSITIVE"
    },
    {
      "logits": [0.9998148, 0.00018516644],
      "scores": [0.9998148, 0.0001851664],
      "predicted_index": 0,
      "predicted_label": "NEGATIVE"
    },
    {
      "logits": [0.00014975034, 0.9998503],
      "scores": [0.00014975043, 0.9998503],
      "predicted_index": 1,
      "predicted_label": "POSITIVE"
    }
  ],
  "model_id": "sentiment-analyzer"
}
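
The same endpoint works from any HTTP client. A minimal Python sketch using requests (assumes the server above is running on its default port; the response fields match the example response shown above):

import requests

response = requests.post(
    "http://localhost:8080/predict",
    json={"inputs": ["This is the cutest cat ever!", "Boring video, waste of time"]},
)
response.raise_for_status()

# Each result carries logits, scores, and the predicted label
for result in response.json()["results"]:
    print(result["predicted_label"], max(result["scores"]))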

Embeddings:

curl -X POST http://localhost:8080/predict \
  -H "Content-Type: application/json" \
  -d '{
    "inputs": ["Hello world"],
    "normalize": true
  }'
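
With normalize set to true, cosine similarity between two returned vectors reduces to a dot product. A hedged sketch (the results and embedding field names below are assumptions, not confirmed by this page; check your binary's actual embedding response schema):

import requests

response = requests.post(
    "http://localhost:8080/predict",
    json={"inputs": ["Hello world", "Hello there"], "normalize": True},
)

# "embedding" is an assumed field name for the returned vector
vectors = [r["embedding"] for r in response.json()["results"]]

# For normalized vectors, cosine similarity equals the dot product
similarity = sum(a * b for a, b in zip(vectors[0], vectors[1]))
print(similarity)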

Token Classification (NER):

curl -X POST http://localhost:8080/predict \
  -H "Content-Type: application/json" \
  -d '{
    "inputs": ["Apple Inc. is located in Cupertino, California"]
  }'

🎯 Usage Modes

  • REST API: ./my-model.encoderfile serve (default: http://localhost:8080)
  • gRPC: ./my-model.encoderfile serve (default: localhost:50051)
  • CLI: ./my-model.encoderfile infer "text" (writes to stdout)
  • MCP server: ./my-model.encoderfile mcp

Both HTTP and gRPC servers start by default. Use --disable-grpc or --disable-http to run only one.

See the CLI Reference for all server options, port configuration, and output formats.

📚 Documentation

🛠️ Building Custom Encoderfiles

Once you have the encoderfile CLI tool installed, you can build binaries from any compatible Hugging Face model.

See our guide on building from source for detailed instructions including:

  • How to export models to ONNX format
  • Configuration file options
  • Advanced features (Lua transforms, custom paths, etc.)
  • Troubleshooting tips

Quick workflow:

  1. Export your model to ONNX: optimum-cli export onnx ...
  2. Create a config file: config.yml
  3. Build the binary: encoderfile build -f config.yml
  4. Deploy anywhere: ./build/my-model.encoderfile serve

🤝 Contributing

We welcome contributions! See CONTRIBUTING.md for guidelines.

Development Setup

Make sure you have Just installed.

# Clone the repository
git clone https://github.com/mozilla-ai/encoderfile.git
cd encoderfile

# Set up development environment
just setup

# Run tests
just test

# Build documentation 
just docs

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

Download files

Download the file for your platform.

Source Distributions

No source distribution files are available for this release.

Built Distributions

  • encoderfile-0.6.1rc1-cp313-abi3-manylinux_2_38_x86_64.whl (14.1 MB): CPython 3.13+, manylinux: glibc 2.38+, x86-64
  • encoderfile-0.6.1rc1-cp313-abi3-manylinux_2_38_aarch64.whl (13.5 MB): CPython 3.13+, manylinux: glibc 2.38+, ARM64
  • encoderfile-0.6.1rc1-cp313-abi3-macosx_11_0_arm64.whl (11.6 MB): CPython 3.13+, macOS 11.0+, ARM64
  • encoderfile-0.6.1rc1-cp313-abi3-macosx_10_12_x86_64.whl (13.0 MB): CPython 3.13+, macOS 10.12+, x86-64

File details

Details for the file encoderfile-0.6.1rc1-cp313-abi3-manylinux_2_38_x86_64.whl.

File metadata

  • Size: 14.1 MB
  • Tags: CPython 3.13+, manylinux: glibc 2.38+, x86-64
  • Uploaded using Trusted Publishing: Yes
  • Uploaded via: uv/0.11.2

File hashes

  • SHA256: e0df3f0cd6996a758afad98d6828cefb96a92d3fb9d22a72fbf2361a7cff1597
  • MD5: b34e8e78290b666a428d561eba028a2e
  • BLAKE2b-256: 2155781e7e4d0ddb12b8e82b15031c193a5325c84ee17d2db5687b8ee1455b90
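
To check a downloaded wheel against a published digest, the Python standard library is enough. A minimal sketch (assumes the wheel sits in the current directory):

import hashlib

expected = "e0df3f0cd6996a758afad98d6828cefb96a92d3fb9d22a72fbf2361a7cff1597"

# Hash the downloaded file and compare against the published SHA256
with open("encoderfile-0.6.1rc1-cp313-abi3-manylinux_2_38_x86_64.whl", "rb") as f:
    digest = hashlib.sha256(f.read()).hexdigest()

assert digest == expected, "hash mismatch: do not install this file"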

File details

Details for the file encoderfile-0.6.1rc1-cp313-abi3-manylinux_2_38_aarch64.whl.

File metadata

  • Size: 13.5 MB
  • Tags: CPython 3.13+, manylinux: glibc 2.38+, ARM64
  • Uploaded using Trusted Publishing: Yes
  • Uploaded via: uv/0.11.2

File hashes

  • SHA256: 97aafd2a62055352359901820ac9f463866088b10e4da67d6601a096fe49af86
  • MD5: 3aac90eb405f43edf8b592f257442eb7
  • BLAKE2b-256: 66205712ba76f41a8763c664b00963eaa08dcae23e4a26821db63cd5a2a20745

File details

Details for the file encoderfile-0.6.1rc1-cp313-abi3-macosx_11_0_arm64.whl.

File metadata

  • Size: 11.6 MB
  • Tags: CPython 3.13+, macOS 11.0+, ARM64
  • Uploaded using Trusted Publishing: Yes
  • Uploaded via: uv/0.11.2

File hashes

  • SHA256: bb48546c5d09d837f83217ed3c825e496defeef47e2b91c5f9fb1d09a73f2554
  • MD5: 3385590c4be19f615ca406a51da0c669
  • BLAKE2b-256: 255a1ab6da937bcf2a40a161dd614063f2ecd4669f2dcbf3559fe8b74ea7af1b

File details

Details for the file encoderfile-0.6.1rc1-cp313-abi3-macosx_10_12_x86_64.whl.

File metadata

  • Size: 13.0 MB
  • Tags: CPython 3.13+, macOS 10.12+, x86-64
  • Uploaded using Trusted Publishing: Yes
  • Uploaded via: uv/0.11.2

File hashes

  • SHA256: 5823ac85c5939bd8d56381ecd0eec84ed78114dca9ed29ca9ac0a4ffade6b944
  • MD5: 1910dc2dab02d8718ad0ea383b02a5f7
  • BLAKE2b-256: 2b663928763d35dac7b0e3a28dcb32c13536bbe3a20e34c7a907d27ff17b70a1
