Skip to main content

Octomil — serve, deploy, and observe ML models on edge devices

Project description

Octomil
Run ML models locally. Train them together.

CI PyPI MIT


Install

pip install octomil-sdk

Quick Start

Serve a model locally with an OpenAI-compatible API:

octomil serve gemma-1b
curl http://localhost:8080/v1/chat/completions \
  -d '{"model": "gemma-1b", "messages": [{"role": "user", "content": "Hello"}]}'

SDK Usage

from octomil import Octomil

client = Octomil(api_key="oct_...", org_id="org_123")

# Register a model
model = client.registry.ensure_model(name="sentiment", framework="pytorch")

# Gradual rollout
client.rollouts.create(model_id=model["id"], version="1.0.0", rollout_percentage=10)

# A/B test
client.experiments.create(name="v1-vs-v2", model_id=model["id"],
                          control_version="1.0.0", treatment_version="1.1.0")

Federated Learning

Train across devices without centralizing data:

from octomil import DeviceAuthClient, FederatedClient

auth = DeviceAuthClient(base_url="https://api.octomil.com", org_id="org_123",
                        device_identifier="runtime-001")
await auth.bootstrap(bootstrap_bearer_token=token)

client = FederatedClient(auth_token_provider=lambda: auth.get_access_token_sync(),
                         org_id="org_123")
client.register()

assignment = client.get_round_assignment()
if assignment:
    client.participate_in_round(round_id=assignment["round_id"],
                                local_train_fn=my_train_fn)

CLI

Command
octomil serve <model> Local inference server (OpenAI-compatible)
octomil pull <model> Download a model
octomil push <file> Upload a model
octomil deploy <model> Deploy to devices
octomil convert <file> Convert to CoreML / TFLite
octomil check <file> Validate a model
octomil scan <path> Security scan
octomil benchmark <model> Latency benchmarks
octomil login Authenticate

Documentation

docs.octomil.com/sdks/python

Contributing

See CONTRIBUTING.md.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

octomil_sdk-2.0.3.tar.gz (386.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

octomil_sdk-2.0.3-py3-none-any.whl (249.2 kB view details)

Uploaded Python 3

File details

Details for the file octomil_sdk-2.0.3.tar.gz.

File metadata

  • Download URL: octomil_sdk-2.0.3.tar.gz
  • Upload date:
  • Size: 386.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for octomil_sdk-2.0.3.tar.gz
Algorithm Hash digest
SHA256 e4b8f852098f76bb5dce6b4480547b5b23f540c48dc1759e7d0bce5a4dc459c0
MD5 ccb2c9db43af5d94fcefd0a3478286ef
BLAKE2b-256 b1301886889313b307f15ef164a1e41f8e435d0bd43251c32a4c886be7709017

See more details on using hashes here.

File details

Details for the file octomil_sdk-2.0.3-py3-none-any.whl.

File metadata

  • Download URL: octomil_sdk-2.0.3-py3-none-any.whl
  • Upload date:
  • Size: 249.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for octomil_sdk-2.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 0891e40c40133bf09944c1c8d023469a8e1a1bfa0d307a6854dc0a0d4978c54a
MD5 6187518b7b72425fccc84fc27e716095
BLAKE2b-256 4d546850746dc1ecb972c3d37a7a572b2bbcc7e8eb6077c150474aaad8fe76c5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page