Skip to main content

CLI and OpenAI-compatible HTTP server for Apple Foundation Models (Apple Intelligence)

Project description

apple-fm-cli

A small command-line interface and local HTTP server that drive Apple’s on-device Foundation Models (Apple Intelligence) from Python. The repo bundles a ctypes-based apple_fm_sdk wrapper around the system model plus apple_fm_cli, which adds prompting, optional tools, JSON-schema-guided output, and an OpenAI-shaped API for clients like Codex.

Requirements

  • macOS with Foundation Models available (the SDK checks availability and reports a reason if not).
  • Python ≥ 3.14

Install

From PyPI (macOS, Python 3.14+):

pip install apple-fm-cli
# or: uv pip install apple-fm-cli

From a git checkout (editable):

pip install -e .
# or: uv pip install -e .

Entry point: apple-fm-cli.

The published wheel includes a prebuilt libFoundationModels.dylib (Apple silicon / arm64). To rebuild it after changing foundation-models-c, run swift build -c release in src/apple_fm_sdk/foundation-models-c and copy .build/*/release/libFoundationModels.dylib to src/apple_fm_sdk/lib/ before building distributions.

Publishing to PyPI (maintainers)

Automated (recommended): .github/workflows/publish.yml runs when you push a version tag v* (for example v0.1.3). It checks that the tag matches version in pyproject.toml, builds, uploads to PyPI, then creates a GitHub Release (with generated notes) if one does not already exist. One-time: configure PyPI trusted publishing for workflow publish.yml.

Typical release steps:

  1. Bump version in pyproject.toml and run uv lock if you track uv.lock in git.

  2. Rebuild and commit src/apple_fm_sdk/lib/libFoundationModels.dylib if the Swift bridge changed.

  3. Commit and push to main, then tag and push the tag:

    git tag v0.1.3
    git push origin v0.1.3
    

Manual dispatch: Actions → Publish to PyPI → Run workflow still works for the current branch (no tag check); use sparingly.

Manual upload: uv run --with build python -m build then uv run --with twine twine upload dist/* using an API token. Prefer trusted publishing in CI over storing long-lived tokens in ~/.pypirc or GitHub secrets.

CLI

Query the model (plain text or structured JSON):

apple-fm-cli query "Summarize this idea in one sentence."
apple-fm-cli query --format json --schema '{"type":"object","properties":{"title":{"type":"string"}}}' "Name this topic."

Optional tools (comma-separated): bash (local shell), google_search (DuckDuckGo lite + page fetch).

apple-fm-cli query --tools bash,google_search "What’s in README.md in the cwd?"

Legacy-style flags are still accepted: -q / --query, --output, --output-schema.

Server

Starts a FastAPI app that mimics parts of the OpenAI Chat Completions and Responses APIs (including SSE for streaming), so tools that expect those endpoints can point at your machine instead of a cloud provider. Large agent system prompts are truncated heuristically to fit smaller local context windows.

apple-fm-cli server --host 0.0.0.0 --port 8000
  • POST /v1/chat/completions
  • POST /v1/responses

Codex

  1. Start the server (see above), e.g. on port 8000.

  2. Add a provider and profile to ~/.codex/config.toml:

    [model_providers.apple]
    name = "apple"
    base_url = "http://localhost:8000/v1"
    env_key = "OPENAI_API_KEY"
    
    [profiles.apple]
    model = "fm"
    model_provider = "apple"
    model_context_window = 4096
    
  3. Run Codex with that profile:

    codex -p apple
    

env_key is the environment variable Codex uses for the bearer token. The local server does not need a real OpenAI key; set OPENAI_API_KEY to any non-empty placeholder if your Codex build requires it to be present.

Other agent harnesses

Anything that can target an OpenAI-compatible HTTP API (Chat Completions and/or Responses, including SSE) can point base_url at http://<host>:<port>/v1 and use a model id string of your choice—the server echoes the requested model name. Prefer a context window that matches what the on-device model can handle (4096 is a reasonable default for local sessions). If the client insists on an API key, keep using a dummy value in the configured env var unless you add auth in front of the server yourself.

Layout

Path Role
src/apple_fm_sdk/ Session, tools, guided generation, tokenizer, native bridge bindings
src/apple_fm_cli/ query / server, built-in tools, schema → Generable helpers
notes/ Design notes (Responses SSE lifecycle, native bridge, e2e testing)

Licensing

apple_fm_sdk source files carry Apple Inc. copyright headers; refer to any accompanying license text distributed with that SDK.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

apple_fm_cli-0.2.0.tar.gz (162.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

apple_fm_cli-0.2.0-py3-none-any.whl (135.8 kB view details)

Uploaded Python 3

File details

Details for the file apple_fm_cli-0.2.0.tar.gz.

File metadata

  • Download URL: apple_fm_cli-0.2.0.tar.gz
  • Upload date:
  • Size: 162.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for apple_fm_cli-0.2.0.tar.gz
Algorithm Hash digest
SHA256 fa7fbfd00f706d2dc1439e908f04397b54af4a7ac5431e997800ed7bd241e59c
MD5 9fcfcd19be988ec755a9c83498be66f2
BLAKE2b-256 79a1b7d8c4bd467526b4397f857165ae31756992af27de77cfb4394714330856

See more details on using hashes here.

Provenance

The following attestation bundles were made for apple_fm_cli-0.2.0.tar.gz:

Publisher: publish.yml on sohail288/apple-fm-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file apple_fm_cli-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: apple_fm_cli-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 135.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for apple_fm_cli-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6e9d490c6a87f27915a05f2ce21f8a6d977e688ac3431370cc5f1394f609aa1f
MD5 a556d311e73e70b27ecce3122bc5f565
BLAKE2b-256 6ef8cdb5b076b6db86381f648902fffa816a352597459a327007a53de8547db2

See more details on using hashes here.

Provenance

The following attestation bundles were made for apple_fm_cli-0.2.0-py3-none-any.whl:

Publisher: publish.yml on sohail288/apple-fm-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page