Python bindings for Apple's FoundationModels framework - on-device AI (requires macOS 26.0+)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

btucker

These details have not been verified by PyPI

Project description

apple-foundation-models

Python bindings for Apple's FoundationModels framework - Direct access to on-device Apple Intelligence.

Features

High-level Pythonic API: Context managers, async/await, type hints
Structured Outputs: JSON Schema and Pydantic model support
Async Streaming: Native async for support for streaming responses
Type Safety: Full type annotations with mypy support
Memory Safe: Automatic resource cleanup, no manual memory management
Thread Safe: All operations are thread-safe

Requirements

macOS 26.0+ (macOS Sequoia or later)
Python 3.9 or higher
Apple Intelligence enabled on your device

Installation

From PyPI

pip install apple-foundation-models

Optional dependencies:

# For Pydantic model support in structured outputs
pip install apple-foundation-models[pydantic]

From Source

# Clone the repository
git clone https://github.com/btucker/apple-foundation-models-py.git
cd apple-foundation-models-py

# Install (automatically builds Swift dylib and Cython extension)
pip install -e .

Requirements:

macOS 26.0+ (Sequoia) with Apple Intelligence enabled
Xcode command line tools (xcode-select --install)
Python 3.9 or higher

Note: The Swift dylib is built automatically during installation.

Quick Start

Basic Usage

from applefoundationmodels import Client

# Create a client (library auto-initializes)
with Client() as client:
    # Check if Apple Intelligence is available
    if not client.is_ready():
        print("Apple Intelligence is not available")
        print(client.get_availability_reason())
        return

    # Create a session
    session = client.create_session(
        instructions="You are a helpful assistant.",
        enable_guardrails=True
    )

    # Generate a response
    response = session.generate("What is the capital of France?")
    print(response)

    # Get conversation history
    history = session.get_history()
    for msg in history:
        print(f"{msg['role']}: {msg['content']}")

Async Streaming

import asyncio
from applefoundationmodels import Client

async def main():
    with Client() as client:
        session = client.create_session()

        # Stream response chunks as they arrive
        async for chunk in session.generate_stream("Tell me a story about a robot"):
            print(chunk, end='', flush=True)
        print()  # Newline after stream

asyncio.run(main())

Structured Output

from applefoundationmodels import Client

with Client() as client:
    session = client.create_session()

    # Define a JSON schema
    schema = {
        "type": "object",
        "properties": {
            "name": {"type": "string"},
            "age": {"type": "integer"},
            "city": {"type": "string"}
        },
        "required": ["name", "age", "city"]
    }

    # Generate structured response
    result = session.generate_structured(
        "Extract person info: Alice is 28 and lives in Paris",
        schema=schema
    )

    print(result)  # {'name': 'Alice', 'age': 28, 'city': 'Paris'}

Using Pydantic Models

You can also use Pydantic models for structured outputs (requires pip install pydantic>=2.0):

from applefoundationmodels import Client
from pydantic import BaseModel

class Person(BaseModel):
    name: str
    age: int
    city: str

with Client() as client:
    session = client.create_session()

    # Pass Pydantic model directly - no need for JSON schema!
    result = session.generate_structured(
        "Extract person info: Alice is 28 and lives in Paris",
        schema=Person
    )

    print(result)  # {'name': 'Alice', 'age': 28, 'city': 'Paris'}

    # Parse directly into a Pydantic model for validation
    person = Person(**result)
    print(person.name, person.age, person.city)  # Alice 28 Paris

Tool Calling

Tool calling allows the model to call your Python functions to access real-time data, perform actions, or integrate with external systems. Tools work with a simple decorator-based API:

from applefoundationmodels import Client

with Client() as client:
    session = client.create_session()

    # Register a tool with the @session.tool decorator
    @session.tool(description="Get current weather for a location")
    def get_weather(location: str, units: str = "celsius") -> str:
        """Fetch weather information from your weather API."""
        # Your implementation here
        return f"Weather in {location}: 22°{units[0].upper()}, sunny"

    @session.tool()
    def calculate(expression: str) -> float:
        """Evaluate a mathematical expression safely."""
        # Your implementation here
        return eval(expression)  # Use safe_eval in production!

    # The model will automatically call tools when needed
    response = session.generate(
        "What's the weather in Paris and what's 15 times 23?"
    )
    print(response)
    # "The weather in Paris is 22°C and sunny. 15 times 23 equals 345."

    # View the full conversation including tool calls
    for entry in session.transcript:
        print(f"{entry['type']}: {entry.get('content', '')}")

Features:

Automatic schema generation from Python type hints
Parallel tool execution when the model calls multiple tools
Full transcript access showing all tool calls and outputs
Error handling with detailed error information
Type-safe with complete type annotations

Schema Extraction:

The library automatically extracts JSON schemas from your Python functions:

@session.tool(description="Search documentation")
def search_docs(query: str, limit: int = 10, category: str = "all") -> list:
    """Search the documentation database."""
    # Implementation...
    return results

# Automatically generates:
# {
#   "name": "search_docs",
#   "description": "Search documentation",
#   "parameters": {
#     "type": "object",
#     "properties": {
#       "query": {"type": "string"},
#       "limit": {"type": "integer"},
#       "category": {"type": "string"}
#     },
#     "required": ["query"]
#   }
# }

Transcript Access:

View the complete conversation history including tool interactions:

# After generating with tools
for entry in session.transcript:
    match entry['type']:
        case 'prompt':
            print(f"User: {entry['content']}")
        case 'tool_calls':
            for call in entry['tool_calls']:
                print(f"Calling tool: {call['id']}")
        case 'tool_output':
            print(f"Tool result: {entry['content']}")
        case 'response':
            print(f"Assistant: {entry['content']}")

Supported Parameter Types:

Tool calling works with various parameter signatures:

No parameters
Single parameters (string, int, float, bool)
Multiple parameters with mixed types
Optional parameters with default values
Lists and nested objects

See examples/tool_calling_comprehensive.py for complete examples of all supported patterns.

Generation Parameters

# Control generation with parameters
response = session.generate(
    "Write a creative story",
    temperature=1.5,      # Higher = more creative (0.0-2.0)
    max_tokens=500,       # Limit response length
    seed=42               # Reproducible outputs
)

Session Management

with Client() as client:
    # Create multiple sessions
    chat_session = client.create_session(
        instructions="You are a friendly chatbot"
    )
    code_session = client.create_session(
        instructions="You are a code review assistant"
    )

    # Each session maintains separate conversation history
    chat_response = chat_session.generate("Hello!")
    code_response = code_session.generate("Review this code: ...")

    # Clear history while keeping session
    chat_session.clear_history()

    # Manually add messages
    chat_session.add_message("system", "Be concise")

Statistics

with Client() as client:
    session = client.create_session()

    # Generate some responses
    for i in range(5):
        session.generate(f"Question {i}")

    # Get statistics
    stats = client.get_stats()
    print(f"Total requests: {stats['total_requests']}")
    print(f"Success rate: {stats['successful_requests'] / stats['total_requests'] * 100:.1f}%")
    print(f"Avg response time: {stats['average_response_time']:.2f}s")

    # Reset statistics
    client.reset_stats()

API Reference

Client

The main entry point for using libai.

class Client:
    def __init__() -> None: ...
    def __enter__() -> Client: ...
    def __exit__(...) -> None: ...

    @staticmethod
    def check_availability() -> Availability: ...
    @staticmethod
    def get_availability_reason() -> str: ...
    @staticmethod
    def is_ready() -> bool: ...
    @staticmethod
    def get_version() -> str: ...
    @staticmethod
    def get_supported_languages() -> List[str]: ...

    def create_session(...) -> Session: ...
    def get_stats() -> Stats: ...
    def reset_stats() -> None: ...
    def close() -> None: ...

Session

Manages conversation state and text generation.

class Session:
    def __enter__() -> Session: ...
    def __exit__(...) -> None: ...

    def generate(prompt: str, **params) -> str: ...
    def generate_structured(prompt: str, schema: dict, **params) -> dict: ...
    async def generate_stream(prompt: str, **params) -> AsyncIterator[str]: ...

    def tool(description: str = None, name: str = None) -> Callable: ...
    @property
    def transcript() -> List[dict]: ...

    def get_history() -> List[dict]: ...
    def clear_history() -> None: ...
    def add_message(role: str, content: str) -> None: ...
    def close() -> None: ...

Types

class Availability(IntEnum):
    AVAILABLE = 1
    DEVICE_NOT_ELIGIBLE = -1
    NOT_ENABLED = -2
    MODEL_NOT_READY = -3

class SessionConfig(TypedDict):
    instructions: Optional[str]
    tools_json: Optional[str]
    enable_guardrails: bool
    prewarm: bool

class GenerationParams(TypedDict):
    temperature: float
    max_tokens: int
    seed: int

class Stats(TypedDict):
    total_requests: int
    successful_requests: int
    failed_requests: int
    total_tokens_generated: int
    average_response_time: float
    total_processing_time: float

Exceptions

All exceptions inherit from FoundationModelsError:

InitializationError - Library initialization failed
NotAvailableError - Apple Intelligence not available
InvalidParametersError - Invalid parameters
MemoryError - Memory allocation failed
JSONParseError - JSON parsing error
GenerationError - Text generation failed
TimeoutError - Operation timeout
SessionNotFoundError - Session not found
StreamNotFoundError - Stream not found
GuardrailViolationError - Content blocked by safety filters
ToolNotFoundError - Tool not registered
ToolExecutionError - Tool execution failed
ToolCallError - Tool call error (validation, schema, etc.)
UnknownError - Unknown error

Examples

See the examples/ directory for complete working examples:

basic_chat.py - Simple conversation
streaming_chat.py - Async streaming
structured_output.py - JSON schema validation
tool_calling_comprehensive.py - Complete tool calling demonstration with all parameter types

Development

Building from Source

This project uses uv for fast, reliable builds and dependency management:

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install development dependencies
uv sync --extra dev

# Run tests
uv run pytest

# Type checking
uv run mypy applefoundationmodels

# Format code
uv run black applefoundationmodels examples

# Build wheels
uv build --wheel

You can also use pip if preferred:

pip install -e ".[dev]"
pytest
mypy applefoundationmodels
black applefoundationmodels examples

Project Structure

apple-foundation-models-py/
├── applefoundationmodels/   # Python package
│   ├── __init__.py     # Public API
│   ├── _foundationmodels.pyx  # Cython bindings
│   ├── _foundationmodels.pxd  # C declarations
│   ├── client.py       # High-level Client
│   ├── session.py      # Session management
│   ├── types.py        # Type definitions
│   ├── exceptions.py   # Exception classes
│   └── swift/          # Swift FoundationModels bindings
│       ├── foundation_models.swift  # Swift implementation
│       └── foundation_models.h      # C FFI header
├── lib/                # Swift dylib and modules (auto-generated)
│   └── libfoundation_models.dylib    # Compiled Swift library
├── examples/           # Example scripts
└── tests/              # Unit tests

Architecture

apple-foundation-models-py uses a layered architecture for optimal performance:

Python API (client.py, session.py)
         ↓
    Cython FFI (_foundationmodels.pyx)
         ↓
    C FFI Layer (foundation_models.h)
         ↓
  Swift Implementation (foundation_models.swift)
         ↓
  FoundationModels Framework (Apple Intelligence)

Key Design Decisions:

Direct FoundationModels Integration: No intermediate C library - Swift calls FoundationModels directly
Minimal Overhead: C FFI layer provides thin wrapper for Python/Swift communication
Async Coordination: Uses semaphores to bridge Swift's async/await with synchronous C calls
Streaming: Real-time delta calculation from FoundationModels snapshot-based streaming

Performance

Cython-compiled for near-C performance
Direct Swift → FoundationModels calls (no intermediate libraries)
Async streaming with delta-based chunk delivery
No GIL during Swift library calls (when possible)

Troubleshooting

Apple Intelligence not available

If you get NotAvailableError:

Ensure you're running macOS 26.0 (Sequoia) or later
Check System Settings → Apple Intelligence → Enable
Wait for models to download (check with client.get_availability_reason())

Import errors

If you get import errors after installation:

# Rebuild everything (Swift dylib + Cython extension)
pip install --force-reinstall --no-cache-dir -e .

Compilation errors

Ensure you have Xcode command line tools:

xcode-select --install

If the Swift build fails during installation:

Verify macOS version: sw_vers -productVersion (should be 26.0+)
Check Swift compiler: swiftc --version
Clean and reinstall: pip install --force-reinstall --no-cache-dir -e .

License

MIT License - see LICENSE file for details

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Add tests for new functionality
Ensure all tests pass
Submit a pull request

Acknowledgments

This project was inspired by and learned from several excellent works:

libai by 6over3 Institute - The original C library wrapper for FoundationModels that demonstrated the possibility of non-Objective-C access to Apple Intelligence. While we ultimately chose a direct Swift integration approach, the libai project's API design and documentation heavily influenced our Python API structure.
apple-on-device-ai by Meridius Labs - The Node.js bindings that showed the path to direct FoundationModels integration via Swift. Their architecture of using Swift → C FFI → JavaScript inspired our Swift → C FFI → Cython → Python approach, and their code examples were invaluable for understanding the FoundationModels API.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

btucker

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.2

Nov 13, 2025

0.2.1

Nov 12, 2025

0.2.0

Nov 12, 2025

0.1.9

Nov 10, 2025

0.1.8

Nov 9, 2025

0.1.7

Nov 9, 2025

0.1.6

Nov 8, 2025

This version

0.1.5

Nov 7, 2025

0.1.3

Nov 6, 2025

0.1.0

Nov 6, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

apple_foundation_models-0.1.5.tar.gz (144.8 kB view details)

Uploaded Nov 7, 2025 Source

File details

Details for the file apple_foundation_models-0.1.5.tar.gz.

File metadata

Download URL: apple_foundation_models-0.1.5.tar.gz
Upload date: Nov 7, 2025
Size: 144.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for apple_foundation_models-0.1.5.tar.gz
Algorithm	Hash digest
SHA256	`ab6974b642ca8d33808572883ddba3d99305490e58dee2bb76a06c1b104df7a5`
MD5	`65e543dd501aec18083bf2d765e658b1`
BLAKE2b-256	`78e55b3b267970a9d8a555ccb401caf60f7ed1812dab9f91917af29ab01bf45d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for apple_foundation_models-0.1.5.tar.gz:

Publisher: publish-to-pypi.yml on btucker/apple-foundation-models-py

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: apple_foundation_models-0.1.5.tar.gz
- Subject digest: ab6974b642ca8d33808572883ddba3d99305490e58dee2bb76a06c1b104df7a5
- Sigstore transparency entry: 684095989
- Sigstore integration time: Nov 7, 2025
Source repository:
- Permalink: btucker/apple-foundation-models-py@29e58e36d3cac3d9913aedd56d55acd80a445c3d
- Branch / Tag: refs/tags/v0.1.5
- Owner: https://github.com/btucker
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@29e58e36d3cac3d9913aedd56d55acd80a445c3d
- Trigger Event: release

apple-foundation-models 0.1.5

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

apple-foundation-models

Features

Requirements

Installation

From PyPI

From Source

Quick Start

Basic Usage

Async Streaming

Structured Output

Using Pydantic Models

Tool Calling

Generation Parameters

Session Management

Statistics

API Reference

Client

Session

Types

Exceptions

Examples

Development

Building from Source

Project Structure

Architecture

Performance

Troubleshooting

Apple Intelligence not available

Import errors

Compilation errors

License

Contributing

Links

Acknowledgments

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

Provenance