Lightweight Python abstractions and connectors for LLM providers (OpenAI, Claude, Gemini, Ollama).

modelito

Modelito is a compact, dependency-light Python library that provides provider-agnostic abstractions and connectors for large language models (LLMs). It offers lightweight shims for OpenAI, Claude, Gemini and local Ollama deployments, plus utilities for token counting, timeout estimation, and small helpers to manage Ollama servers when needed. The library is designed for easy integration into applications and CI pipelines.

Quick start

Install

To install the latest released version from PyPI:

pip install modelito

For development / contributor setup (editable install and dev dependencies):

pip install -e .[dev]
pip install -r dev-requirements.txt

# Optional extras
pip install -e .[ollama,tokenization,openai,anthropic]

Run tests (for contributors):

pytest -q

Install from TestPyPI (preview builds)

If you need to test a preview build published to TestPyPI, use the TestPyPI index. TestPyPI packages are for testing only and may not be stable.

python -m pip install --index-url https://test.pypi.org/simple/ \
	--extra-index-url https://pypi.org/simple modelito==<version>

If installation from the index fails, download the wheel from the TestPyPI "Files" page and install it directly.

Build and install

To build a source distribution and wheel locally:

python -m pip install --upgrade build
python -m build

Install from the built wheel:

pip install dist/*.whl

See the docs/ folder for more details on calibration and migration.

Providers

This package provides compatibility shims and small, dependency-light implementations for common provider interfaces. When optional extras are installed the package will attempt to use real SDK clients; otherwise the shims provide safe offline-friendly fallbacks suitable for testing.

Provided shims and utilities:

  • OllamaProvider — HTTP-aware provider that will call a local Ollama HTTP API when available. If the HTTP API is unavailable the provider will attempt to use the local Ollama CLI as a best-effort fallback before returning a deterministic stub useful for tests and examples.
  • GeminiProvider, GrokProvider — lightweight shims.
  • OpenAIProvider, ClaudeProvider — will use the official SDKs when installed, falling back to deterministic behavior otherwise.
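The SDK-or-fallback behavior described above can be sketched as follows. This is an illustrative pattern only, not modelito's actual implementation; the `StubProvider` and `make_provider` names are hypothetical:

```python
# Sketch of the optional-SDK fallback pattern (hypothetical names).
# modelito's real shims may differ; this only illustrates the idea.

class StubProvider:
    """Deterministic, offline-friendly fallback for tests and examples."""

    def list_models(self):
        return ["stub-model"]

    def summarize(self, messages, settings=None):
        # Echo a deterministic summary of the last message.
        return f"stub-summary: {messages[-1]['content']}"


def make_provider():
    try:
        import openai  # real SDK, only present if the extra is installed
    except ImportError:
        return StubProvider()
    # ... construct a real SDK-backed provider here ...
    return StubProvider()  # placeholder for brevity


p = make_provider()
print(p.summarize([{"role": "user", "content": "hello"}]))
```

Because the fallback is deterministic, code written against this pattern behaves identically in CI without network access or API keys.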

License / AS IS

This software is provided "AS IS" and without warranties of any kind. See the included LICENSE file for the full MIT license text.

CI / Integration Tests

This repository includes a GitHub Actions workflow at .github/workflows/ci.yml. The workflow runs mypy and the unit test suite on push and pull requests.

Ollama integration tests are intentionally gated and will only run when you explicitly enable them. To run integration tests locally or in CI set the environment variable RUN_OLLAMA_INTEGRATION=1. Additional optional flags:

  • ALLOW_OLLAMA_INSTALL=1 — permit the integration tests to attempt installing Ollama when missing.
  • ALLOW_OLLAMA_DOWNLOAD=1 — permit downloading remote models during integration tests.
  • ALLOW_OLLAMA_UPDATE=1 — permit running update flows during integration tests.

Example (local):

RUN_OLLAMA_INTEGRATION=1 pytest tests/test_ollama_integration.py -q
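The same opt-in gating can be reproduced in your own test suites. A minimal sketch of reading these flags with the standard library — the flag names are modelito's, but the helper functions here are illustrative, not part of the package:

```python
import os


def ollama_integration_enabled(env=os.environ) -> bool:
    """True only when integration tests are explicitly opted in."""
    return env.get("RUN_OLLAMA_INTEGRATION") == "1"


def ollama_install_allowed(env=os.environ) -> bool:
    """Whether tests may attempt to install Ollama when missing."""
    return env.get("ALLOW_OLLAMA_INSTALL") == "1"


# With no flags set, integration tests stay disabled by default.
print(ollama_integration_enabled({}))                              # False
print(ollama_integration_enabled({"RUN_OLLAMA_INTEGRATION": "1"}))  # True
```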

Provider integration tests for external services (OpenAI, Anthropic, etc.) are also gated and will be skipped unless the corresponding API keys are present in the environment or configured as repository secrets in CI (for example OPENAI_API_KEY, ANTHROPIC_API_KEY). Add those secrets to your CI settings to enable provider integration jobs.

Provider interface

modelito exposes a minimal structural Provider Protocol that codifies the small runtime surface expected from provider implementations and third-party adapters. The Protocol is intentionally small to remain compatible with existing duck-typed providers — it requires only:

  • list_models() -> list[str]
  • summarize(messages, settings=None) -> str

All built-in providers shipped with the package (OpenAIProvider, ClaudeProvider, GeminiProvider, OllamaProvider, GrokProvider) now explicitly subclass Provider. The Provider Protocol is decorated with @runtime_checkable, so you can use isinstance() checks at runtime when you need to enforce the contract in application code.
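Because the Protocol is structural, third-party adapters do not need to subclass anything from modelito. Conceptually it resembles the standalone sketch below (the actual definition lives in modelito; this copy is for illustration only):

```python
from typing import Protocol, runtime_checkable


@runtime_checkable
class Provider(Protocol):
    """Structural sketch of modelito's Provider contract."""

    def list_models(self) -> list[str]: ...
    def summarize(self, messages, settings=None) -> str: ...


class MyAdapter:
    """A duck-typed provider; no inheritance from Provider required."""

    def list_models(self) -> list[str]:
        return ["my-model"]

    def summarize(self, messages, settings=None) -> str:
        return "summary of %d message(s)" % len(messages)


# Passes because MyAdapter has the required methods.
print(isinstance(MyAdapter(), Provider))  # True
```

Note that `@runtime_checkable` isinstance() checks only verify method presence, not signatures or return types; static type checkers such as mypy enforce the full structural contract.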

Example usage:

from modelito import Provider, OllamaProvider

p: Provider = OllamaProvider()
if isinstance(p, Provider):
    from modelito.messages import Message
    resp = p.summarize([Message(role="user", content="hello")])
    print(resp)

The package provides typed Message/Response dataclasses and exposes a small set of optional Protocols for provider surfaces:

  • SyncProvider (alias: Provider) — existing synchronous summarize()/list_models() surface.
  • AsyncProvider — async acomplete() surface for providers that support awaitable calls.
  • StreamingProvider — streaming stream() generator surface.
  • EmbeddingProvider — embed() surface for vector embeddings.
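The async surface can be sketched in the same structural style. The acomplete() method name comes from the list above, but its exact signature is an assumption, and DummyAsyncProvider is a stand-in, not a modelito class:

```python
import asyncio
from typing import Protocol, runtime_checkable


@runtime_checkable
class AsyncProvider(Protocol):
    """Structural sketch; the real Protocol ships with modelito."""

    async def acomplete(self, messages, settings=None) -> str: ...


class DummyAsyncProvider:
    async def acomplete(self, messages, settings=None) -> str:
        await asyncio.sleep(0)  # stand-in for a network call
        return "async response"


async def main() -> str:
    p = DummyAsyncProvider()
    assert isinstance(p, AsyncProvider)
    return await p.acomplete([{"role": "user", "content": "hello"}])


print(asyncio.run(main()))  # async response
```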

modelito exposes Message and Response dataclasses; connectors and provider surfaces accept Message instances. Example usage with the current API:

from modelito import Provider, Message, OllamaProvider, OllamaConnector

p: Provider = OllamaProvider()
if isinstance(p, Provider):
    resp_text = p.summarize([Message(role="user", content="hello")])
    print(resp_text)

conn = OllamaConnector(provider=p)
res = conn.complete(conv_id="example", new_messages=[Message(role="user", content="hello")])
print(res.text)

Streaming semantics

Modelito normalizes provider streaming into a simple incremental text stream. Providers may emit data at different granularities; the connector/streaming helpers attempt to normalize these into a sequence of text chunks that are safe to concatenate to form the final output. Common shapes you will encounter:

  • Token-level: Backends (e.g., OpenAI SDK) may stream individual token deltas. These are emitted as short text fragments; consumers should append fragments in order to reconstruct the full output.
  • Chunk-level: Some providers deliver logical chunks or events (for example, chunked JSON payloads). Modelito extracts the textual portion and yields it as incremental chunks.
  • Line-delimited / SSE: HTTP services (like Ollama's /api/generate) may send newline-delimited JSON or SSE frames. Modelito reads and normalizes the frames and yields textual content as it becomes available.

Behavioral notes:

  • The stream() generator yields str pieces; each yielded item is intended to be appended to reconstruct the response incrementally.
  • When you need token-level control (e.g., streaming token-by-token), prefer providers that expose token deltas (OpenAI SDK). Modelito will still yield those token deltas as text fragments.
  • Offline/deterministic fallbacks yield the full text in a single chunk.
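Given these semantics, consuming a stream reduces to concatenating yielded fragments in order. A minimal sketch with a stand-in generator (a real StreamingProvider's stream() would take its place):

```python
from typing import Iterator


def fake_stream() -> Iterator[str]:
    """Stand-in for a StreamingProvider.stream() call: yields text chunks."""
    yield "Hel"
    yield "lo, "
    yield "world"


# Append chunks in order to reconstruct the response incrementally.
parts = []
for chunk in fake_stream():
    parts.append(chunk)  # chunks are safe to concatenate as-is

final = "".join(parts)
print(final)  # Hello, world
```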

Download files

Source Distribution

  • modelito-1.0.8.tar.gz (50.6 kB)

Built Distribution

  • modelito-1.0.8-py3-none-any.whl (48.0 kB)

File details

Details for the file modelito-1.0.8.tar.gz.

File metadata

  • Download URL: modelito-1.0.8.tar.gz
  • Size: 50.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for modelito-1.0.8.tar.gz:

  • SHA256: ae9f7bd41948982555f0dce13bd16854df8ed652d2971dab2c1911baccf0b478
  • MD5: 4b46fa4f01f1b29a34630c5135db37b6
  • BLAKE2b-256: d0290f69692c5968dd999f9d13fed99aff8806f1edcbbf2a0b6d1a7e95c367c3

File details

Details for the file modelito-1.0.8-py3-none-any.whl.

File metadata

  • Download URL: modelito-1.0.8-py3-none-any.whl
  • Upload date:
  • Size: 48.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for modelito-1.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 782181c8521ee524cf002e746bad89281834dabf309bbbe4fb69837ff4dd4aa7
MD5 72d8125c2e7460d97b38aa9ec5985bea
BLAKE2b-256 170213c9e87a9b89d6818cb1f0b25d9ce1a1c1795e3d3a9f304596342c4bcfdc

See more details on using hashes here.
