TransAI
AI library and helpers (Python/Poetry/Typer - LM Studio & llama.cpp).
- Primary use case: Python API/interface with local AI models
- Works with: local AI models via LM Studio or llama.cpp
- Status: stable
- License: Apache-2.0
Since version 1.0.0 it is a PyPI package: https://pypi.org/project/transai/
License
Copyright 2025 Daniel Balparda balparda@github.com
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0.
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
Third-party notices
This project depends on third-party software. Key runtime dependencies:
- transcrypto (Apache-2.0) — CLI modules, logging, utilities
- llama-cpp-python (MIT) — llama.cpp Python bindings
- lmstudio — LM Studio client library
- Pillow (MIT-CMU) — image processing
- pydantic (MIT) — data validation and JSON schema
See pyproject.toml for the full dependency list.
Installation
To use in your project:
pip3 install transai
and then import the library:
from transai.core import ai, lms, llama
from transai.utils import images
For the CLI tool, after installation just run:
transai --help
Supported platforms
- OS: Linux, macOS, Windows (wherever `llama-cpp-python` and `lmstudio` are supported)
- Architectures: x86_64, arm64
- Python: 3.12+
Known dependencies (Prerequisites)
- python 3.12+ — documentation
- transcrypto 2.5+ — CLI modules, logging, humanization, config management, etc. — documentation
- Pillow 12.2+ — image processing and format conversion
- pydantic 2.12+ — data validation and JSON schema generation
- llama-cpp-python 0.3.20+ — llama.cpp Python bindings for local GGUF model inference
- lmstudio 1.5+ — LM Studio client library for the LM Studio API
- rich — terminal output formatting (via transcrypto)
- typer — CLI framework (via transcrypto)
What TransAI is
TransAI is a Python library and CLI tool that provides a unified interface for running local AI models through two backends:
- LM Studio (`LMStudioWorker`): connects to a running LM Studio server on localhost via the `lmstudio` client library. This is the recommended and default backend.
- llama.cpp (`LlamaWorker`): loads GGUF model files directly into memory using `llama-cpp-python`. Useful when you want full control without running an LM Studio server.
Both backends share the same abstract interface (AIWorker), so you can swap backends without changing your application code. Models can be queried with plain text prompts or with structured output (Pydantic models), vision models can process images, and tool-capable models can call Python functions.
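This backend-swap pattern is standard Python abstract-base-class design. The following is a minimal, self-contained sketch of the idea only; the `Echo*Worker` classes and the single-method interface are hypothetical simplifications, and the real `AIWorker` interface is richer:

```python
import abc


class AIWorkerSketch(abc.ABC):
  """Simplified stand-in for transai's AIWorker interface (illustration only)."""

  @abc.abstractmethod
  def ModelCall(self, user_prompt: str) -> str:
    """Query the model and return its text response."""


class EchoLMStudioWorker(AIWorkerSketch):
  """Pretend LM Studio backend: just echoes, tagged with its backend name."""

  def ModelCall(self, user_prompt: str) -> str:
    return f'[lmstudio] {user_prompt}'


class EchoLlamaWorker(AIWorkerSketch):
  """Pretend llama.cpp backend: same interface, different implementation."""

  def ModelCall(self, user_prompt: str) -> str:
    return f'[llama.cpp] {user_prompt}'


def Ask(worker: AIWorkerSketch, prompt: str) -> str:
  # application code depends only on the abstract interface,
  # so either backend can be plugged in without changes here
  return worker.ModelCall(prompt)


print(Ask(EchoLMStudioWorker(), 'hi'))  # [lmstudio] hi
print(Ask(EchoLlamaWorker(), 'hi'))     # [llama.cpp] hi
```

Because `Ask()` only knows about the abstract type, swapping backends is a one-line change at the call site, which is exactly the property the shared `AIWorker` interface gives you.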
What TransAI is not
- Not a cloud AI service — it only works with local models
- Not a model downloader — you must have models available locally (via LM Studio or as GGUF files)
- Not a training framework — inference only
- Not a high-level agent framework — it provides the low-level model interface layer
Key concepts and terminology
- AIWorker: abstract base class defining the interface for loading and querying AI models
- LMStudioWorker: concrete worker that connects to a local LM Studio server
- LlamaWorker: concrete worker that loads GGUF files directly via llama.cpp
- AIModelConfig: TypedDict with all model loading parameters (context, temperature, GPU, seed, etc.)
- Model ID: a string identifying the model, typically in the format `model-name@quantization` (e.g., `qwen3-8b@Q8_0`); should match what you would use with `lms get <model_id>` or `https://huggingface.co/<model_id>`
- GGUF: the quantized model file format used by llama.cpp
- CLIP projector: a companion model file enabling vision capabilities in multi-modal models
- Speculative decoding: a technique for faster inference by generating multiple tokens in parallel
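To make the `AIModelConfig` concept concrete, here is an illustrative (not authoritative) reconstruction of the defaults-plus-overrides pattern that `MakeAIModelConfig()` implements. The field names come from this README; the default values below are hypothetical, chosen to mirror the CLI defaults documented later, and the real defaults in `transai` may differ:

```python
from typing import Any, TypedDict


class AIModelConfigSketch(TypedDict, total=False):
  """Illustrative subset of the AIModelConfig fields named in this README."""
  model_id: str
  vision: bool
  temperature: float
  seed: int
  context: int
  gpu_ratio: float


# hypothetical defaults, for illustration only
_DEFAULTS: AIModelConfigSketch = {
    'model_id': 'qwen3-8b@Q8_0',
    'vision': False,
    'temperature': 0.15,
    'context': 32768,
    'gpu_ratio': 0.80,
}


def MakeConfigSketch(**overrides: Any) -> AIModelConfigSketch:
  """Start from the defaults and apply only the caller's overrides."""
  # TypedDicts are plain dicts at runtime, so a dict merge does the job
  config: AIModelConfigSketch = {**_DEFAULTS, **overrides}
  return config


cfg = MakeConfigSketch(temperature=0.5, vision=True)
print(cfg['temperature'], cfg['context'])  # 0.5 32768
```

The point is that callers only name the fields they care about and every other field still arrives populated, which is why the library examples below always go through the convenience constructor.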
Known limitations
- LM Studio backend requires a running LM Studio server on localhost (127.0.0.1)
- llama.cpp backend requires GGUF model files on disk
- Vision support in llama.cpp depends on CLIP projector file availability and supported architectures (Qwen2-VL, MiniCPM, Llama3-Vision, Moondream, NanoLLava, Obsidian, Llava)
- No telemetry, no network calls beyond localhost (LM Studio server)
Library API usage
Loading a model
transai.core.ai exposes a convenience constructor MakeAIModelConfig(**overrides) which
returns a fully-populated AIModelConfig TypedDict with sensible defaults.
from transai.core import ai, lms, llama
# --- Using LM Studio ---
with lms.LMStudioWorker() as worker:
  config, metadata = worker.LoadModel(ai.MakeAIModelConfig(
      model_id='qwen3-vl-32b-instruct@Q8_0',
      vision=True,
      temperature=0.5,  # only override the ones you care about!
      # all other fields will have sensible defaults; currently also supported are:
      # seed, context, gpu_ratio, gpu_layers, use_mmap, fp16, flash, spec_tokens, kv_cache
  ))
  # ... use worker.ModelCall() ...

# --- Using llama.cpp ---
import pathlib

with llama.LlamaWorker(pathlib.Path('~/.lmstudio/models/')) as worker:
  config, metadata = worker.LoadModel(ai.MakeAIModelConfig(
      model_id='qwen3-8b@Q8_0',
      # ... same config field possibilities ...
  ))
  # ... use worker.ModelCall() ...
Querying a model (text)
response: str = worker.ModelCall(
    model_id='qwen3-8b@Q8_0',
    system_prompt='You are a helpful assistant.',
    user_prompt='What is the capital of France?',
    output_format=str,
)
print(response)  # "The capital of France is Paris."
Querying a model (structured JSON)
To get a structured object back from the model, just create a pydantic.BaseModel class as shown below. Make sure to add docstrings and pydantic.Field descriptions to the fields, as all of this information (names, types, descriptions) is sent to the model.
import pydantic


class CityInfo(pydantic.BaseModel):
  """City information."""

  city: str = pydantic.Field(description='city name')
  country: str = pydantic.Field(description='country name')
  population: int = pydantic.Field(description='city population')
  districts: list[str] = pydantic.Field(description='list of city district names')


result: CityInfo = worker.ModelCall(
    model_id='qwen3-8b@Q8_0',
    system_prompt='Extract city information: its country, population, and list of districts.',
    user_prompt='Tell me about Paris, France.',
    output_format=CityInfo,
)
print(result.city)        # "Paris"
print(result.population)  # 2161000
Vision models (images)
import pathlib

response: str = worker.ModelCall(
    model_id='qwen3-vl-32b-instruct@Q8_0',
    system_prompt='Describe what you see.',
    user_prompt='What is in this image?',
    output_format=str,
    images=[pathlib.Path('photo.jpg')],  # or raw bytes, or file path string
)
Images are automatically resized to fit within 1024px (longest edge) before being sent to the model.
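The fit-within-1024px rule is plain aspect-ratio arithmetic. A self-contained sketch of the dimension calculation (the actual resize code inside `transai` may differ in rounding details; `fit_longest_edge` is a hypothetical helper name):

```python
def fit_longest_edge(width: int, height: int, max_edge: int = 1024) -> tuple[int, int]:
  """Scale (width, height) down so the longest edge is at most max_edge."""
  longest = max(width, height)
  if longest <= max_edge:
    return (width, height)  # already small enough: never upscale
  scale = max_edge / longest
  # round, and clamp to at least 1 pixel per side for degenerate aspect ratios
  return (max(1, round(width * scale)), max(1, round(height * scale)))


print(fit_longest_edge(2048, 1536))  # (1024, 768)
print(fit_longest_edge(800, 600))    # (800, 600)
```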
Tool use (function calling)
Pass Python callables (or fully-qualified dotted names) as tools. The model may invoke them during the conversation and TransAI handles the execution round-trip automatically:
import math


def celsius_to_fahrenheit(celsius: float) -> float:
  """Convert Celsius to Fahrenheit.

  Args:
    celsius: temperature in Celsius

  Returns:
    temperature in Fahrenheit
  """
  return celsius * 9 / 5 + 32


# tools must be a list of callables; the model may call them zero or more times
response: str = worker.ModelCall(
    model_id='qwen3-8b@Q8_0',
    system_prompt='You are a helpful assistant.',
    user_prompt='What is 23°C in Fahrenheit? Also, what is the GCD of 48 and 36?',
    output_format=str,
    tools=[celsius_to_fahrenheit, math.gcd],
)
Image utilities
The transai.utils.images module provides helpers for image preprocessing:
from transai.utils import images
# Resize an image for vision models (max 1024px, returns PNG bytes)
png_bytes: bytes = images.ResizeImageForVision(raw_image_bytes)
# Extract frames from an animated image (GIF, APNG, etc.)
for frame_png in images.AnimationFrames(animated_gif_bytes):
  # each frame is PNG bytes, resized to max 336px
  pass
AI Guide
Model suggestions as of April 2026. Just an opinion, not to be taken too seriously. Do your own tests.
Vision Models
These models can process images.
| Model Flag Value | Size | Type | Tool? | Reason? | Comment |
|---|---|---|---|---|---|
| `qwen3-vl-32b-instruct@Q8_0` | 36GB | llm/qwen3vl/GGUF | Y | | Very good, slow. |
| `qwen3-vl-32b-instruct@F16` | 67GB | llm/qwen3vl/GGUF | Y | | `--fp16` - Very good, slow. Q8_0 version is faster-ish and still very good. |
| `qwen3.5-35b-a3b@Q8_0` * | 38GB | llm/qwen35moe/GGUF | Y | Y | Decent, slow. |
| `zai-org/glm-4.6v-flash@8bit` * | 12GB | llm/glm4v/MLX | Y | Y | Decent, slow. |
Blind Models
These models cannot process images (blind).
| Model Flag Value | Size | Type | Tool? | Reason? | Comment |
|---|---|---|---|---|---|
| `qwen3-8b@Q8_0` | 8.7GB | llm/qwen3/GGUF | Y | | Good, medium-speed. |
| `gpt-oss-20b@MXFP4` * | 12GB | llm/gpt_oss/MLX | Y | Y | Poor, slow. |
| `zai-org/glm-4.7-flash@8bit` * | 32GB | llm/glm4v/MLX | Y | Y | Good, inconsistent. |
CLI Interface
Quick start
Query a local AI model via LM Studio (server must be running):
transai query "What is the capital of France?"
Query using the llama.cpp backend (direct GGUF loading, no server needed):
transai --no-lms --root ~/.lmstudio/models/ query "Give me an onion soup recipe."
Query with tool use (pass fully-qualified Python callable names; model calls them automatically):
transai query --tools math.gcd --tools os.getcwd "What is the GCD of 48 and 36? Also what is my current directory?"
Global flags
| Flag | Description | Default |
|---|---|---|
| `--help` | Show help | off |
| `--version` | Show version and exit | off |
| `-v, -vv, -vvv, --verbose` | Verbosity (nothing=ERROR, `-v`=WARNING, `-vv`=INFO, `-vvv`=DEBUG) | ERROR |
| `--color`/`--no-color` | Force enable/disable colored output (respects `NO_COLOR` env var if not provided) | `--color` |
| `-r, --root` | Local models root directory (only needed for `--no-lms`) | LM Studio default if it exists |
| `--lms`/`--no-lms` | Use LM Studio backend vs llama.cpp backend | `--lms` |
| `-m, --model` | Model to load (e.g., `qwen3-8b@Q8_0`) | `qwen3-8b@Q8_0` |
| `-t, --tokens` | Speculative decoding tokens (2-200) | disabled |
| `-s, --seed` | Random seed for reproducibility | random |
| `--context` | Max context tokens (16-16777216) | 32768 |
| `-x, --temperature` | Sampling temperature (0.0-2.0) | 0.15 |
| `-g, --gpu` | GPU ratio (0.1-1.0) | 0.80 |
| `--gpu-layers` | GPU layers to offload (-1 = as many as possible) | -1 |
| `--fp16`/`--no-fp16` | FP16 precision mode | `--no-fp16` |
| `--mmap`/`--no-mmap` | Memory-mapped file loading | `--mmap` |
| `--flash`/`--no-flash` | Flash attention | `--flash` |
| `--kv-cache` | KV-cache precision type (GGML type, 4-128) | model default |
CLI Commands Documentation
The CLI documentation in transai.md is auto-generated (by `make docs` or `make ci`).
Color and formatting
Rich provides color output in logging and CLI output. The app:
- Respects the `NO_COLOR` environment variable
- Has a `--color`/`--no-color` flag: if given, it overrides the `NO_COLOR` environment variable
- Defaults to colored output if neither the environment variable nor the flag is given
To control color see Rich's markup conventions.
Project Design
Architecture overview
TransAI uses an abstract base class pattern for backend abstraction:
CLI (transai.py + cli/query.py)
│
├─ LMStudioWorker (core/lms.py) ──▶ LM Studio server (localhost)
│
└─ LlamaWorker (core/llama.py) ──▶ GGUF files on disk

Both workers implement AIWorker (core/ai.py) and rely on the
image utilities (utils/images.py) for preprocessing.
- `AIWorker` defines `LoadModel()` and `ModelCall()` as the public interface
- `LMStudioWorker` and `LlamaWorker` implement `_Load()` and `_Call()` internally
- The CLI layer (`transai.py`, `cli/query.py`) orchestrates configuration and delegates to workers
- Image preprocessing is handled by `utils/images.py`
Modules
| Module | Responsibility |
|---|---|
| `transai.py` | CLI app definition, global options, `TransAIConfig` dataclass |
| `cli/query.py` | `query` command implementation |
| `core/ai.py` | `AIWorker` abstract base class, `AIModelConfig`, shared constants and types |
| `core/lms.py` | `LMStudioWorker` — LM Studio backend implementation |
| `core/llama.py` | `LlamaWorker` — llama.cpp backend implementation (GGUF loading, CLIP detection, vision handlers) |
| `utils/images.py` | Image resizing for vision models, animation frame extraction |
Development Instructions
File structure
.
├── CHANGELOG.md ⟸ latest changes/releases
├── LICENSE
├── Makefile
├── transai.md ⟸ auto-generated CLI doc (by `make docs` or `make ci`)
├── poetry.lock ⟸ maintained by Poetry, do not manually edit
├── pyproject.toml ⟸ most important configurations live here
├── README.md ⟸ this documentation
├── SECURITY.md ⟸ security policy
├── requirements.txt
├── .pre-commit-config.yaml ⟸ pre-submit configs
├── .github/
│ ├── copilot-instructions.md ⟸ GitHub Copilot project-specific instructions
│ ├── dependabot.yaml ⟸ Github dependency update pipeline
│ └── workflows/
│ ├── ci.yaml ⟸ Github CI pipeline
│ └── codeql.yaml ⟸ Github security scans and code quality pipeline
├── .vscode/
│ └── settings.json ⟸ VSCode configs
├── scripts/
│ └── make_test_images.py ⟸ helper script for generating test images
├── src/
│ └── transai/
│ ├── __init__.py ⟸ version and package metadata
│ ├── __main__.py ⟸ `python -m transai` entry point
│ ├── transai.py ⟸ main CLI app entry point (Run(), Main())
│ ├── py.typed ⟸ PEP 561 marker for type stubs
│ ├── cli/
│ │ └── query.py ⟸ `transai query` command implementation
│ ├── core/
│ │ ├── ai.py ⟸ AIWorker abstract base class, AIModelConfig, shared types
│ │ ├── llama.py ⟸ LlamaWorker (llama.cpp backend)
│ │ └── lms.py ⟸ LMStudioWorker (LM Studio backend)
│ └── utils/
│ └── images.py ⟸ image preprocessing for vision models
├── tests/ ⟸ unit tests
│ ├── transai_test.py
│ ├── cli/
│ │ └── query_test.py
│ ├── core/
│ │ ├── ai_test.py
│ │ ├── llama_test.py
│ │ └── lms_test.py
│ └── utils/
│ └── images_test.py
└── tests_integration/
└── test_installed_cli.py ⟸ integration tests (wheel build + install)
Development Setup
Install Python
On Linux:
sudo apt-get update
sudo apt-get upgrade
sudo apt-get install git python3 python3-dev python3-venv build-essential software-properties-common
sudo add-apt-repository ppa:deadsnakes/ppa
sudo apt-get update
sudo apt-get install python3.12
On Mac:
brew update
brew upgrade
brew cleanup -s
brew install git python@3.12
Install Poetry (recommended: pipx)
Install pipx (if you don't have it):
python3 -m pip install --user pipx
python3 -m pipx ensurepath
If you previously had Poetry installed, but not through pipx, make sure to remove it first: `brew uninstall poetry` (Mac) / `sudo apt-get remove python3-poetry` (Linux). You should install Poetry with pipx and configure it to create `.venv/` locally. This keeps Poetry isolated from project virtual environments, and the Python used by those environments is isolated from the Python used by Poetry itself. Do:
pipx install poetry
poetry --version
If you will use PyPI to publish:
poetry config pypi-token.pypi <TOKEN> # add your personal PyPI project token, if any
Make sure .venv is local
This project expects a project-local virtual environment at ./.venv (VSCode settings assume it).
poetry config virtualenvs.in-project true
Get the repository
git clone https://github.com/balparda/transai.git transai
cd transai
Create environment and install dependencies
From the repository root:
poetry env use python3.12 # creates the .venv with the correct Python version
poetry sync # sync env to project's poetry.lock file
poetry env info # no-op: just to check that environment looks good
poetry check # no-op: make sure all pyproject.toml fields are being used correctly
poetry run transai --help # simple test if everything loaded OK
make ci # should pass OK on clean repo
To activate and use the environment do:
poetry env activate # (optional) will print activation command for environment, but you can just use:
source .venv/bin/activate # because .venv SHOULD BE LOCAL
...
pytest -vvv # for example, or other commands you want to execute in-environment
...
deactivate # to close environment
Optional: VSCode setup
This repo ships a .vscode/settings.json configured to:
- use `./.venv/bin/python`
- run `pytest`
- use Ruff as formatter
- disable deprecated pylint/flake8 integrations
- configure Google-style docstrings via autoDocstring
- use Code Spell Checker
Recommended VSCode extensions:
- Python (`ms-python.python`)
- Python Environments (`ms-python.vscode-python-envs`)
- Python Debugger (`ms-python.debugpy`)
- Pylance (`ms-python.vscode-pylance`)
- Mypy Type Checker (`ms-python.mypy-type-checker`)
- Ruff (`charliermarsh.ruff`)
- autoDocstring – Python Docstring Generator (`njpwerner.autodocstring`)
- Code Spell Checker (`streetsidesoftware.code-spell-checker`)
- markdownlint (`davidanson.vscode-markdownlint`)
- Markdown All in One (`yzhang.markdown-all-in-one`) - helps maintain this `README.md` table of contents
- Markdown Preview Enhanced (`shd101wyy.markdown-preview-enhanced`, optional)
- GitHub Copilot (`github.copilot`) - AI assistant; reads `.github/copilot-instructions.md` for project-specific coding conventions (indentation, naming, workflow)
Testing
Unit tests / Coverage
make test # plain test run, no integration tests
make integration # run the integration tests
poetry run pytest -vvv # verbose test run, includes integration tests
make cov # coverage run, equivalent to: poetry run pytest --cov=src --cov-report=term-missing
A test can be marked with a "tag" by just adding a decorator:
import pytest

@pytest.mark.slow
def test_foo_method() -> None:
  """Test."""
  ...
These tags are defined in pyproject.toml, in section [tool.pytest.ini_options.markers]:
| Tag | Meaning |
|---|---|
| `slow` | test is slow (> 1s) |
| `flaky` | AVOID! — test is known to be flaky |
| `stochastic` | test is capable of failing (even if very unlikely) |
| `integration` | integration test (wheel build + install) |
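As a reference, a marker registration of this kind typically looks like the fragment below in pyproject.toml. This is an illustrative sketch; the exact wording of the marker descriptions in this project's pyproject.toml may differ:

```toml
[tool.pytest.ini_options]
markers = [
  "slow: test is slow (> 1s)",
  "flaky: AVOID! test is known to be flaky",
  "stochastic: test is capable of failing (even if very unlikely)",
  "integration: integration test (wheel build + install)",
]
```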
You can use them to filter tests:
poetry run pytest -vvv -m slow # run only the slow tests
You can find the slowest tests by running:
poetry run pytest -vvv -q --durations=20
You can search for flaky tests by running make flakes, which runs all tests 100 times.
Instrumenting your code
You can instrument your code to find bottlenecks:
$ source .venv/bin/activate
$ which transai
/path/to/.venv/bin/transai # <== place this in the command below:
$ pyinstrument -r html -o output1.html -- /path/to/.venv/bin/transai <your-cli-command> <your-cli-flags>
$ deactivate
This will save a file output1.html to the project directory with the timings for all method calls. Make sure to clean up these HTML files later.
Integration / e2e tests
Integration tests validate packaging and the installed console script by:
- building a wheel from the repository
- installing that wheel into a fresh temporary virtualenv
- running the installed console script(s) to verify behavior (e.g., `--version` and basic commands)
The canonical integration test is tests_integration/test_installed_cli.py. Tests in this suite are marked with pytest.mark.integration.
Run the integration tests with:
make integration # or: poetry run pytest -m integration -q
Linting / formatting / static analysis
make lint # equivalent to: poetry run ruff check .
make fmt # equivalent to: poetry run ruff format .
To check formatting without rewriting:
poetry run ruff format --check .
Type checking
make type # equivalent to: poetry run mypy src tests tests_integration
(Pyright is primarily for editor-time; MyPy is what CI enforces.)
Versioning and releases
Versioning scheme
This project follows a pragmatic versioning approach:
- Patch: bug fixes / docs / small improvements.
- Minor: new features or non-breaking changes.
- Major: breaking API changes.
See: CHANGELOG.md
Updating versions
Bump project version (patch/minor/major)
Poetry can bump versions:
# bump the version!
poetry version minor # updates 1.0.0 to 1.1.0, for example
# or:
poetry version patch # updates 1.0.0 to 1.0.1
# or:
poetry version <version-number>
# (also updates `pyproject.toml` and `poetry.lock`)
This updates [project].version in pyproject.toml. Remember to also update src/transai/__init__.py to match (this repo gets/prints __version__ from there)!
Update dependency versions
The project has a Dependabot config file in .github/dependabot.yaml that weekly (defaulting to Tuesdays) scans both the GitHub Actions and the project dependencies and creates PRs to update them.
To update the poetry.lock file to more recent versions, run poetry update; it will ignore the current lock, update dependencies, and rewrite the poetry.lock file. If you have cache problems, poetry cache clear PyPI --all will clean it.
To add a new dependency you should do:
poetry add "pkg>=1.2.3" # regenerates lock, updates env (adds dep to prod code)
poetry add -G dev "pkg>=1.2.3" # adds dep to dev code ("group" dev)
# also remember: "pkg@^1.2.3" = latest 1.* ; "pkg@~1.2.3" = latest 1.2.* ; "pkg@1.2.3" exact
Keep tool versions aligned. Remember to check your diffs before submitting (especially poetry.lock) to avoid surprises!
Exporting the requirements.txt file
This project does not generate requirements.txt automatically (Poetry uses poetry.lock). If you need a requirements.txt for Docker/legacy tooling, use Poetry's export plugin (poetry-plugin-export) by simply running:
make req # or: poetry export --format requirements.txt --without-hashes --output requirements.txt
CI and docs
Make sure to run make docs or even better make ci. Both will update the CLI markdown docs and requirements.txt automatically.
Git tag and commit
Publish to GIT, including a TAG:
git commit -a -m "release version 1.0.0"
git tag 1.0.0
git push
git push --tags
Publish to PyPI
If you already have your PyPI token registered with Poetry (see Install Poetry) then just:
poetry build
poetry publish
Remember to update CHANGELOG.md.
Security
Please refer to the security policy in SECURITY.md for supported versions and how to report vulnerabilities.
The project has a CodeQL config file in .github/workflows/codeql.yaml that weekly (defaulting to Fridays) scans the project for code quality and security issues. It also runs on all commits. GitHub security issues will be opened in the project if anything is found.