Embeddable AI engine for inference, embeddings, vector search, and fine-tuning (CUDA 12)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Jammi AI

Jammi is an embeddable AI engine that brings model inference into your data pipeline. Register data sources, run SQL queries, generate embeddings, search with vector similarity, fine-tune models on your domain, and evaluate results — all without leaving your application.

Install

pip install jammi-ai

For CUDA/GPU support:

pip install jammi-ai-cu12

Quickstart

The 5-minute walkthrough — install, connect, register a source, generate embeddings, search — lives in cookbook/quickstart/ with a runnable quickstart.py gated by CI. The condensed version:

import jammi_ai

db = jammi_ai.connect(gpu_device=-1)
db.add_source("corpus", url="cookbook/fixtures/tiny_corpus.parquet", format="parquet")

MODEL = "sentence-transformers/all-MiniLM-L6-v2"
db.generate_text_embeddings(source="corpus", model=MODEL, columns=["content"], key="id")

query_vec = db.encode_text_query(MODEL, "quantum computing applications")
results = db.search("corpus", query=query_vec, k=5).run()
print(results.to_pandas())

For runnable end-to-end recipes — mutable tables, trigger streams, eval, fine-tuning, Flight SQL — see cookbook/.

Features

SQL over local files — query Parquet, CSV, and JSON via DataFusion
Federated queries — join local files with PostgreSQL or MySQL
Text embeddings — load any BERT-family model from Hugging Face Hub (or local safetensors / ONNX) and persist results to Parquet with ANN indexes
Image embeddings — CLIP-style vision encoders
Vector search — ANN similarity search with automatic brute-force fallback
SearchBuilder — fluent API for .filter(), .sort(), .join(), .annotate(), .limit(), .select(), .run()
Evidence provenance — retrieved_by and annotated_by tracking on every search result
Fine-tuning — LoRA / deep LoRA adapters with contrastive loss to improve embeddings for your domain
Evaluation — recall@k, precision@k, MRR, nDCG, accuracy, F1, and A/B model comparison
Model caching — LRU eviction, ref-counted guards, single-flight loading
GPU scheduling — memory-budget admission control with RAII permits
Crash recovery — recovers embedding tables stuck in "building" state on restart

SearchBuilder

search = db.search("patents", query=query_vec, k=20)
search.filter("year >= 2020")
search.sort("similarity", descending=True)
search.limit(5)
search.select(["id", "title", "similarity"])
results = search.run()   # pyarrow.Table

All results are returned as pyarrow.Table — zero-copy from the Rust engine.

Fine-tuning

job = db.fine_tune(
    source="patents",
    model="sentence-transformers/all-MiniLM-L6-v2",
    triplets="triplets_train.parquet",
)
job.wait()

Requirements

Python 3.9+
Linux (x86_64) or macOS (Apple Silicon or Intel)

Windows is not yet supported due to a dependency on POSIX memory-mapping APIs.

Running the OSS server

For deployments that need a long-running Flight SQL + gRPC service rather than an embedded library, the workspace ships a Docker image:

docker run --rm \
  -p 8080:8080 -p 8081:8081 \
  -v jammi_data:/var/lib/jammi \
  ghcr.io/f-inverse/jammi-ai-server:latest

curl http://localhost:8080/healthz
# {"status":"ok","version":"0.8.0"}

The OSS server is single-tenant — the deployer's network is the auth boundary. See Deploy as a Server for the full guide.

Documentation

Full documentation, including guides for SQL queries, embeddings, search, fine-tuning, and evaluation:

https://f-inverse.github.io/jammi-ai/

License

Apache-2.0

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

vchakilam

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.9.0

May 26, 2026

This version

0.8.0

May 26, 2026

0.7.0

May 26, 2026

0.5.9

May 24, 2026

0.5.8

May 24, 2026

0.5.7

May 24, 2026

0.5.4

May 23, 2026

0.5.0

May 22, 2026

0.3.0

May 20, 2026

0.2.0

Mar 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

jammi_ai_cu12-0.8.0-cp39-abi3-manylinux_2_28_x86_64.whl (59.2 MB view details)

Uploaded May 26, 2026 CPython 3.9+manylinux: glibc 2.28+ x86-64

File details

Details for the file jammi_ai_cu12-0.8.0-cp39-abi3-manylinux_2_28_x86_64.whl.

File metadata

Download URL: jammi_ai_cu12-0.8.0-cp39-abi3-manylinux_2_28_x86_64.whl
Upload date: May 26, 2026
Size: 59.2 MB
Tags: CPython 3.9+, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for jammi_ai_cu12-0.8.0-cp39-abi3-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`364f4eff6b406b66e0613c07261435ce4f659ada673669eaef352884c7749437`
MD5	`61c676a1a75db8e05ea1a3c69594de0f`
BLAKE2b-256	`8b9deafae7aaf9478283917e43567f49c27733364abcf44e6ff30415cf5aae1e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for jammi_ai_cu12-0.8.0-cp39-abi3-manylinux_2_28_x86_64.whl:

Publisher: pypi-cuda.yml on f-inverse/jammi-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: jammi_ai_cu12-0.8.0-cp39-abi3-manylinux_2_28_x86_64.whl
- Subject digest: 364f4eff6b406b66e0613c07261435ce4f659ada673669eaef352884c7749437
- Sigstore transparency entry: 1633023702
- Sigstore integration time: May 26, 2026
Source repository:
- Permalink: f-inverse/jammi-ai@4ae79296677110e0cd2ca1702cca0d1e8017dd0b
- Branch / Tag: refs/tags/py-cu-v0.8.0
- Owner: https://github.com/f-inverse
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-cuda.yml@4ae79296677110e0cd2ca1702cca0d1e8017dd0b
- Trigger Event: push

jammi-ai-cu12 0.8.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Jammi AI

Install

Quickstart

Features

SearchBuilder

Fine-tuning

Requirements

Running the OSS server

Documentation

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

Provenance