Mojo-first late-interaction retrieval engine

Project description

Kayak Python SDK

kayak is a Python SDK for late-interaction retrieval.

Its job is to make late interaction programmable in normal Python while keeping query/document vector counts, layouts, and MaxSim semantics explicit.

Fundamentally, late interaction here means token-level MaxSim over explicit query and document vector groups. The SDK does not hide that structure behind a fake dense tensor API.

It gives you explicit objects for:

queries
query batches
documents
packed indexes
MaxSim scores
top-k search hits

The API is designed to feel natural for NumPy and PyTorch users without hiding the parts that matter in late interaction:

query vector count stays explicit
document vector count stays explicit
layout stays explicit
backend choice stays explicit

For the higher-level product positioning and the split between the open Python SDK and the hosted engine, see docs/python_sdk_charter.md. For the execution plan behind that position, see docs/python_sdk_roadmap.md.

Install

With pip:

pip install kayak

With UV:

uv add kayak

With Pixi and PyPI:

pixi add --pypi kayak

Optional Mojo Backend

The default Python SDK path uses the NumPy reference backend.

The Python package does not expose a separate "Mojo-mode" import surface. You still write normal Python:

import kayak
build query, documents, and index
opt into the Mojo backend explicitly on the operation call

You only need Mojo if you want the explicit exact CPU Mojo backend:

kayak.MOJO_EXACT_CPU_BACKEND

Examples:

# global or activated environment
pip install kayak
mojo --version

# Pixi project
pixi add python=3.11 mojo
pixi add --pypi kayak
pixi run python app.py

Then select the backend explicitly:

scores = kayak.maxsim(
    query,
    index,
    backend=kayak.MOJO_EXACT_CPU_BACKEND,
)

Kayak does not silently switch to the Mojo backend just because Mojo is installed. The backend choice stays explicit.

If you are running inside an activated virtual environment or pixi run python, Kayak first checks that active Python environment for a usable mojo binary before falling back to PATH.

Current CLI discovery order for the Mojo backend:

KAYAK_MOJO_CLI
a usable mojo binary in the active Python environment
mojo on PATH
pixi run mojo

If you do not pass backend=kayak.MOJO_EXACT_CPU_BACKEND, Kayak stays on the NumPy reference backend and does not require Mojo.

Core API

Create a query:

import kayak
import numpy as np

query = kayak.query(
    np.array(
        [
            [1.0, 0.0],
            [0.0, 1.0],
        ],
        dtype=np.float32,
    )
)

Create a document collection:

documents = kayak.documents(
    ["doc-a", "doc-b"],
    [
        np.array([[1.0, 0.0], [0.0, 1.0]], dtype=np.float32),
        np.array([[1.0, 0.0], [0.5, 0.5]], dtype=np.float32),
    ],
)

Pack documents into an index:

index = documents.pack()

Score with MaxSim:

scores = kayak.maxsim(query, index)

Search:

hits = kayak.search(query, index, k=2)

Create an explicit query batch without pretending it is one dense tensor:

def dim128(index: int) -> np.ndarray:
    vector = np.zeros(128, dtype=np.float32)
    vector[index] = 1.0
    return vector

batch = kayak.query_batch(
    [
        np.stack([dim128(0), dim128(1)]),
        np.stack([dim128(0), dim128(1), dim128(2)]),
    ]
)

index = kayak.documents(
    ["doc-a", "doc-b"],
    [
        np.stack([dim128(0), dim128(1), dim128(2)]),
        np.stack([dim128(0), dim128(1)]),
    ],
).pack()

scores_batch = kayak.maxsim_batch(batch, index)

Candidate-window rescoring stays explicit:

hits = kayak.search(query, index, k=10)
candidate_index = index.select([hit.doc_id for hit in hits])
candidate_scores = kayak.maxsim(query, candidate_index)

That is still late interaction as a primitive: explicit selection plus MaxSim, not a hidden rerank mode.

Layouts

Kayak keeps layout changes explicit.

flat_dim128 and hybrid_flat_dim128 require vector_dim == 128.

Example:

import kayak
import numpy as np

def dim128(index: int) -> np.ndarray:
    vector = np.zeros(128, dtype=np.float32)
    vector[index] = 1.0
    return vector

query128 = kayak.query(np.stack([dim128(0), dim128(1)]))
documents128 = kayak.documents(
    ["doc-a", "doc-b"],
    [
        np.stack([dim128(0), dim128(1)]),
        np.stack([dim128(0), dim128(0)]),
    ],
)
index128 = documents128.pack()

flat_query = query128.to_layout("flat_dim128")
hybrid_index = index128.to_layout("hybrid_flat_dim128")

scores = kayak.maxsim(flat_query, hybrid_index)

Backends

The package exposes two named backends:

kayak.NUMPY_REFERENCE_BACKEND
kayak.MOJO_EXACT_CPU_BACKEND

You can inspect backend availability explicitly:

print(kayak.available_backends())
print(kayak.backend_info(kayak.MOJO_EXACT_CPU_BACKEND))

Example:

scores = kayak.maxsim(
    query,
    index,
    backend=kayak.NUMPY_REFERENCE_BACKEND,
)

The NumPy backend is the safest default.

The Mojo exact CPU backend is the faster exact path when your environment has a working Mojo installation:

scores = kayak.maxsim(
    query,
    index,
    backend=kayak.MOJO_EXACT_CPU_BACKEND,
)

Public Surface

Application code should import from kayak.

Main exports:

BackendInfo
LateQuery
LateQueryBatch
LateDocuments
LateIndex
LateScores
SearchHit
available_backends
backend_info
query
query_batch
documents
packed_index
hybrid_flat_dim128_index
flat_query_dim128
maxsim
maxsim_batch
search
search_batch

Mental Model

Kayak is not a generic tensor library.

It is a late-interaction retrieval API with:

ragged query and document vector counts
explicit layout conversion
exact MaxSim scoring
explicit candidate-window selection before rescoring
explicit search backends

That makes it suitable for code that wants retrieval semantics first, while still fitting naturally into Python workflows built on NumPy or PyTorch.

Project details

Release history Release notifications | RSS feed

0.3.0

Apr 15, 2026

This version

0.1.4

Apr 12, 2026

0.1.3

Apr 12, 2026

0.1.2

Apr 12, 2026

0.1.1

Apr 12, 2026

0.1.0

Apr 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kayak-0.1.4.tar.gz (85.8 kB view details)

Uploaded Apr 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kayak-0.1.4-py3-none-any.whl (2.6 MB view details)

Uploaded Apr 12, 2026 Python 3

File details

Details for the file kayak-0.1.4.tar.gz.

File metadata

Download URL: kayak-0.1.4.tar.gz
Upload date: Apr 12, 2026
Size: 85.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.16

File hashes

Hashes for kayak-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`bf21550a6090835728855b06bfaf9dadf739376b31a3319552bfd0f7ea262d9b`
MD5	`1569c7f855c9d381bb61bd9b0747228a`
BLAKE2b-256	`a8182a51ae616a8d334e49acf941517a6ca9ece9850839ef883a79bca3dbd4a4`

See more details on using hashes here.

File details

Details for the file kayak-0.1.4-py3-none-any.whl.

File metadata

Download URL: kayak-0.1.4-py3-none-any.whl
Upload date: Apr 12, 2026
Size: 2.6 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.16

File hashes

Hashes for kayak-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7441e2477f1e03e3960191f92bd5063466227d09c15484772cb68d560ee0be4c`
MD5	`b74fe62bdcefdfaa7ba141cb9a8aa041`
BLAKE2b-256	`055a9175040fe53b59eeeb977f60778553addffa9c4c8ed9382637a78c08c76a`

See more details on using hashes here.

kayak 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Kayak Python SDK

Install

Optional Mojo Backend

Core API

Layouts

Backends

Public Surface

Mental Model

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes