A Python library for TiDB.

Project description

TiDB Python AI SDK

Quick Start • Documentation • Examples • Roadmap • Discord • Report Bug

Introduction

Python SDK for TiDB AI: A unified data platform empowering developers to build next-generation AI applications.

🔍 Unified Search Modes: Vector · Full‑Text · Hybrid
🎭 Auto‑Embedding & Multi‑Modal Storage: Support for text, images, and more
🖼️ Image Search Support: Text‑to‑image and image‑to‑image retrieval capabilities
🎯 Advanced Filtering & Reranking: Flexible filters with optional reranker models to fine-tune result relevance
💱 Transaction Support: Full transaction management including commit/rollback to ensure consistency

Installation

[!NOTE] This Python package is under rapid development and its API may change. It is recommended to use a fixed version when installing, e.g., pytidb==0.0.12.

pip install pytidb

# To use built-in embedding functions and rerankers:
pip install "pytidb[models]"

# To convert query results to pandas DataFrame:
pip install pandas

Connect to TiDB Cloud

Create a free TiDB cluster at tidbcloud.com.

import os
from pytidb import TiDBClient

tidb_client = TiDBClient.connect(
    host=os.getenv("TIDB_HOST"),
    port=int(os.getenv("TIDB_PORT")),
    username=os.getenv("TIDB_USERNAME"),
    password=os.getenv("TIDB_PASSWORD"),
    database=os.getenv("TIDB_DATABASE"),
    ensure_db=True,
)

Highlights

🤖 Automatic Embedding

PyTiDB automatically embeds text fields (e.g., text) and stores the vector embedding in a vector field (e.g., text_vec).

Create a table with an embedding function:

from pytidb.schema import TableModel, Field, FullTextField
from pytidb.embeddings import EmbeddingFunction

# Set API key for embedding provider.
tidb_client.configure_embedding_provider("openai", api_key=os.getenv("OPENAI_API_KEY"))

class Chunk(TableModel):
    __tablename__ = "chunks"

    id: int = Field(primary_key=True)
    text: str = FullTextField()
    text_vec: list[float] = EmbeddingFunction(
        "openai/text-embedding-3-small"
    ).VectorField(source_field="text")  # 👈 Defines the vector field.
    user_id: int = Field()

table = tidb_client.create_table(schema=Chunk, if_exists="skip")

Bulk insert data:

table.bulk_insert([
    Chunk(id=2, text="bar", user_id=2),   # 👈 The text field is embedded and saved to text_vec automatically.
    Chunk(id=3, text="baz", user_id=3),
    Chunk(id=4, text="qux", user_id=4),
])

🔍 Search

Vector Search

Vector search finds the most relevant records based on semantic similarity, so you don't need to include all keywords explicitly in your query.

df = (
  table.search("<query>")  # 👈 The query is embedded automatically.
    .filter({"user_id": 2})
    .limit(2)
    .to_list()
)
# Output: A list of dicts.

See the Vector Search example for more details.

Full-text Search

Full-text search tokenizes the query and finds the most relevant records by matching exact keywords.

df = (
  table.search("<query>", search_type="fulltext")
    .limit(2)
    .to_pydantic()
)
# Output: A list of pydantic model instances.

See the Full-text Search example for more details.

Hybrid Search

Hybrid search combines exact matching from full-text search with semantic understanding from vector search, delivering more relevant and reliable results.

df = (
  table.search("<query>", search_type="hybrid")
    .limit(2)
    .to_pandas()
)
# Output: A pandas DataFrame.

See the Hybrid Search example for more details.

Image Search

Image search lets you find visually similar images using natural language descriptions or another image as a reference.

from PIL import Image
from pytidb.schema import TableModel, Field
from pytidb.embeddings import EmbeddingFunction

# Define a multi-modal embedding model.
jina_embed_fn = EmbeddingFunction("jina_ai/jina-embeddings-v4")  # Using multi-modal embedding model.

class Pet(TableModel):
    __tablename__ = "pets"
    id: int = Field(primary_key=True)
    image_uri: str = Field()
    image_vec: list[float] = jina_embed_fn.VectorField(
        source_field="image_uri",
        source_type="image"
    )

table = tidb_client.create_table(schema=Pet, if_exists="skip")

# Insert sample images ...
table.insert(Pet(image_uri="path/to/shiba_inu_14.jpg"))

# Search for images using natural language
results = table.search("shiba inu dog").limit(1).to_list()

# Search for images using an image ...
query_image = Image.open("shiba_inu_15.jpg")
results = table.search(query_image).limit(1).to_pydantic()

See the Image Search example for more details.

Advanced Filtering

PyTiDB supports a variety of operators for flexible filtering:

Operator	Description	Example
`$eq`	Equal to	`{"field": {"$eq": "hello"}}`
`$gt`	Greater than	`{"field": {"$gt": 1}}`
`$gte`	Greater than or equal	`{"field": {"$gte": 1}}`
`$lt`	Less than	`{"field": {"$lt": 1}}`
`$lte`	Less than or equal	`{"field": {"$lte": 1}}`
`$in`	In array	`{"field": {"$in": [1, 2, 3]}}`
`$nin`	Not in array	`{"field": {"$nin": [1, 2, 3]}}`
`$and`	Logical AND	`{"$and": [{"field1": 1}, {"field2": 2}]}`
`$or`	Logical OR	`{"$or": [{"field1": 1}, {"field2": 2}]}`

⛓ Join Structured and Unstructured Data

from pytidb import Session
from pytidb.sql import select

# Create a table to store user data:
class User(TableModel):
    __tablename__ = "users"
    id: int = Field(primary_key=True)
    name: str = Field(max_length=20)

# Use the db_engine from TiDBClient when creating a Session
with Session(tidb_client.db_engine) as session:
    query = (
        select(Chunk).join(User, Chunk.user_id == User.id).where(User.name == "Alice")
    )
    chunks = session.exec(query).all()

[(c.id, c.text, c.user_id) for c in chunks]

💱 Transaction Support

PyTiDB supports transaction management, helping you avoid race conditions and ensure data consistency.

with tidb_client.session() as session:
    initial_total_balance = tidb_client.query("SELECT SUM(balance) FROM players").scalar()

    # Transfer 10 coins from player 1 to player 2
    tidb_client.execute("UPDATE players SET balance = balance - 10 WHERE id = 1")
    tidb_client.execute("UPDATE players SET balance = balance + 10 WHERE id = 2")

    session.commit()
    # or session.rollback()

    final_total_balance = tidb_client.query("SELECT SUM(balance) FROM players").scalar()
    assert final_total_balance == initial_total_balance

Extensions

🔌 Built-in MCP support

[!TIP] Click the button below to install TiDB MCP Server in Cursor. Then, confirm by clicking Install when prompted.

Project details

Release history Release notifications | RSS feed

0.0.15.dev1 pre-release

Mar 6, 2026

0.0.14

Feb 3, 2026

This version

0.0.14.dev2 pre-release

Feb 3, 2026

0.0.14.dev1 pre-release

Feb 3, 2026

0.0.13

Aug 29, 2025

0.0.13.dev8 pre-release

Aug 28, 2025

0.0.13.dev7 pre-release

Aug 28, 2025

0.0.13.dev6 pre-release

Aug 28, 2025

0.0.13.dev5 pre-release

Aug 28, 2025

0.0.13.dev4 pre-release

Aug 28, 2025

0.0.13.dev3 pre-release

Aug 28, 2025

0.0.13.dev2 pre-release

Aug 28, 2025

0.0.13.dev1 pre-release

Aug 28, 2025

0.0.12

Aug 8, 2025

0.0.11

Aug 5, 2025

0.0.10.post1

Aug 4, 2025

0.0.10.dev1 pre-release

Jul 21, 2025

0.0.9

Jul 17, 2025

0.0.9.dev1 pre-release

Jul 9, 2025

0.0.8.post2

Jul 1, 2025

0.0.8.post1

Jul 1, 2025

0.0.8

Jul 1, 2025

0.0.8.dev3 pre-release

Jul 1, 2025

0.0.8.dev2 pre-release

Jun 26, 2025

0.0.8.dev1 pre-release

Jun 25, 2025

0.0.7

Jun 4, 2025

0.0.6

May 29, 2025

0.0.6.dev1 pre-release

May 28, 2025

0.0.5.post2

May 20, 2025

0.0.5.post1

May 20, 2025

0.0.5

Apr 25, 2025

0.0.5.dev8 pre-release

Apr 24, 2025

0.0.5.dev7 pre-release

Apr 24, 2025

0.0.5.dev6 pre-release

Apr 24, 2025

0.0.5.dev5 pre-release

Apr 23, 2025

0.0.5.dev4 pre-release

Apr 23, 2025

0.0.5.dev3 pre-release

Apr 22, 2025

0.0.5.dev2 pre-release

Apr 22, 2025

0.0.5.dev1 pre-release

Apr 21, 2025

0.0.4

Apr 17, 2025

0.0.4.dev1 pre-release

Apr 13, 2025

0.0.3

Apr 9, 2025

0.0.3.dev2 pre-release

Apr 8, 2025

0.0.3.dev1 pre-release

Apr 8, 2025

0.0.2.post7

Apr 2, 2025

0.0.2.post6

Mar 27, 2025

0.0.2.post5

Mar 26, 2025

0.0.2.post4

Mar 26, 2025

0.0.2.post3

Mar 26, 2025

0.0.2.post2

Mar 26, 2025

0.0.2

Mar 26, 2025

0.0.1

Jan 6, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytidb-0.0.14.dev2.tar.gz (47.2 kB view details)

Uploaded Feb 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pytidb-0.0.14.dev2-py3-none-any.whl (58.1 kB view details)

Uploaded Feb 3, 2026 Python 3

File details

Details for the file pytidb-0.0.14.dev2.tar.gz.

File metadata

Download URL: pytidb-0.0.14.dev2.tar.gz
Upload date: Feb 3, 2026
Size: 47.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.8

File hashes

Hashes for pytidb-0.0.14.dev2.tar.gz
Algorithm	Hash digest
SHA256	`e940f3c8d413977382f6dbb6d320dd2f718c903d1872fc62639a848fc01fb23a`
MD5	`7aace53adf966739f193aeeeffddca0a`
BLAKE2b-256	`8849bee7bafebc2713cd5de74a49d4ff44b93bfa279bb9129db1d96d6ed0f9b7`

See more details on using hashes here.

File details

Details for the file pytidb-0.0.14.dev2-py3-none-any.whl.

File metadata

Download URL: pytidb-0.0.14.dev2-py3-none-any.whl
Upload date: Feb 3, 2026
Size: 58.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.8

File hashes

Hashes for pytidb-0.0.14.dev2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d5494adbf22e47a20c77e0bf7a28a616c80208daf4e1a2f1f8eada4e8c005293`
MD5	`61e4bfe9fe99226a5d9a0ddc25014e4d`
BLAKE2b-256	`9049926dfbe25dfc7679f88494f7c26f0814246e0832b8703553e594524544d8`

See more details on using hashes here.

pytidb 0.0.14.dev2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

TiDB Python AI SDK

Quick Start • Documentation • Examples • Roadmap • Discord • Report Bug

Introduction

Installation

Connect to TiDB Cloud

Highlights

🤖 Automatic Embedding

🔍 Search

Advanced Filtering

⛓ Join Structured and Unstructured Data

💱 Transaction Support

Extensions

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes