Skip to main content

The Superlinked vector computing library

Project description

SIE: Superlinked Inference Engine is an open-source inference server for small AI models: embeddings, reranking, and extraction.
It runs on your own infrastructure, from a laptop to a production Kubernetes cluster.
The legacy superlinked Python framework is deprecated for new projects. Use sie-sdk instead.

Superlinked Python Framework (Deprecated)

Deprecated for new projects

The superlinked Python framework is deprecated for new projects.

Superlinked now focuses on SIE: Superlinked Inference Engine, an open-source inference server for small AI models. SIE runs encoders, rerankers, and entity extractors on your own infrastructure, from a laptop to a production Kubernetes cluster.

This package does not install SIE. It remains available only for existing projects that already depend on the legacy Superlinked framework.

New projects should use SIE instead:

pip install sie-sdk

SIE GitHub: https://github.com/superlinked/sie
SIE docs: https://superlinked.com/docs
Superlinked website: https://superlinked.com

Use SIE for new projects

SIE is the current Superlinked inference product for semantic search, retrieval, RAG, reranking, and information extraction workloads. It exposes three primitives:

  • encode converts text or images to vectors for semantic search and RAG.
  • score reranks query-document pairs for higher-precision retrieval.
  • extract pulls entities and structured data from unstructured text.

Use SIE when you need to:

  • Serve 85+ supported models from one API.
  • Generate dense, sparse, multi-vector, and vision embeddings.
  • Rerank candidate documents or items with reranking models.
  • Extract entities and structured information from text.
  • Run inference locally with Docker.
  • Deploy inference on production Kubernetes infrastructure.
  • Integrate inference into search, RAG, and document-processing systems.

SIE is not a drop-in replacement for the deprecated superlinked framework. It is the new Superlinked inference product for new applications and infrastructure.

Quickstart with SIE

SIE runs as a Docker container. Your application calls it over HTTP. Start the server, install the SDK, then call encode, score, or extract.

SIE's primary target is x86 Linux nodes with NVIDIA GPUs. The CPU image is useful for trying SIE locally.

1. Run the engine

# macOS (Apple Silicon)
docker run --platform linux/amd64 -p 8080:8080 \
  -v sie-hf-cache:/app/.cache/huggingface \
  ghcr.io/superlinked/sie-server:latest-cpu-default

# Linux (CPU)
docker run -p 8080:8080 \
  -v sie-hf-cache:/app/.cache/huggingface \
  ghcr.io/superlinked/sie-server:latest-cpu-default

# Linux (NVIDIA GPU)
docker run --gpus all -p 8080:8080 \
  -v sie-hf-cache:/app/.cache/huggingface \
  ghcr.io/superlinked/sie-server:latest-cuda12-default

The server starts on port 8080. Models are available through the API and load on first request.

Optional readiness check:

curl http://localhost:8080/readyz
# "ok"

2. Install the SDK

pip install sie-sdk           # Python
pnpm add @superlinked/sie-sdk # TypeScript

3. Generate embeddings

from sie_sdk import SIEClient
from sie_sdk.types import Item

client = SIEClient("http://localhost:8080")

result = client.encode("BAAI/bge-m3", Item(text="Hello world"))
print(result["dense"].shape)  # (1024,)

4. Rerank search results

query = Item(text="What is machine learning?")
items = [
    Item(text="Machine learning uses algorithms to learn from data."),
    Item(text="The weather is sunny today."),
]

result = client.score("BAAI/bge-reranker-v2-m3", query, items)

for entry in result["scores"]:
    print(f"Rank {entry['rank']}: score={entry['score']:.3f}")

5. Extract entities

result = client.extract(
    "urchade/gliner_multi-v2.1",
    Item(text="Tim Cook is the CEO of Apple."),
    labels=["person", "organization"],
)

for entity in result["entities"]:
    print(f"{entity['label']}: {entity['text']}")

What changed?

The superlinked package originally provided a Python framework for building vector search and recommendation applications with declarative schemas, indexes, query-time weighting, and vector database integrations.

That legacy framework is now deprecated for new projects.

Superlinked now focuses on SIE: Superlinked Inference Engine. SIE is designed for teams that need a unified inference layer for modern search, retrieval, RAG, reranking, and information extraction workloads.

Should I still use this package?

Use superlinked only if you are maintaining an existing application that already depends on the legacy framework.

Do not start new projects with this package. For new development, use SIE.

Links

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

superlinked-37.5.2.tar.gz (237.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

superlinked-37.5.2-py3-none-any.whl (565.3 kB view details)

Uploaded Python 3

File details

Details for the file superlinked-37.5.2.tar.gz.

File metadata

  • Download URL: superlinked-37.5.2.tar.gz
  • Upload date:
  • Size: 237.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for superlinked-37.5.2.tar.gz
Algorithm Hash digest
SHA256 4e85e023d729541fea8ff33cf65c5dd3f03649e7342fa6239010942b694fd27c
MD5 b2d1322a89ea3c7ab18f6aaba3b9c64f
BLAKE2b-256 3de2837b49b4fde2d70589d83698b2c7653da8b09d21a062a34fd30062686aa1

See more details on using hashes here.

Provenance

The following attestation bundles were made for superlinked-37.5.2.tar.gz:

Publisher: python.yml on superlinked/superlinked-internal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file superlinked-37.5.2-py3-none-any.whl.

File metadata

  • Download URL: superlinked-37.5.2-py3-none-any.whl
  • Upload date:
  • Size: 565.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for superlinked-37.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b04ad1bbaa38a5b50538d9896e1ff7f874e6722caa1603a9f4258327b1ac74db
MD5 01122d4384ac9c6a2087d77b239c4b15
BLAKE2b-256 afbb8ad45499b6427781b5b022448c1f93d51289ca1bf3679418c0482b2f3bb7

See more details on using hashes here.

Provenance

The following attestation bundles were made for superlinked-37.5.2-py3-none-any.whl:

Publisher: python.yml on superlinked/superlinked-internal

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page