Context-aware similarity search. Dynamic metric routing that understands your query and your data.

These details have not been verified by PyPI

Project links

Project description

refract

Smart similarity search that understands your query and your data.

refract is a Python library that replaces static cosine similarity with a dynamic, context-aware mixture of similarity metrics -- weighted based on the nature of your query and the geometry of your search space. Every result comes with a provenance trace explaining exactly how it was scored.

flowchart LR
    Q["Query"] --> QA["Query Analyzer"]
    C["Corpus"] --> SA["Space Analyzer"]
    QA --> R["Router"]
    SA --> R
    R --> |"weights"| F["Fusion Engine"]
    M["Metrics\n cosine | bm25 | mahalanobis | euclidean"] --> F
    F --> RES["Ranked Results\n+ Provenance"]

The problem with cosine similarity

Cosine similarity is the default for vector search. But it assumes the embedding space is flat and isotropic, that all dimensions contribute equally, and that the same metric works for every query type.

None of these are true. Transformer embeddings are anisotropic. Hierarchical relationships are not linear. The right notion of "similar" for "sort list python" is different from "what are the philosophical implications of determinism".

What refract does

Instead of score(x, y) = cosine(x, y), refract computes:

score(x, y | query, space) = sum_i  w_i(query, space) * sim_i(x, y)

The weights w_i are determined dynamically by analyzing:

Query type -- keyword, natural language, code, or structured data
Search space geometry -- density, variance, anisotropy of the candidate pool
Discriminability -- which metrics actually separate candidates for this query

Why not just...

Approach	What it does	refract difference
Cosine similarity	Single static metric	Refract dynamically routes across multiple metrics
Hybrid search (BM25 + dense)	Static weights (e.g., 0.7/0.3)	Weights are dynamic and space-aware
Rerankers (cross-encoders)	Post-processes a fixed candidate set	Refract changes how scoring happens, not just the order after
Learning to rank	Learns feature weights	Refract works out of the box (heuristic), with optional learned routing
FAISS / Qdrant / Pinecone	ANN indexing & search	Infrastructure layer -- refract operates above these

Install

# Core (numpy, scipy, scikit-learn, rank_bm25 only)
pip install refract-search

# With local embeddings
pip install "refract-search[sentence-transformers]"

# With OpenAI embeddings
pip install "refract-search[openai]"

# Everything
pip install "refract-search[all]"

Quickstart

import refract

docs = [
    "Sort a Python list using the sorted() built-in.",
    "Neural networks learn representations of data.",
    "Retrieve relevant documents from a large corpus.",
    "Use cosine similarity to measure vector closeness.",
]

results = refract.search("how do I sort things in Python", docs)

for r in results:
    print(f"{r.score:.3f}  {r.text}")

Output:

0.726  Sort a Python list using the sorted() built-in function.
0.357  Python decorators modify function behavior at definition time.
0.333  Use cosine similarity to measure vector closeness in embedding space.

Score provenance

Every result explains why it ranked where it did:

result = results[0]
print(result.provenance)
# Provenance(score=0.726, router='heuristic', query_type='natural_language',
#   density='medium',
#   [cosine=0.895x0.50, bm25=0.877x0.15, mahalanobis=0.412x0.25, euclidean=0.513x0.10])

With an embedder

from refract.embedders.sentence_transformers import SentenceTransformerEmbedder

embedder = SentenceTransformerEmbedder("all-MiniLM-L6-v2")
results = refract.search("machine learning fundamentals", docs, embedder=embedder)

With pre-computed vectors

import numpy as np

query_vec = np.random.randn(384)
corpus_vecs = np.random.randn(100, 384)

results = refract.search(query_vec, corpus_vecs)

Batch search

Amortize corpus analysis across multiple queries:

results = refract.search_batch(
    ["query one", "query two", "query three"],
    documents,
    top_k=5,
)
# Returns: list[list[SearchResult]]

Custom metric

from refract.metrics import BaseMetric
import numpy as np

class MyDomainMetric(BaseMetric):
    name = "my_metric"

    def score(self, query_vec: np.ndarray, candidate_vec: np.ndarray) -> float:
        return float(np.dot(query_vec, candidate_vec))

results = refract.search(query, docs, metrics=["cosine", "bm25", MyDomainMetric()])

Custom router

from refract.routing import BaseRouter

class MyRouter(BaseRouter):
    name = "my_router"

    def route(self, query_profile, space_profile, available_metrics):
        if query_profile.query_type == "code":
            return {"cosine": 0.7, "bm25": 0.3}
        return {"cosine": 0.5, "mahalanobis": 0.3, "bm25": 0.2}

results = refract.search(query, docs, router=MyRouter())

Use as a RAG retrieval step

import refract

def retrieve(query: str, knowledge_base: list[str], top_k: int = 5) -> list[str]:
    results = refract.search(query, knowledge_base, top_k=top_k)
    return [r.text for r in results]

# Feed into your LLM
context = retrieve("What is the refund policy?", documents)

Benchmarking

Prove it works on your data:

from refract.benchmark import BenchmarkHarness, CustomDataset

dataset = CustomDataset(
    name="my_data",
    queries=["query 1", "query 2"],
    corpus=["doc 1", "doc 2", "doc 3"],
    relevance={0: {0}, 1: {1, 2}},
)

harness = BenchmarkHarness()
results = harness.run(dataset, compare_cosine_baseline=True)

for r in results:
    print(f"{r.method:20s}  NDCG@10={r.ndcg_at_10:.3f}  Recall@10={r.recall_at_10:.3f}")

Routing modes

Mode	When to use	Training required
`HeuristicRouter` (default)	Always -- good out of the box	No
`LearnedRouter`	When you have relevance feedback data	Yes
`CompositeRouter`	Blend multiple routers	Depends
`BaseRouter` subclass	Full custom control	You decide

Works with

refract is not a vector database or RAG framework. It makes the scoring step smarter. Use it with:

Vector DBs: FAISS, Qdrant, Pinecone, Weaviate, Milvus, Chroma
RAG frameworks: LangChain, LlamaIndex, Haystack
Embeddings: OpenAI, Cohere, sentence-transformers, any custom model

Design principles

Progressive complexity. Five lines to get started. Full control available.
Embedding-agnostic. Bring your own vectors; embedders are optional extras.
Explainable by default. Every score comes with a provenance trace.
Pluggable everywhere. Metrics, routers, and embedders all follow stable interfaces.
No reinventing the wheel. Does not store vectors. Does not orchestrate LLMs. One job: smarter scoring.

Project structure

src/refract/
  search.py          # Main API: refract.search(), refract.search_batch()
  types.py           # SearchResult, Provenance, QueryProfile, SpaceProfile
  analysis/          # Query type detection + space geometry analysis
  metrics/           # Cosine, Euclidean, Mahalanobis, BM25 + registry
  routing/           # HeuristicRouter, LearnedRouter, CompositeRouter
  fusion/            # Weighted score fusion with provenance
  embedders/         # SentenceTransformer, OpenAI, Cohere (optional)
  benchmark/         # Evaluation harness with NDCG, Recall, MRR

Examples

Example	Description
`quickstart.py`	5-line usage demo
`rag_pipeline.py`	RAG retrieval step
`code_search.py`	Code similarity + query type detection
`custom_metric.py`	Plug in your own metric
`compare_cosine.py`	Side-by-side vs vanilla cosine
`benchmark_demo.py`	Evaluation harness demo
`vector_db_integration.py`	FAISS/Qdrant integration pattern

Contributing

See CONTRIBUTING.md for development setup and guidelines.

Roadmap

0.1.0 -- Core API, heuristic router, cosine / euclidean / mahalanobis / BM25 (you are here)
0.2.0 -- Learned router with PyTorch gating network
0.3.0 -- BEIR benchmark harness with published results
1.0.0 -- Stable API, comprehensive benchmarks, documentation site

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.0

Apr 11, 2026

0.1.2

Apr 10, 2026

This version

0.1.1

Apr 10, 2026

0.1.0

Apr 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

refract_search-0.1.1.tar.gz (53.1 kB view details)

Uploaded Apr 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

refract_search-0.1.1-py3-none-any.whl (45.8 kB view details)

Uploaded Apr 10, 2026 Python 3

File details

Details for the file refract_search-0.1.1.tar.gz.

File metadata

Download URL: refract_search-0.1.1.tar.gz
Upload date: Apr 10, 2026
Size: 53.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for refract_search-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`f01471caa09b915b4dbcafec1c55ae2dbd70a2933c1c4b6264ae9cd70ce6246b`
MD5	`78b7dfcb728dc048892fb63ee5dbf3d8`
BLAKE2b-256	`1ca2c0d8a3a3724fe16a4bc0f0b052d904fb654d985fcdced5d33f5c94e552c6`

See more details on using hashes here.

File details

Details for the file refract_search-0.1.1-py3-none-any.whl.

File metadata

Download URL: refract_search-0.1.1-py3-none-any.whl
Upload date: Apr 10, 2026
Size: 45.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for refract_search-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`14eef9d2776e92c3849a5264110b3f2f11c32c6a57a9bcd024e16dd60b37b5c5`
MD5	`f16ac05dc196f8e26eec00e6b276e16d`
BLAKE2b-256	`efbdef79ae9b30016105aa0c0fe47ffc61749c1437d369f00a5714cbe5fbf795`

See more details on using hashes here.

refract-search 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

refract

The problem with cosine similarity

What refract does

Why not just...

Install

Quickstart

Score provenance

With an embedder

With pre-computed vectors

Batch search

Custom metric

Custom router

Use as a RAG retrieval step

Benchmarking

Routing modes

Works with

Design principles

Project structure

Examples

Contributing

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes