Skip to main content

coreason-search

Project description

coreason-search

The Unified Retrieval Execution Engine for the CoReason ecosystem.

License CI Ruff Docs

coreason-search serves as the "Librarian" of the CoReason architecture, designed to solve three distinct problems: Ad-Hoc Reasoning (RAG), Systematic Evidence Synthesis, and Context Distillation. It adopts a "Late Chunking / No Chunking" philosophy, utilizing SOTA 32k context embeddings to process full documents.

Features

  • Long-Context Sovereignty: Utilizes SOTA embeddings (e.g., Qwen3-Embedding) with 32k context windows to ingest full research papers without "chunk-and-pray" methods.
  • Modular Strategy Pattern: Supports multiple retrieval strategies:
    • Dense Retriever: Vector-based semantic search using LanceDB.
    • Sparse Retriever: Boolean/Keyword search for systematic reviews (PubMed style).
    • Graph Retriever: Symbolic 1-hop neighbor expansion using coreason-graph-nexus.
  • Hybrid Fusion: Implements Reciprocal Rank Fusion (RRF) to merge results from vector and keyword searches robustly.
  • Precision Re-Ranking: Uses Cross-Encoders to re-rank top results for maximum precision.
  • Context Distillation (The Scout): Compresses documents by stripping irrelevant sentences, maximizing the signal-to-noise ratio for downstream LLMs.
  • Systematic Search Mode: Supports "Research-Grade" reproducible reviews with strict boolean logic and generator-based pagination for large result sets.

Installation

pip install coreason-search

Usage

from coreason_search.engine import SearchEngine
from coreason_search.schemas import SearchRequest, RetrieverType

# Initialize the engine (loads configuration automatically)
engine = SearchEngine()

# Define a search request
request = SearchRequest(
    query="mechanism of action of aspirin in liver failure",
    strategies=[RetrieverType.LANCE_DENSE, RetrieverType.LANCE_FTS],
    fusion_enabled=True,
    rerank_enabled=True,
    distill_enabled=True,
    top_k=5
)

# Execute the search
response = engine.search(request)

# Process results
for hit in response.hits:
    print(f"[{hit.score:.4f}] {hit.doc_id}")
    print(f"Distilled Context: {hit.distilled_text}\n")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coreason_search-0.4.0.tar.gz (25.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

coreason_search-0.4.0-py3-none-any.whl (41.0 kB view details)

Uploaded Python 3

File details

Details for the file coreason_search-0.4.0.tar.gz.

File metadata

  • Download URL: coreason_search-0.4.0.tar.gz
  • Upload date:
  • Size: 25.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for coreason_search-0.4.0.tar.gz
Algorithm Hash digest
SHA256 6c8a3fc57585eb40c108796ea0d8d0bcd890321c2704ea4517361bce78870b77
MD5 fa6c2f9581ac6e42413a2ed8ddcd5bc0
BLAKE2b-256 6812d95295ff3e1eb02ec9e7998806c161d82d0743c0df7b714e91b038ac352a

See more details on using hashes here.

Provenance

The following attestation bundles were made for coreason_search-0.4.0.tar.gz:

Publisher: publish.yml on CoReason-AI/coreason-search

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file coreason_search-0.4.0-py3-none-any.whl.

File metadata

File hashes

Hashes for coreason_search-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dfe957b8bc4ad5a5fb394e0c872516d4d341623eebe6660287992620b682b8c0
MD5 3cbf40913a57148054786e26949f14ff
BLAKE2b-256 43244f08f73ae3076f56256cf16150f0fb2ed736187403042bdaff17d55731d2

See more details on using hashes here.

Provenance

The following attestation bundles were made for coreason_search-0.4.0-py3-none-any.whl:

Publisher: publish.yml on CoReason-AI/coreason-search

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page