coreason-search

The Unified Retrieval Execution Engine for the CoReason ecosystem.

coreason-search serves as the "Librarian" of the CoReason architecture. It is designed to solve three distinct problems: Ad-Hoc Reasoning (RAG), Systematic Evidence Synthesis, and Context Distillation. It adopts a "Late Chunking / No Chunking" philosophy: state-of-the-art embedding models with 32k-token context windows process full documents rather than pre-split fragments.

Features

  • Long-Context Sovereignty: Uses state-of-the-art embedding models (e.g., Qwen3-Embedding) with 32k-token context windows to ingest full research papers instead of relying on "chunk-and-pray" methods.
  • Modular Strategy Pattern: Supports multiple retrieval strategies:
    • Dense Retriever: Vector-based semantic search using LanceDB.
    • Sparse Retriever: Boolean/Keyword search for systematic reviews (PubMed style).
    • Graph Retriever: Symbolic 1-hop neighbor expansion using coreason-graph-nexus.
  • Hybrid Fusion: Implements Reciprocal Rank Fusion (RRF) to merge results from vector and keyword searches robustly.
  • Precision Re-Ranking: Uses Cross-Encoders to re-score the top fused results against the query, improving precision at the head of the list.
  • Context Distillation (The Scout): Compresses documents by stripping irrelevant sentences, maximizing the signal-to-noise ratio for downstream LLMs.
  • Systematic Search Mode: Supports "Research-Grade" reproducible reviews with strict boolean logic and generator-based pagination for large result sets.
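
To make the Hybrid Fusion feature above more concrete, here is a minimal, standalone sketch of Reciprocal Rank Fusion. It is not the code used inside coreason-search: the function name `reciprocal_rank_fusion`, the sample document IDs, and the constant `k=60` (a value commonly used for RRF) are assumptions chosen for illustration. Each document receives `1 / (k + rank)` from every ranked list it appears in, and the contributions are summed.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of doc IDs into one list of (doc_id, score).

    Hypothetical helper for illustration; k=60 is an assumed default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top hit in a list contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc_id for determinism.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Made-up result lists standing in for a dense and a keyword retriever.
dense_hits = ["doc_a", "doc_b", "doc_c"]
sparse_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
for doc_id, score in fused:
    print(f"{doc_id}: {score:.5f}")
```

Because RRF only looks at ranks, it does not need the raw scores of the two retrievers to be on a comparable scale, which is one reason it is a common choice for merging vector and keyword results.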

Installation

pip install coreason-search

Usage

from coreason_search.engine import SearchEngine
from coreason_search.schemas import SearchRequest, RetrieverType

# Initialize the engine (loads configuration automatically)
engine = SearchEngine()

# Define a search request
request = SearchRequest(
    query="mechanism of action of aspirin in liver failure",
    strategies=[RetrieverType.LANCE_DENSE, RetrieverType.LANCE_FTS],
    fusion_enabled=True,
    rerank_enabled=True,
    distill_enabled=True,
    top_k=5
)

# Execute the search
response = engine.search(request)

# Process results
for hit in response.hits:
    print(f"[{hit.score:.4f}] {hit.doc_id}")
    print(f"Distilled Context: {hit.distilled_text}\n")
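
The `distilled_text` field printed above is the output of Context Distillation. As a rough illustration of the idea of stripping irrelevant sentences, the sketch below keeps only sentences that share at least one word with the query. This is a deliberately simple stand-in and not how The Scout is implemented: the function `distill`, the `min_overlap` parameter, and the sample text are all hypothetical.

```python
import re


def distill(text, query, min_overlap=1):
    """Keep sentences that share at least `min_overlap` words with the query.

    Hypothetical keyword-overlap filter, used here only to illustrate
    sentence-level distillation.
    """
    query_terms = set(re.findall(r"\w+", query.lower()))
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    kept = []
    for sentence in sentences:
        terms = set(re.findall(r"\w+", sentence.lower()))
        if len(terms & query_terms) >= min_overlap:
            kept.append(sentence)
    return " ".join(kept)


# Made-up document text for the example.
doc = (
    "Aspirin irreversibly inhibits cyclooxygenase enzymes. "
    "The study was funded by a national grant. "
    "In liver failure, aspirin clearance is reduced."
)
distilled = distill(doc, "aspirin liver failure")
print(distilled)
```

A production distiller would likely score sentences with a model rather than by word overlap, but the shape is the same: split, score each sentence against the query, and keep what passes the threshold.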
