coreason-search

The Unified Retrieval Execution Engine for the CoReason ecosystem.

coreason-search serves as the "Librarian" of the CoReason architecture and targets three distinct problems: Ad-Hoc Reasoning (RAG), Systematic Evidence Synthesis, and Context Distillation. It adopts a "Late Chunking / No Chunking" philosophy, using state-of-the-art embedding models with 32k-token context windows to embed full documents rather than pre-split fragments.
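
To make the "Late Chunking" idea concrete, here is a minimal sketch in plain Python. It assumes the general technique: the embedding model first produces contextual token vectors for the whole document, and chunk vectors are pooled from those afterwards. The names `mean_pool` and `late_chunk` and the toy vectors are hypothetical and are not part of the coreason-search API; how the package implements this internally is not shown here.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def mean_pool(vectors: Sequence[Vector]) -> Vector:
    """Average a non-empty sequence of equal-length vectors."""
    if not vectors:
        raise ValueError("cannot pool an empty span")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def late_chunk(
    token_embeddings: Sequence[Vector],
    spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Pool token vectors per (start, end) span AFTER full-document encoding.

    Because every token vector was computed with the whole document in
    context, each chunk vector still reflects its surroundings.
    """
    return [mean_pool(token_embeddings[start:end]) for start, end in spans]


# Toy stand-in for the output of a long-context embedding model
# (4 tokens, 2 dimensions). Real models emit far larger arrays.
token_embeddings = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0], [0.0, 4.0]]

# "Late chunking": two spans pooled from the same full-document pass.
chunk_vectors = late_chunk(token_embeddings, [(0, 2), (2, 4)])

# "No chunking": a single vector for the entire document.
whole_doc_vector = mean_pool(token_embeddings)

print(chunk_vectors)
print(whole_doc_vector)
```

The design point is the order of operations: splitting happens after encoding, so no chunk is embedded in isolation.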

Features

  • Long-Context Sovereignty: Uses state-of-the-art embedding models (e.g., Qwen3-Embedding) with 32k-token context windows to ingest full research papers without "chunk-and-pray" methods.
  • Modular Strategy Pattern: Supports multiple retrieval strategies:
    • Dense Retriever: Vector-based semantic search using LanceDB.
    • Sparse Retriever: Boolean/keyword search for systematic reviews (PubMed-style).
    • Graph Retriever: Symbolic 1-hop neighbor expansion using coreason-graph-nexus.
  • Hybrid Fusion: Implements Reciprocal Rank Fusion (RRF) to merge the ranked results of vector and keyword searches into a single list.
  • Precision Re-Ranking: Uses Cross-Encoders to re-rank the top results and improve precision.
  • Context Distillation (The Scout): Compresses documents by stripping irrelevant sentences, maximizing the signal-to-noise ratio for downstream LLMs.
  • Systematic Search Mode: Supports "Research-Grade" reproducible reviews with strict boolean logic and generator-based pagination for large result sets.
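
The Hybrid Fusion feature in the list above names Reciprocal Rank Fusion. As an illustration, the following sketch implements the standard RRF formula, where each document scores the sum of 1 / (k + rank) over every ranking it appears in. The function name `reciprocal_rank_fusion`, the constant k = 60 (the conventional default in the literature), and the sample document IDs are assumptions for this example; the package's own implementation and defaults may differ.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple


def reciprocal_rank_fusion(
    rankings: Sequence[Sequence[str]],
    k: int = 60,  # assumed smoothing constant, not taken from the package
) -> List[Tuple[str, float]]:
    """Fuse several ranked lists of document IDs into one scored list."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top hit contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical outputs of a dense and a sparse retriever.
dense_hits = ["doc_a", "doc_b", "doc_c"]
sparse_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([dense_hits, sparse_hits])
for doc_id, score in fused:
    print(f"{doc_id}: {score:.5f}")
```

RRF uses only rank positions, so it needs no score normalization between retrievers whose raw scores live on different scales.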

Installation

pip install coreason-search

Usage

from coreason_search.engine import SearchEngine
from coreason_search.schemas import SearchRequest, RetrieverType

# Initialize the engine (loads configuration automatically)
engine = SearchEngine()

# Define a search request
request = SearchRequest(
    query="mechanism of action of aspirin in liver failure",
    strategies=[RetrieverType.LANCE_DENSE, RetrieverType.LANCE_FTS],
    fusion_enabled=True,
    rerank_enabled=True,
    distill_enabled=True,
    top_k=5
)

# Execute the search
response = engine.search(request)

# Process results
for hit in response.hits:
    print(f"[{hit.score:.4f}] {hit.doc_id}")
    print(f"Distilled Context: {hit.distilled_text}\n")
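
The `distilled_text` field printed above corresponds to the Context Distillation feature. The sketch below shows the general idea of sentence-level filtering with a deliberately simple stand-in: it keeps only sentences that share at least one term with the query. The function `distill`, its `min_overlap` parameter, and the sample text are hypothetical; the method the package actually uses to judge relevance is not documented here and is likely more sophisticated.

```python
import re
from typing import List, Set


def _terms(text: str) -> Set[str]:
    """Lowercase alphanumeric tokens of a text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def distill(query: str, document: str, min_overlap: int = 1) -> str:
    """Keep only sentences sharing at least `min_overlap` terms with the query."""
    query_terms = _terms(query)
    # Naive sentence split: break on whitespace that follows ., ! or ?
    sentences: List[str] = re.split(r"(?<=[.!?])\s+", document.strip())
    kept = [s for s in sentences if len(query_terms & _terms(s)) >= min_overlap]
    return " ".join(kept)


document = (
    "Aspirin inhibits cyclooxygenase enzymes. "
    "The study was funded by a grant. "
    "High doses may stress the liver."
)
distilled = distill("aspirin liver", document)
print(distilled)
```

In this toy run the funding sentence is dropped because it shares no term with the query, which is the signal-to-noise effect the feature description refers to.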
