RAG query parsing plugin — parse natural language queries into semantic terms and structured filters using LLMs

LangCore RAG — Query Parsing for Hybrid Retrieval

A plugin for LangExtract that parses natural-language queries into semantic terms (for vector search) and structured metadata filters (for database / index filtering), so that both can feed a hybrid RAG retrieval pipeline. The design is inspired by LangStruct's .query() method.

Note: This is a third-party plugin for LangExtract. For the main LangExtract library, visit google/langextract.

Installation

Install from source:

git clone <repo-url>
cd langcore-rag
pip install -e .

Or with uv:

uv pip install -e .

Features at a Glance

Feature	langcore-rag	LangStruct
Query → semantic terms + filters	✅ `QueryParser.parse()`	✅ `.query()`
Async support	✅ `async_parse()`	✅
Pydantic schema introspection	✅ Auto-discovers filterable fields	✅
MongoDB-style operators	✅ `$eq`, `$gte`, `$lte`, `$in`, `$nin`, etc.	✅
Confidence score	✅ 0.0 – 1.0	❌
Explanation / rationale	✅ Human-readable	❌
Any LLM backend	✅ Via LiteLLM (100+ providers)	✅
Robust JSON parsing	✅ Raw JSON + Markdown fences + graceful fallback	⚠️

Quick Start

1. Define a Schema

from pydantic import BaseModel, Field

class Invoice(BaseModel):
    amount: float = Field(description="Total invoice amount in USD")
    due_date: str = Field(description="Due date in ISO-8601 format")
    vendor: str = Field(description="Vendor / supplier name")
    paid: bool = Field(description="Whether the invoice is paid")

2. Parse a Query

from langcore_rag import QueryParser

parser = QueryParser(schema=Invoice, model_id="gemini/gemini-2.5-flash")
parsed = parser.parse("invoices over $5000 due in March 2024")

print(parsed.semantic_terms)
# → ["invoices"]

print(parsed.structured_filters)
# → {"amount": {"$gte": 5000}, "due_date": {"$gte": "2024-03-01", "$lte": "2024-03-31"}}

print(parsed.confidence)
# → 0.92

print(parsed.explanation)
# → "Extracted amount ≥ 5000 and date range for March 2024."

3. Async Usage

import asyncio
from langcore_rag import QueryParser

async def main():
    parser = QueryParser(schema=Invoice, model_id="gpt-4o")
    parsed = await parser.async_parse("unpaid invoices from Acme Corp")
    print(parsed.structured_filters)
    # → {"paid": {"$eq": False}, "vendor": {"$eq": "Acme Corp"}}

asyncio.run(main())

API Reference

`QueryParser`

QueryParser(
    schema: type[BaseModel],
    model_id: str,
    *,
    temperature: float = 0.0,
    max_tokens: int = 1024,
    **litellm_kwargs,
)

Parameter	Type	Description
`schema`	`type[BaseModel]`	Pydantic model whose fields define filterable metadata
`model_id`	`str`	Any LiteLLM-compatible model ID (e.g. `"gpt-4o"`, `"gemini/gemini-2.5-flash"`, `"anthropic/claude-3-opus"`)
`temperature`	`float`	Sampling temperature (default `0.0` for deterministic output)
`max_tokens`	`int`	Maximum tokens to generate (default `1024`)
`**litellm_kwargs`		Extra kwargs forwarded to `litellm.completion()` (e.g. `api_key`, `api_base`, `timeout`)

Methods

Method	Signature	Description
`parse`	`(query_text: str) -> ParsedQuery`	Synchronous query parsing
`async_parse`	`(query_text: str) -> ParsedQuery`	Asynchronous query parsing

Properties

Property	Type	Description
`schema`	`type[BaseModel]`	The Pydantic schema used for field discovery
`model_id`	`str`	The LiteLLM model identifier
`system_prompt`	`str`	The generated system prompt (useful for debugging)
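
To make the system_prompt property easier to reason about when debugging, here is a minimal sketch of how a prompt could be assembled from a schema's filterable fields. It is not the plugin's implementation: the names FILTERABLE_TYPES, InvoiceFields, filterable_fields, and build_system_prompt are hypothetical, the prompt wording is invented for illustration, and a plain annotated class stands in for the Pydantic model so the example needs only the standard library.

```python
from datetime import date, datetime
from typing import get_type_hints

# Scalar types treated as filterable (assumption based on the README's description).
FILTERABLE_TYPES = (int, float, str, bool, date, datetime)


class InvoiceFields:
    """Plain annotated stand-in for the Pydantic Invoice schema (hypothetical)."""

    amount: float
    due_date: str
    vendor: str
    paid: bool
    tags: list[str]  # complex type, expected to be skipped


def filterable_fields(schema: type) -> dict[str, str]:
    """Map each scalar-typed field name to the name of its type."""
    hints = get_type_hints(schema)
    return {
        name: tp.__name__
        for name, tp in hints.items()
        if tp in FILTERABLE_TYPES
    }


def build_system_prompt(schema: type) -> str:
    """Assemble an illustrative prompt listing the filterable fields."""
    lines = [
        f"- {name} ({type_name})"
        for name, type_name in filterable_fields(schema).items()
    ]
    return (
        "Split the user query into a JSON object with the keys "
        "semantic_terms, structured_filters, confidence, and explanation.\n"
        "Filterable fields:\n" + "\n".join(lines)
    )


print(build_system_prompt(InvoiceFields))
```

In this sketch the list-typed field never reaches the prompt, which mirrors the documented behaviour of excluding complex types. The real prompt also includes field descriptions, which are omitted here.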

`ParsedQuery`

An immutable (frozen) dataclass returned by parse() / async_parse().

Field	Type	Description
`semantic_terms`	`list[str]`	Free-text terms for vector / similarity search
`structured_filters`	`dict[str, Any]`	Metadata filters with MongoDB-style operators
`confidence`	`float`	0.0 – 1.0 confidence in the parse quality
`explanation`	`str`	Human-readable rationale for the decomposition
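
As a rough illustration of what "immutable (frozen) dataclass" means in practice, the sketch below defines a stand-in with the same four fields. ParsedQuerySketch is a hypothetical name; the actual ParsedQuery class may differ in defaults, validation, and helper methods.

```python
from dataclasses import FrozenInstanceError, dataclass
from typing import Any


@dataclass(frozen=True)
class ParsedQuerySketch:
    """Hypothetical mirror of the documented ParsedQuery fields."""

    semantic_terms: list[str]
    structured_filters: dict[str, Any]
    confidence: float
    explanation: str


parsed = ParsedQuerySketch(
    semantic_terms=["invoices"],
    structured_filters={"amount": {"$gte": 5000}},
    confidence=0.92,
    explanation="Extracted amount >= 5000.",
)

try:
    parsed.confidence = 1.0  # frozen dataclasses reject attribute assignment
except FrozenInstanceError:
    print("ParsedQuerySketch is read-only")
```

Note that freezing a dataclass only blocks attribute reassignment. The list and dict held by the instance are still mutable objects, so copy them before modifying filters for a downstream query.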

How It Works

1. Schema introspection: QueryParser inspects the Pydantic model's fields to identify which ones are scalar/filterable (int, float, str, bool, date, datetime). Complex types like list[str] are excluded.
2. System prompt generation: A system prompt is built listing the filterable fields with their types and descriptions, instructing the LLM to output a JSON object with semantic_terms, structured_filters, confidence, and explanation.
3. LLM call: The query text is sent as a user message alongside the system prompt via litellm.completion() (sync) or litellm.acompletion() (async).
4. Response parsing: The LLM's text response is parsed as JSON (handling both raw JSON and Markdown code fences). Values are type-coerced and clamped to produce a valid ParsedQuery.
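
The response-parsing step can be sketched as follows. This is an assumption-laden illustration, not the plugin's code: parse_response and fallback are hypothetical names, the result is a plain dict rather than a ParsedQuery, and the fallback policy (treat the whole query as one semantic term with zero confidence) is a guess at what "graceful fallback" might look like.

```python
import json
import re
from typing import Any

# Markdown code-fence marker, built indirectly so this example stays fence-safe.
FENCE = "`" * 3
# Captures the body of a fence that is optionally tagged "json".
FENCE_RE = re.compile(FENCE + r"(?:json)?\s*(.*?)" + FENCE, re.DOTALL)


def fallback(query_text: str) -> dict[str, Any]:
    """Assumed fallback: keep the whole query as a single semantic term."""
    return {
        "semantic_terms": [query_text],
        "structured_filters": {},
        "confidence": 0.0,
        "explanation": "Could not parse the model response.",
    }


def parse_response(raw: str, query_text: str) -> dict[str, Any]:
    """Parse raw or fenced JSON, then coerce and clamp the values."""
    match = FENCE_RE.search(raw)
    candidate = match.group(1) if match else raw
    try:
        data = json.loads(candidate)
    except json.JSONDecodeError:
        return fallback(query_text)
    if not isinstance(data, dict):
        return fallback(query_text)

    terms = data.get("semantic_terms", [])
    filters = data.get("structured_filters", {})
    try:
        confidence = float(data.get("confidence", 0.0))
    except (TypeError, ValueError):
        confidence = 0.0

    return {
        "semantic_terms": [str(t) for t in terms] if isinstance(terms, list) else [],
        "structured_filters": filters if isinstance(filters, dict) else {},
        "confidence": min(1.0, max(0.0, confidence)),  # clamp to [0.0, 1.0]
        "explanation": str(data.get("explanation", "")),
    }


reply = FENCE + 'json\n{"semantic_terms": ["invoices"], "confidence": 1.7}\n' + FENCE
print(parse_response(reply, "invoices")["confidence"])  # → 1.0
```

Clamping the confidence rather than rejecting out-of-range values keeps a slightly malformed reply usable, at the cost of hiding the model's mistake. Logging the raw response alongside the parsed result is a reasonable complement.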

Supported Filter Operators

The parser instructs the LLM to use MongoDB-style operators:

Operator	Meaning	Example
`$eq`	Equals	`{"vendor": {"$eq": "Acme"}}`
`$ne`	Not equals	`{"paid": {"$ne": true}}`
`$gt`	Greater than	`{"amount": {"$gt": 1000}}`
`$gte`	Greater than or equal	`{"amount": {"$gte": 5000}}`
`$lt`	Less than	`{"amount": {"$lt": 100}}`
`$lte`	Less than or equal	`{"due_date": {"$lte": "2024-12-31"}}`
`$in`	In list	`{"vendor": {"$in": ["Acme", "Globex"]}}`
`$nin`	Not in list	`{"vendor": {"$nin": ["Initech"]}}`
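
The plugin only produces these filters; applying them is left to the database or index. For testing, or for small in-memory collections, a filter dict can be evaluated directly in Python. The sketch below is one possible evaluator: OPERATORS and matches are hypothetical names, the sample documents are invented, and nested or logical operators such as $and / $or are not handled.

```python
import operator
from typing import Any, Callable

# Maps each documented operator to a predicate over (field value, operand).
OPERATORS: dict[str, Callable[[Any, Any], bool]] = {
    "$eq": operator.eq,
    "$ne": operator.ne,
    "$gt": operator.gt,
    "$gte": operator.ge,
    "$lt": operator.lt,
    "$lte": operator.le,
    "$in": lambda value, options: value in options,
    "$nin": lambda value, options: value not in options,
}


def matches(metadata: dict[str, Any], filters: dict[str, Any]) -> bool:
    """Return True when the metadata satisfies every condition in filters."""
    for field_name, conditions in filters.items():
        if field_name not in metadata:
            return False  # a missing field never matches in this sketch
        value = metadata[field_name]
        for op_name, operand in conditions.items():
            # An unknown operator raises KeyError rather than being ignored.
            if not OPERATORS[op_name](value, operand):
                return False
    return True


# Invented sample metadata, shaped like the Invoice schema from the Quick Start.
documents = [
    {"vendor": "Acme", "amount": 7200.0, "due_date": "2024-03-15", "paid": False},
    {"vendor": "Globex", "amount": 900.0, "due_date": "2024-03-20", "paid": True},
    {"vendor": "Initech", "amount": 5000.0, "due_date": "2024-04-02", "paid": False},
]

filters = {
    "amount": {"$gte": 5000},
    "due_date": {"$gte": "2024-03-01", "$lte": "2024-03-31"},
}

hits = [doc["vendor"] for doc in documents if matches(doc, filters)]
print(hits)  # → ['Acme']
```

The date comparison works here only because ISO-8601 date strings sort lexicographically in chronological order. Dates stored in another format would need to be parsed before comparison.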

Development

# Install dev dependencies
uv sync

# Run tests
uv run pytest tests/ -v

# Lint
uv run ruff check langcore_rag/ tests/

# Format
uv run ruff format langcore_rag/ tests/

License

Apache 2.0

Project details

Release history

Version	Release date
1.2.0	Feb 24, 2026
1.1.0	Feb 24, 2026
1.0.4	Feb 23, 2026
1.0.3	Feb 23, 2026
1.0.2	Feb 23, 2026
1.0.1 (this version)	Feb 23, 2026
1.0.0	Feb 23, 2026

Download files

Source Distribution

langcore_rag-1.0.1.tar.gz (19.3 kB)

Uploaded Feb 23, 2026 Source

Built Distribution

langcore_rag-1.0.1-py3-none-any.whl (13.4 kB)

Uploaded Feb 23, 2026 Python 3

File details

Details for the file langcore_rag-1.0.1.tar.gz.

File metadata

Download URL: langcore_rag-1.0.1.tar.gz
Upload date: Feb 23, 2026
Size: 19.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for langcore_rag-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`8ec9174e904a11c65cc6205776b5889230d7b6a3dba8a3a2d1079f7aa9021498`
MD5	`536e0e948bbd4181eede256f1854af6d`
BLAKE2b-256	`52d725175c9fa503357f0eb48d3ce68454cd03f69aed746834516a649c6ad2f2`

Provenance

The following attestation bundles were made for langcore_rag-1.0.1.tar.gz:

Publisher: release.yml on IgnatG/langcore-rag

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langcore_rag-1.0.1.tar.gz
- Subject digest: 8ec9174e904a11c65cc6205776b5889230d7b6a3dba8a3a2d1079f7aa9021498
- Sigstore transparency entry: 980745632
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: IgnatG/langcore-rag@305f64d15e23d5e69a9eb8364838320dac59853e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/IgnatG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@305f64d15e23d5e69a9eb8364838320dac59853e
- Trigger Event: push

File details

Details for the file langcore_rag-1.0.1-py3-none-any.whl.

File metadata

Download URL: langcore_rag-1.0.1-py3-none-any.whl
Upload date: Feb 23, 2026
Size: 13.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for langcore_rag-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dcebf9d174daef712a6358186397b64b5d2e55247a84a26ca8f6c459b750efbb`
MD5	`06763cc464cd78a53a8a0cce983920b5`
BLAKE2b-256	`5cfce58c0de77dfdca04181b6349cb37a2ad9e89a14abd213ae7e7b7288d4a39`

Provenance

The following attestation bundles were made for langcore_rag-1.0.1-py3-none-any.whl:

Publisher: release.yml on IgnatG/langcore-rag

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langcore_rag-1.0.1-py3-none-any.whl
- Subject digest: dcebf9d174daef712a6358186397b64b5d2e55247a84a26ca8f6c459b750efbb
- Sigstore transparency entry: 980745671
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: IgnatG/langcore-rag@305f64d15e23d5e69a9eb8364838320dac59853e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/IgnatG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@305f64d15e23d5e69a9eb8364838320dac59853e
- Trigger Event: push
