Model-aware text chunking and answer re-ranking for LLM pipelines. Automatically adapts chunk size to tokenizer and context window, then consolidates and ranks answers across chunks.

These details have not been verified by PyPI

Project links

Project description

ChunkRank: Model-Aware Chunking + Answer Ranking

Used internally for long-document QA and evaluation pipelines handling 1,000+ PDFs.

ChunkRank is a lightweight Python library that automatically chunks text based on an LLM's tokenizer and context window, then consolidates and ranks answers across chunks.

🔗 PyPI: https://pypi.org/project/chunkrank/

Why ChunkRank?

When working with LLMs, long documents must be split into chunks, but:

Every model has different tokenizers and context limits
Chunk sizes are usually hard-coded and error-prone
Answer quality drops when responses come from multiple chunks
Existing RAG frameworks are heavy when you only need chunking + ranking

ChunkRank solves this gap.

Installation

pip install chunkrank

With semantic chunking + cross-encoder reranking:

pip install chunkrank[semantic]

With all optional backends:

pip install chunkrank[all]

For development:

poetry install --with dev

Quick Example

import chunkrank

text = open("document.txt").read()
question = "What is the main topic of this document?"

chunks = chunkrank.split(text, model="gpt-4o-mini")
answers = chunkrank.answer(question, chunks)
best = chunkrank.rank(answers)

print(best)

Core API

import chunkrank

# 1. Split text into model-aware chunks
chunks = chunkrank.split(text, model="gpt-4o-mini")

# 2. Answer the question across all chunks
#    Default: local extractive (no API key required)
answers = chunkrank.answer(question, chunks)

#    With OpenAI:
answers = chunkrank.answer(question, chunks, provider="openai", api_key="sk-...")

#    With Anthropic:
answers = chunkrank.answer(question, chunks, provider="anthropic", api_key="sk-ant-...")

# 3. Rank and return the best answer
best_answer = chunkrank.rank(answers)

Pipeline API

from chunkrank import ChunkRankPipeline

# Local (no LLM required)
pipe = ChunkRankPipeline(model="gpt-4o-mini")

# With OpenAI
pipe = ChunkRankPipeline(model="gpt-4o-mini", provider="openai", api_key="sk-...")

# With Anthropic
pipe = ChunkRankPipeline(model="gpt-4o-mini", provider="anthropic", api_key="sk-ant-...")

# Process — returns best answer
answer = pipe.process(question="What is the main topic?", text=text)

# Stream — yields answers progressively as each chunk is processed
for partial in pipe.stream(question="What is the main topic?", text=text):
    print(partial)

Async API

from chunkrank import AsyncChunkRankPipeline

pipe = AsyncChunkRankPipeline(model="gpt-4o-mini", provider="openai", api_key="sk-...")

# Parallel chunk answering via asyncio.gather
answer = await pipe.process(question, text)

# Async streaming
async for partial in pipe.stream(question, text):
    print(partial)

Module-level async functions:

import chunkrank

chunks = await chunkrank.async_split(text, model="gpt-4o-mini")
answers = await chunkrank.async_answer(question, chunks)   # parallel LLM calls
best = await chunkrank.async_rank(answers)

Ranking Methods

Method	Description	Extra dep
`bm25` (default)	BM25 lexical ranking	none
`tfidf`	TF-IDF cosine similarity	none
`embedding`	Dense vector similarity	`[semantic]` or `openai-embed`
`cross-encoder`	Semantic cross-encoder (most accurate)	`[semantic]`

from chunkrank import Ranker

ranker = Ranker(method="cross-encoder")
ranked = ranker.rank(question, answers)

Chunking Strategies

# Token-budget sliding window (default)
chunks = chunkrank.split(text, model="gpt-4o-mini", strategy="tokens", overlap_tokens=64)

# Semantic — splits on embedding similarity drops between sentences
chunks = chunkrank.split(text, model="gpt-4o-mini", strategy="semantic", similarity_threshold=0.5)

Retrieve-then-Answer (top-K)

Rank chunks first, answer only the top-K — reduces LLM calls on large documents:

pipe = ChunkRankPipeline(model="gpt-4o-mini", retrieval_top_k=3)
answer = pipe.process(question, text)

Disk Cache

Avoid re-chunking the same document on repeated runs:

from chunkrank import ChunkCache, Chunker, ChunkerConfig

cache = ChunkCache(".chunkrank_cache")
chunks = cache.get(text, model="gpt-4o-mini")
if chunks is None:
    chunks = Chunker(ChunkerConfig(model="gpt-4o-mini")).split(text)
    cache.set(text, model="gpt-4o-mini", chunks=chunks)

Runtime Model Registration

import chunkrank

chunkrank.register_model("my-custom-model", max_context=200_000)

Supported Models

54 models in the built-in registry, including:

Provider	Models
OpenAI	gpt-4o, gpt-4o-mini, gpt-4-turbo, gpt-4, gpt-3.5-turbo, o1, o3, o3-mini, o4-mini
Anthropic	claude-3-opus/sonnet/haiku, claude-3-5-sonnet/haiku, claude-sonnet-4-6, claude-opus-4-6
Google	gemini-1.5-pro, gemini-1.5-flash, gemini-2.0-flash, gemini-2.5-pro
Meta	Llama-3.1/3.2/3.3, Llama-4-Scout (10M ctx), Llama-4-Maverick
Mistral	mistral-7b, mixtral-8x7b, mistral-large, codestral
Cohere	command-r, command-r-plus, command-r7b
DeepSeek	deepseek-v3, deepseek-r1
Qwen	qwen2.5-72b-instruct, qwen2.5-coder-32b-instruct

Unknown models fall back to 128k context with tiktoken (o200k_base).

How It Fits

Tool	What it does
LangChain / LlamaIndex	Full RAG pipelines
Haystack	End-to-end retrieval frameworks
ChunkRank	Focused, model-aware chunking + answer ranking

ChunkRank complements RAG frameworks — it doesn't replace them.

Requirements

Python 3.10+
numpy, scikit-learn, rank-bm25

Community

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.2.0

Apr 23, 2026

1.1.3

Apr 20, 2026

1.1.2

Apr 20, 2026

1.1.1

Apr 20, 2026

This version

1.1.0

Apr 20, 2026

1.0.0

Mar 10, 2026

0.2.4

Jan 15, 2026

0.2.3

Jan 15, 2026

0.2.2

Jan 15, 2026

0.2.1

Jan 15, 2026

0.2.0

Jan 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chunkrank-1.1.0.tar.gz (14.8 kB view details)

Uploaded Apr 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chunkrank-1.1.0-py3-none-any.whl (18.5 kB view details)

Uploaded Apr 20, 2026 Python 3

File details

Details for the file chunkrank-1.1.0.tar.gz.

File metadata

Download URL: chunkrank-1.1.0.tar.gz
Upload date: Apr 20, 2026
Size: 14.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.14.3 Darwin/25.3.0

File hashes

Hashes for chunkrank-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`6cdd39e0adbcdcae23de62bed8559922b9dd55dfa4144791855b06142fd0eab4`
MD5	`99f030649f01d450292961d365632c77`
BLAKE2b-256	`240e413d08d5155a7f663c216c6ba4e33c40c9019f67e0f95d3da536135f25d4`

See more details on using hashes here.

File details

Details for the file chunkrank-1.1.0-py3-none-any.whl.

File metadata

Download URL: chunkrank-1.1.0-py3-none-any.whl
Upload date: Apr 20, 2026
Size: 18.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.14.3 Darwin/25.3.0

File hashes

Hashes for chunkrank-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f34477e1a8582b99f1d66300ed3722b5114fc81b09681df02b0c8e2f194fe304`
MD5	`93d237d4ae173370bcafb2dba129855e`
BLAKE2b-256	`1e293602a2f97fba8af9915682dffdd98994013aa747f0ccd02cca4da65edfe6`

See more details on using hashes here.

chunkrank 1.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ChunkRank: Model-Aware Chunking + Answer Ranking

Why ChunkRank?

Installation

Quick Example

Core API

Pipeline API

Async API

Ranking Methods

Chunking Strategies

Retrieve-then-Answer (top-K)

Disk Cache

Runtime Model Registration

Supported Models

How It Fits

Requirements

Community

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes