Model-aware text chunking and answer re-ranking for LLM pipelines. Automatically adapts chunk size to tokenizer and context window, then consolidates and ranks answers across chunks.

These details have not been verified by PyPI

Project description

ChunkRank: Model-Aware Chunking + Answer Ranking

Used internally for long-document QA and evaluation pipelines handling 1,000+ PDFs.

ChunkRank is a lightweight Python library that automatically chunks 
text based on an LLM’s tokenizer and context window, then consolidates
and ranks answers across chunks. In short ChunkRank is a model-aware text 
chunking and answer re-ranking library for LLM pipelines.

🔗 PyPI : https://pypi.org/project/chunkrank/

Why ChunkRank?

When working with LLMs, long documents must be split into chunks, but:

Every model has different tokenizers and context limits
Chunk sizes are usually hard-coded and error-prone
Answer quality drops when responses come from multiple chunks
Existing RAG frameworks are heavy when you only need chunking + ranking

ChunkRank solves this gap.

What It Does

✅Model-aware chunking

Pass a model name (gpt-4o-mini, claude-3.5-sonnet, Llama-3.1-8B etc.)
ChunkRank automatically:
- Selects the correct tokenizer
- Applies the correct context window
- Reserves token space for prompts and responses

No manual token math. No trial-and-error.

✅Answer consolidation & ranking

Query runs across multiple chunks
Multiple candidate answers are produced
ChunkRank re-ranks them to return the best answer Works standalone — no full RAG stack required.

Installation

pip install chunkrank

or for development:

poetry install

Quick Example

from chunkrank import ChunkRankPipeline

text = open("document.txt").read()

pipe = ChunkRankPipeline(model="gpt-4o-mini")

answer = pipe.process(
    question="What is the main topic of this document?",
    text=text
)

print(answer)

Core API

chunks = chunkrank.split(text, model="gpt-4o-mini")

answers = chunkrank.answer(question, chunks)

best_answer = chunkrank.rank(answers)

Supported Capabilities

Automatic model → tokenizer → context resolution
Token, sentence, and paragraph chunking strategies
Cross-encoder based answer re-ranking
Works with OpenAI, Anthropic, HF, Llama-based models
Drop-in utility for QA, summarization, extraction

How It Fits

Tool	What it does
LangChain / LlamaIndex	Full RAG pipelines
Haystack	End-to-end retrieval frameworks
ChunkRank	Focused, model-aware chunking + answer ranking

ChunkRank complements RAG frameworks — it doesn’t replace them.

Roadmap

Build the model registry (model → context window + tokenizer).
Implement chunking strategies (tokens, sentences, paragraphs).
Integrate a re-ranking engine (start with Hugging Face cross-encoder).
Package and release to PyPI with a simple API.

Community

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.2.0

Apr 23, 2026

1.1.3

Apr 20, 2026

1.1.2

Apr 20, 2026

1.1.1

Apr 20, 2026

1.1.0

Apr 20, 2026

1.0.0

Mar 10, 2026

0.2.4

Jan 15, 2026

0.2.3

Jan 15, 2026

This version

0.2.2

Jan 15, 2026

0.2.1

Jan 15, 2026

0.2.0

Jan 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chunkrank-0.2.2.tar.gz (6.4 kB view details)

Uploaded Jan 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chunkrank-0.2.2-py3-none-any.whl (9.0 kB view details)

Uploaded Jan 15, 2026 Python 3

File details

Details for the file chunkrank-0.2.2.tar.gz.

File metadata

Download URL: chunkrank-0.2.2.tar.gz
Upload date: Jan 15, 2026
Size: 6.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.14.2 Darwin/24.5.0

File hashes

Hashes for chunkrank-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`7ad968af7bfac13bb598741d492d4854063cbadcfc45c4eb72d4ad68a2053669`
MD5	`e1850014ed193ab3df43d429ab3baa20`
BLAKE2b-256	`9e56978e50fd5398399ff1f11d8e3f9415689a0fe1d05efd551bce4a96ece2d0`

See more details on using hashes here.

File details

Details for the file chunkrank-0.2.2-py3-none-any.whl.

File metadata

Download URL: chunkrank-0.2.2-py3-none-any.whl
Upload date: Jan 15, 2026
Size: 9.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.14.2 Darwin/24.5.0

File hashes

Hashes for chunkrank-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ff42b64f867617827b3ca90464fae4ccaa4be96be05b37202ca69565be19f1ae`
MD5	`5b9cbef1435c4658a23013a5260d19ad`
BLAKE2b-256	`b0790cd36262b36a2a1904e966c1e93543cdbb00fc60cb0db8461af2ee5d388f`

See more details on using hashes here.

chunkrank 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

ChunkRank: Model-Aware Chunking + Answer Ranking

Why ChunkRank?

What It Does

Installation

Quick Example

Core API

Supported Capabilities

How It Fits

Roadmap

Community

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes