Chunking utilities for GraphRAG
Project description
GraphRAG Chunking
This package contains a collection of text chunkers, a core config model, and a factory for acquiring instances.
Examples
Basic sentence chunking with nltk
The SentenceChunker class splits text into individual sentences by identifying sentence boundaries. It takes input text and returns a list where each element is a separate sentence, making it easy to process text at the sentence level.
Open the notebook to explore the basic sentence example code
Token chunking
The TokenChunker splits text into fixed-size chunks based on token count rather than sentence boundaries. It uses a tokenizer to encode text into tokens, then creates chunks of a specified size with configurable overlap between chunks.
Open the notebook to explore the token chunking example code
Using the factory via helper util
The create_chunker factory function provides a configuration-driven approach to instantiate chunkers by accepting a ChunkingConfig object that specifies the chunking strategy and parameters. This allows for more flexible and maintainable code by separating chunker configuration from direct instantiation.
Open the notebook to explore the factory helper util example code
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file graphrag_chunking-3.0.2.tar.gz.
File metadata
- Download URL: graphrag_chunking-3.0.2.tar.gz
- Upload date:
- Size: 7.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9f09979481039e42b68e64899cda3fa4c9dd991bec78e0bf74c85c875866ec95
|
|
| MD5 |
3a45a590cc5fc5c28f626fa2071256f0
|
|
| BLAKE2b-256 |
c15f427927db5bff5b89a818a6a1bd6ad23eddabd59590ff72265b994c7e7d87
|
File details
Details for the file graphrag_chunking-3.0.2-py3-none-any.whl.
File metadata
- Download URL: graphrag_chunking-3.0.2-py3-none-any.whl
- Upload date:
- Size: 9.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f3fc9f0aa22494545043ee86dc2f5569959eb50baa99d935ece3354ac424dc33
|
|
| MD5 |
f198d95058148748e0a73930db262faa
|
|
| BLAKE2b-256 |
54db28ae22ccd12761233baf2b4bb3b0a4a94571b50ff8b97022be6e1ab238a0
|