Last released Sep 12, 2024
A Python package for token-aware HTML chunking that preserves structure and attributes, with optional cleaning and attribute length control.