Last released Jul 22, 2024
A custom tokenizer for Swahili text using syllabic vocabulary with byte fallback.
Supported by