3 projects
langidentify
Fast, high-accuracy language detection for Python. Uses ngram classification augmented with a topwords signal for improved short-text accuracy. Supports 80+ languages.
cjclassifier
Classify text as Chinese Simplified, Chinese Traditional, or Japanese using a statistical model.
langidentify-full-model
High-accuracy full model data for the LangIdentify language detection library. Install via pip install langidentify[full].