Look up the frequencies of words in many languages, based on many sources of data.
Fixes mojibake and other problems with Unicode, after the fact
An OrderedSet is a custom MutableSet that remembers its order, so that every
Tools for labeling human languages with IETF language tags
A library for representing floating point vectors in a compact, base64-like format
Labels and compares human languages in a standardized way -- Python 2 backport
Computes association strength over semantic networks in a dimensionality-reduced form.