Language detection library ported from Google's language-detection.
langdetect
[![Build Status](https://travis-ci.org/Mimino666/langdetect.svg?branch=master)](https://travis-ci.org/Mimino666/langdetect)
Port of [language-detection](https://code.google.com/p/language-detection/) (version from 03/03/2014) library to Python.
Installation
$ pip install langdetect
Supported Python versions: 2.6, 2.7, 3.x.
Basic usage
To detect the language of the text:
```python
>>> from langdetect import detect
>>> detect("War doesn't show who's right, just who's left.")
'en'
>>> detect("Ein, zwei, drei, vier")
'de'
```
To find out the probabilities for the top languages:
```python
>>> from langdetect import detect_langs
>>> detect_langs("Otec matka syn.")
[sk:0.572770823327, pl:0.292872522702, cs:0.134356653968]
```
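When the top probability is low, as in the example above, it can be useful to treat the text as undetermined rather than trust the first entry. The following is a minimal sketch of that idea, not part of the library: `best_guess` and its `min_prob` threshold are hypothetical names, and `Candidate` is a stand-in so the snippet runs without langdetect installed. It assumes each result exposes `lang` and `prob` attributes, as the objects returned by `detect_langs` do.

```python
from collections import namedtuple


def best_guess(candidates, min_prob=0.9):
    """Return the most probable language code, or None if it is below min_prob.

    `candidates` is any iterable of objects with `lang` and `prob` attributes.
    The helper name and the 0.9 default are illustrative choices.
    """
    candidates = list(candidates)
    if not candidates:
        return None
    top = max(candidates, key=lambda c: c.prob)
    return top.lang if top.prob >= min_prob else None


# Stand-in results mirroring the output shown above (values rounded).
Candidate = namedtuple("Candidate", ["lang", "prob"])
sample = [Candidate("sk", 0.57), Candidate("pl", 0.29), Candidate("cs", 0.13)]

print(best_guess(sample))                # None
print(best_guess(sample, min_prob=0.5))  # sk
```

With the library installed, the same helper could be called as `best_guess(detect_langs(text))`.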
IMPORTANT
The language detection algorithm is non-deterministic: if you run it on text that is too short or too ambiguous, you might get different results every time you run it.
To enforce consistent results, run the following code before the first language detection:
```python
from langdetect import DetectorFactory
DetectorFactory.seed = 0
```
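One way to check whether a given input is affected is to run detection several times and compare the results. The sketch below is illustrative only: `is_stable` and `make_toy_detector` are hypothetical helpers, and the toy detector is a stand-in that picks a label at random. It does not reproduce the library's actual algorithm; it only shows why fixing a seed makes repeated calls agree.

```python
import random


def is_stable(detect_fn, text, runs=10):
    """Return True if detect_fn returns the same label on every run."""
    return len({detect_fn(text) for _ in range(runs)}) == 1


def make_toy_detector(seed=None):
    """Build a toy 'detector' that chooses a label at random (assumption:
    this only mimics an ambiguous input, it is not langdetect's logic)."""
    def toy_detect(text):
        # A fixed seed restarts the generator identically on each call;
        # seed=None draws fresh entropy, so results may differ between calls.
        rng = random.Random(seed)
        return rng.choice(["sk", "pl", "cs"])
    return toy_detect


print(is_stable(make_toy_detector(seed=0), "Otec matka syn."))  # True
```

With the library installed, passing `detect` as `detect_fn` after setting `DetectorFactory.seed = 0` would apply the same check to real detections.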
More information
This library is a direct port of [language-detection](https://code.google.com/p/language-detection/) from Java to Python. All the classes and methods (even unit tests) are unchanged, so for more information see the project’s website or wiki.