Language detection using FastText and Spacy
Project description
spacy_fastlang
Install
Assuming you have a working python environment, you can simply install it using
pip install spacy_fastlang
Usage
The library exports a pipeline component called language_detector
that will set two spacy extensions
- doc._.language = ISO code of the detected language or
xx
as a fallback - doc._.language_score = confidence
import spacy_fastlang # noqa: F401 # pylint: disable=unused-import
nlp = spacy.load("...")
nlp.add_pipe("language_detector")
doc = nlp(en_text)
doc._.language == "..."
doc._.language_score >= ...
Options
Check the tests to see more examples and available options
License
Everythin is under MIT
except the default model which is distributed under Creative Commons Attribution-Share-Alike License 3.0 by facebook here
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
spacy_fastlang-2.0.0.tar.gz
(782.4 kB
view details)
Built Distribution
File details
Details for the file spacy_fastlang-2.0.0.tar.gz
.
File metadata
- Download URL: spacy_fastlang-2.0.0.tar.gz
- Upload date:
- Size: 782.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.8.17 Linux/5.15.0-1041-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2c8408a985b34c5fceb87f2261ddabb5341cd5c202809fc7e0d464187f54dbfb |
|
MD5 | dda54f919ec49e26fc53506226fc55b0 |
|
BLAKE2b-256 | 8d2a76bb5328496f5019e0d1ff5760c6ac30f867ada79fa42ed9d3c16ebde014 |
File details
Details for the file spacy_fastlang-2.0.0-py3-none-any.whl
.
File metadata
- Download URL: spacy_fastlang-2.0.0-py3-none-any.whl
- Upload date:
- Size: 780.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.5.1 CPython/3.8.17 Linux/5.15.0-1041-azure
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bed792486c8e8fe3c2c4e7566134f0980173525c200c0a5e285bcc98c9c7bd3c |
|
MD5 | 96c151c2c270741e87f103eb2dd6d0bb |
|
BLAKE2b-256 | 5094576ddc9dcf6841e0b7effbc0bda4f30696053e582b68407e888d6c98bb0e |