texta-crf-extractor
Project description
TEXTA CRF Extractor
Requirements
- Python >= 3.8
- SciPy installation for scikit-learn (requires BLAS & LAPACK system libraries).
Installation:
# For debian based systems (ex: debian:buster) to install binary dependencies.
apt-get update && apt-get install python3-scipy
# Install without MLP
pip install texta-crf-extractor
# Install with MLP
pip install texta-crf-extractor[mlp]
Usage:
from texta_crf_extractor.crf_extractor import CRFExtractor
from texta_mlp.mlp import MLP
mlp = MLP(language_codes=["en"], default_language_code="en")
# prepare data
texts = ["foo", "bar"]
mlp_docs = [mlp.process(text) for text in texts]
# create extractor
extractor = CRFExtractor(mlp=mlp)
# train the CRF model
extractor.train(mlp_docs)
# tag something
extractor.tag("Tere maailm!")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
texta-crf-extractor-2.0.1.tar.gz
(20.8 kB
view details)
File details
Details for the file texta-crf-extractor-2.0.1.tar.gz
.
File metadata
- Download URL: texta-crf-extractor-2.0.1.tar.gz
- Upload date:
- Size: 20.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d5333c0fd04fc6cdd12c45795d6c9b7f733190f54fa9649d9ecdec59530205c9 |
|
MD5 | 5bf00fcfb89d66f957d709d06afeec79 |
|
BLAKE2b-256 | 86e165b846e87a053114c5b1b548945816c670915579a8970fcb3ccb8a4178e6 |