texta-crf-extractor
Project description
TEXTA CRF Extractor
Requirements
- Python >= 3.8
- SciPy installation for scikit-learn (requires BLAS & LAPACK system libraries).
Installation:
# For debian based systems (ex: debian:buster) to install binary dependencies.
apt-get update && apt-get install python3-scipy
# Install without MLP
pip install texta-crf-extractor
# Install with MLP
pip install texta-crf-extractor[mlp]
Usage:
from texta_crf_extractor.crf_extractor import CRFExtractor
from texta_mlp.mlp import MLP
mlp = MLP(language_codes=["en"], default_language_code="en")
# prepare data
texts = ["foo", "bar"]
mlp_docs = [mlp.process(text) for text in texts]
# create extractor
extractor = CRFExtractor(mlp=mlp)
# train the CRF model
extractor.train(mlp_docs)
# tag something
extractor.tag("Tere maailm!")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
texta-crf-extractor-2.2.0.tar.gz
(20.7 kB
view details)
File details
Details for the file texta-crf-extractor-2.2.0.tar.gz
.
File metadata
- Download URL: texta-crf-extractor-2.2.0.tar.gz
- Upload date:
- Size: 20.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.8.2 requests/2.25.1 setuptools/50.3.0 requests-toolbelt/0.10.1 tqdm/4.64.1 CPython/3.5.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b7cfe1fe5e7b0ba45a32a3eec819b3ea3a7aac067898cc7152cbca7588a538ab |
|
MD5 | ed67638b555dd41a74827e0d3ef22057 |
|
BLAKE2b-256 | 3017531bfc401a61e69dbff19269d0ebd21c59ef5731b236c65987941b168c0c |