texta-crf-extractor
Project description
TEXTA CRF Extractor
Requirements
- Python >= 3.8
- SciPy installation for scikit-learn (requires BLAS & LAPACK system libraries).
Installation:
# For debian based systems (ex: debian:buster) to install binary dependencies.
apt-get update && apt-get install python3-scipy
# Install without MLP
pip install texta-crf-extractor
# Install with MLP
pip install texta-crf-extractor[mlp]
Usage:
from texta_crf_extractor.crf_extractor import CRFExtractor
from texta_mlp.mlp import MLP
mlp = MLP(language_codes=["en"], default_language_code="en")
# prepare data
texts = ["foo", "bar"]
mlp_docs = [mlp.process(text) for text in texts]
# create extractor
extractor = CRFExtractor(mlp=mlp)
# train the CRF model
extractor.train(mlp_docs)
# tag something
extractor.tag("Tere maailm!")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file texta-crf-extractor-1.0.16.tar.gz
.
File metadata
- Download URL: texta-crf-extractor-1.0.16.tar.gz
- Upload date:
- Size: 20.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f233e8758861da52127e41ef93f7715a315c19ce606a1d7dd8d6825030ff95b3 |
|
MD5 | d8033abc45858aec5374648f4775d644 |
|
BLAKE2b-256 | 18b40bada1baef5d91c353055609b08d5e2a5cfa653e5ad4d3e35d8f592cc5c2 |