Natural Language Processing in Rust with Python bidings
Project description
vtext
This is a Python wrapper for the Rust vtext crate.
This package aims to provide a high performance toolkit for ingesting textual data for machine learning applications.
The API is currently unstable.
Features
- Tokenization: Regexp tokenizer, Unicode segmentation + language specific rules
- Stemming: Snowball (in Python 15-20x faster than NLTK)
- Token counting: converting token counts to sparse matrices for use
in machine learning libraries. Similar to
CountVectorizer
andHashingVectorizer
in scikit-learn but will less broad functionality. - Levenshtein edit distance; Sørensen-Dice, Jaro, Jaro Winkler string similarities
Installation
vtext requires Python 3.5+, numpy 1.15+ and can be installed with,
pip install --pre vtext
Documentation
Project documentation: vtext.io/doc/latest/index.html
License
vtext is released under the Apache License, Version 2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file vtext-0.1.0a3-cp37-cp37m-win_amd64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp37-cp37m-win_amd64.whl
- Upload date:
- Size: 691.7 kB
- Tags: CPython 3.7m, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e811d766493e341b2ef4575c1b91915ab5d1645a39a9a3a9a3dd8c8646316442 |
|
MD5 | f850770133fc5c2deaacd2de85cfbcad |
|
BLAKE2b-256 | 3d467115ccad739b2c7b98de49a3087cd7f19b67560feb83ebbe12b1cec96211 |
File details
Details for the file vtext-0.1.0a3-cp37-cp37m-manylinux1_x86_64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp37-cp37m-manylinux1_x86_64.whl
- Upload date:
- Size: 4.1 MB
- Tags: CPython 3.7m
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2825672e240dbf3ad5d029621426b5f42e3b53d4a33d477339173e086c2bd618 |
|
MD5 | fecd2b7d4c8e63a0ce9b9f2b5a09d040 |
|
BLAKE2b-256 | ad516adf21fdac9f45c22d2b43afe1334273b2f7eba5cec0cb391944469841f0 |
File details
Details for the file vtext-0.1.0a3-cp37-cp37m-macosx_10_9_x86_64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp37-cp37m-macosx_10_9_x86_64.whl
- Upload date:
- Size: 754.3 kB
- Tags: CPython 3.7m, macOS 10.9+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | acd8dc6da2b76d0f62044f00e7f903ea31fdac09d88e1b65a2cbf18f2e32d519 |
|
MD5 | aa7a5bc4297015eedc0b6197142c1d8f |
|
BLAKE2b-256 | 15f3546640f725255abdf4de9131a3ab3e0411705dd5037a8de7a859c206b7bd |
File details
Details for the file vtext-0.1.0a3-cp36-cp36m-win_amd64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp36-cp36m-win_amd64.whl
- Upload date:
- Size: 691.9 kB
- Tags: CPython 3.6m, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ba4afe74f34e8c934812a6a7277715ddcdc1d32185cecc6508ac03997df17f28 |
|
MD5 | ebfa5e0ea28704797e6daa17ccaa15bd |
|
BLAKE2b-256 | 0729e5c73066a79936c29c7d3115842fee482921a2530e6035764b0c7880e315 |
File details
Details for the file vtext-0.1.0a3-cp36-cp36m-manylinux1_x86_64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp36-cp36m-manylinux1_x86_64.whl
- Upload date:
- Size: 2.8 MB
- Tags: CPython 3.6m
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | db0fdbfb932f7d2836b485c52abbd3e0d7ac027e5aa35489d29e0108cd454283 |
|
MD5 | 8ca449cbdf7fd8bd23f684f5ac128842 |
|
BLAKE2b-256 | 52151bb4e87b72cb5fd55f24204119a25463d5badd18f7bdf9e4690d3a267194 |
File details
Details for the file vtext-0.1.0a3-cp36-cp36m-macosx_10_7_x86_64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp36-cp36m-macosx_10_7_x86_64.whl
- Upload date:
- Size: 754.4 kB
- Tags: CPython 3.6m, macOS 10.7+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 58f9cc78e4630113f6c1112e22186a63d9d2c77742f6a00bad6a7b2743ddd6e8 |
|
MD5 | 15b3e15d0a9b8985823d29d1d5246711 |
|
BLAKE2b-256 | 8282e94d4e2d3a58dbe875dc1e8d783344ba915e2b94eb334f7822368e676b2f |
File details
Details for the file vtext-0.1.0a3-cp35-cp35m-win_amd64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp35-cp35m-win_amd64.whl
- Upload date:
- Size: 691.9 kB
- Tags: CPython 3.5m, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d8f5d360a5a7a46304797a9c5cc87d6c43cb0bc374c84b5a0d536b4d5ff8e94 |
|
MD5 | c04b5029e6bf4bc9ed52fc70f4b627e2 |
|
BLAKE2b-256 | f7da3f7dfd62b8e65f13b27a1dde0fd57a522bdb97af0b159856e9e6657b9528 |
File details
Details for the file vtext-0.1.0a3-cp35-cp35m-manylinux1_x86_64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp35-cp35m-manylinux1_x86_64.whl
- Upload date:
- Size: 1.4 MB
- Tags: CPython 3.5m
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c6ceaaa3af37002f7827d17ae8e6834ce5e7152f3f7130871affff304ce84f78 |
|
MD5 | 9a919446497b7657d3828d228e7b07d5 |
|
BLAKE2b-256 | 61317afd205e2244108d8b179670bf6bfbf1d7f63c450a0db75c5cd98dbd7069 |
File details
Details for the file vtext-0.1.0a3-cp35-cp35m-macosx_10_6_x86_64.whl
.
File metadata
- Download URL: vtext-0.1.0a3-cp35-cp35m-macosx_10_6_x86_64.whl
- Upload date:
- Size: 754.7 kB
- Tags: CPython 3.5m, macOS 10.6+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 64d9699a6d39bd72f721e5cee495c0b5cf6c0abffc16e6bb2d7ee9fce55c7cdb |
|
MD5 | 08e940fa0647aec6e7cca83cbeb03a28 |
|
BLAKE2b-256 | 4911e3f8c14b061e3df0acbe972c8a82b57c3bfc94bbb2111a62cafc98b5218b |