Skip to main content

Natural Language Processing in Rust with Python bidings

Project description

vtext

This is a Python wrapper for the Rust vtext crate.

This package aims to provide a high performance toolkit for ingesting textual data for machine learning applications.

The API is currently unstable.

Features

  • Tokenization: Regexp tokenizer, Unicode segmentation + language specific rules
  • Stemming: Snowball (in Python 15-20x faster than NLTK)
  • Token counting: converting token counts to sparse matrices for use in machine learning libraries. Similar to CountVectorizer and HashingVectorizer in scikit-learn but will less broad functionality.
  • Levenshtein edit distance; Sørensen-Dice, Jaro, Jaro Winkler string similarities

Installation

vtext requires Python 3.5+, numpy 1.15+ and can be installed with,

pip install --pre vtext

Documentation

Project documentation: vtext.io/doc/latest/index.html

License

vtext is released under the Apache License, Version 2.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

vtext-0.1.0a3-cp37-cp37m-win_amd64.whl (691.7 kB view details)

Uploaded CPython 3.7m Windows x86-64

vtext-0.1.0a3-cp37-cp37m-manylinux1_x86_64.whl (4.1 MB view details)

Uploaded CPython 3.7m

vtext-0.1.0a3-cp37-cp37m-macosx_10_9_x86_64.whl (754.3 kB view details)

Uploaded CPython 3.7m macOS 10.9+ x86-64

vtext-0.1.0a3-cp36-cp36m-win_amd64.whl (691.9 kB view details)

Uploaded CPython 3.6m Windows x86-64

vtext-0.1.0a3-cp36-cp36m-manylinux1_x86_64.whl (2.8 MB view details)

Uploaded CPython 3.6m

vtext-0.1.0a3-cp36-cp36m-macosx_10_7_x86_64.whl (754.4 kB view details)

Uploaded CPython 3.6m macOS 10.7+ x86-64

vtext-0.1.0a3-cp35-cp35m-win_amd64.whl (691.9 kB view details)

Uploaded CPython 3.5m Windows x86-64

vtext-0.1.0a3-cp35-cp35m-manylinux1_x86_64.whl (1.4 MB view details)

Uploaded CPython 3.5m

vtext-0.1.0a3-cp35-cp35m-macosx_10_6_x86_64.whl (754.7 kB view details)

Uploaded CPython 3.5m macOS 10.6+ x86-64

File details

Details for the file vtext-0.1.0a3-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 691.7 kB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 e811d766493e341b2ef4575c1b91915ab5d1645a39a9a3a9a3dd8c8646316442
MD5 f850770133fc5c2deaacd2de85cfbcad
BLAKE2b-256 3d467115ccad739b2c7b98de49a3087cd7f19b67560feb83ebbe12b1cec96211

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 4.1 MB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 2825672e240dbf3ad5d029621426b5f42e3b53d4a33d477339173e086c2bd618
MD5 fecd2b7d4c8e63a0ce9b9f2b5a09d040
BLAKE2b-256 ad516adf21fdac9f45c22d2b43afe1334273b2f7eba5cec0cb391944469841f0

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp37-cp37m-macosx_10_9_x86_64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp37-cp37m-macosx_10_9_x86_64.whl
  • Upload date:
  • Size: 754.3 kB
  • Tags: CPython 3.7m, macOS 10.9+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp37-cp37m-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 acd8dc6da2b76d0f62044f00e7f903ea31fdac09d88e1b65a2cbf18f2e32d519
MD5 aa7a5bc4297015eedc0b6197142c1d8f
BLAKE2b-256 15f3546640f725255abdf4de9131a3ab3e0411705dd5037a8de7a859c206b7bd

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 691.9 kB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 ba4afe74f34e8c934812a6a7277715ddcdc1d32185cecc6508ac03997df17f28
MD5 ebfa5e0ea28704797e6daa17ccaa15bd
BLAKE2b-256 0729e5c73066a79936c29c7d3115842fee482921a2530e6035764b0c7880e315

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 2.8 MB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 db0fdbfb932f7d2836b485c52abbd3e0d7ac027e5aa35489d29e0108cd454283
MD5 8ca449cbdf7fd8bd23f684f5ac128842
BLAKE2b-256 52151bb4e87b72cb5fd55f24204119a25463d5badd18f7bdf9e4690d3a267194

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp36-cp36m-macosx_10_7_x86_64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp36-cp36m-macosx_10_7_x86_64.whl
  • Upload date:
  • Size: 754.4 kB
  • Tags: CPython 3.6m, macOS 10.7+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp36-cp36m-macosx_10_7_x86_64.whl
Algorithm Hash digest
SHA256 58f9cc78e4630113f6c1112e22186a63d9d2c77742f6a00bad6a7b2743ddd6e8
MD5 15b3e15d0a9b8985823d29d1d5246711
BLAKE2b-256 8282e94d4e2d3a58dbe875dc1e8d783344ba915e2b94eb334f7822368e676b2f

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 691.9 kB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 5d8f5d360a5a7a46304797a9c5cc87d6c43cb0bc374c84b5a0d536b4d5ff8e94
MD5 c04b5029e6bf4bc9ed52fc70f4b627e2
BLAKE2b-256 f7da3f7dfd62b8e65f13b27a1dde0fd57a522bdb97af0b159856e9e6657b9528

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 c6ceaaa3af37002f7827d17ae8e6834ce5e7152f3f7130871affff304ce84f78
MD5 9a919446497b7657d3828d228e7b07d5
BLAKE2b-256 61317afd205e2244108d8b179670bf6bfbf1d7f63c450a0db75c5cd98dbd7069

See more details on using hashes here.

File details

Details for the file vtext-0.1.0a3-cp35-cp35m-macosx_10_6_x86_64.whl.

File metadata

  • Download URL: vtext-0.1.0a3-cp35-cp35m-macosx_10_6_x86_64.whl
  • Upload date:
  • Size: 754.7 kB
  • Tags: CPython 3.5m, macOS 10.6+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.6.2 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.7.1

File hashes

Hashes for vtext-0.1.0a3-cp35-cp35m-macosx_10_6_x86_64.whl
Algorithm Hash digest
SHA256 64d9699a6d39bd72f721e5cee495c0b5cf6c0abffc16e6bb2d7ee9fce55c7cdb
MD5 08e940fa0647aec6e7cca83cbeb03a28
BLAKE2b-256 4911e3f8c14b061e3df0acbe972c8a82b57c3bfc94bbb2111a62cafc98b5218b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page