Feature Engineering for Textual Data

Project description

TextVectorizer

A library for representation learning of text using Transformers such as BERT, ALBERT, and RoBERTa, together with spaCy.

Text Annotation

>>> from TextVectorizer import Vectorizer
>>> vec = Vectorizer('bert')
>>> for i in vec.annotate('Hi I am Rahul'):
...     print(i.text, i.pos_)
Hi INTJ
I PRON
am AUX
Rahul PROPN

Document Similarity

>>> from TextVectorizer import Vectorizer
>>> vec = Vectorizer()
>>> doc1 = 'Apple is a company'
>>> doc2 = 'Apple is fruit'
>>> vec.similarity(doc1, doc2)
0.622238214831199
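The project description does not say how similarity() computes its score. A common choice for comparing document embeddings is cosine similarity, so the following is a minimal sketch under that assumption, not the library's actual implementation. The function name cosine_similarity and the toy vectors are hypothetical and only illustrate the idea.

```python
import math

def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Convention chosen for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_u * norm_v)

# Toy 3-dimensional stand-ins for document vectors (assumed values);
# real embeddings typically have hundreds of dimensions.
doc1_vec = [1.0, 2.0, 0.0]
doc2_vec = [1.0, 0.0, 2.0]
score = cosine_similarity(doc1_vec, doc2_vec)
print(round(score, 3))
```

Under this measure, vectors pointing in the same direction score 1.0 and orthogonal vectors score 0.0, so a value such as 0.62 would indicate partial overlap between the two documents.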

Download files

Source Distribution

TextVectorizer-0.1.0.tar.gz (3.1 kB)

Uploaded Source

Built Distribution

TextVectorizer-0.1.0-py3-none-any.whl (4.6 kB)

Uploaded Python 3

File details

Details for the file TextVectorizer-0.1.0.tar.gz.

File metadata

  • Download URL: TextVectorizer-0.1.0.tar.gz
  • Upload date:
  • Size: 3.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.7.3

File hashes

Hashes for TextVectorizer-0.1.0.tar.gz
Algorithm    Hash digest
SHA256       2a239c028e1fadfbfcefc04a0c85cc4ffd0951e94981377303e0d7e8e86f30f6
MD5          5150ea8e8059b79dd08c9fb5175e0445
BLAKE2b-256  98bb959634fc09cf7a9ae9cc83e1b35804533d186332ddb50a8760fdd25425b6
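
The digests listed for each file can be used to check that a download arrived intact. The following is a minimal sketch using Python's standard hashlib module. The helper names sha256_of_file and matches_expected are hypothetical, and the commented usage line assumes the source distribution was saved to the current directory.

```python
import hashlib

def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel keeps reading until f.read returns b"".
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def matches_expected(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()

# SHA256 digest listed for TextVectorizer-0.1.0.tar.gz.
EXPECTED_SDIST_SHA256 = (
    "2a239c028e1fadfbfcefc04a0c85cc4ffd0951e94981377303e0d7e8e86f30f6"
)

# Usage, assuming the file has been downloaded to the current directory:
# matches_expected("TextVectorizer-0.1.0.tar.gz", EXPECTED_SDIST_SHA256)
```

Reading in chunks keeps memory use constant regardless of file size, which matters little for a 3.1 kB archive but makes the helper reusable for larger downloads.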

File details

Details for the file TextVectorizer-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: TextVectorizer-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.7.3

File hashes

Hashes for TextVectorizer-0.1.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       e3de3fc5b13489eb2515f2ed475e78969655d16e0e2de2544829ff789bb61a45
MD5          f5e18e8ec92407f329f4a0ff4c2b0b2f
BLAKE2b-256  4f02f004578a271a1f667d4d2f225e9c91aed0cf5a574fa5d22f94f4803432e6
