Skip to main content

No project description provided

Project description

Doain2Vect

Tf-Idf represents the importance a word is to a document in a latent vector space whereas the model doesn’t consider any semantic representation. But in a specific domain classification, the effect of a particular plays a vital role. On the other hand, the semantic representation model like word2vec, fasttext, gelove, don’t care about the frequency of the important word, they are the same for all latent space. But the important is that they carry semantic information for also unknown words in a latent vector space. To carry the semantic representation with frequency for unknown word representation in a sub-vector space of a domain, we propose a mathematical model from the trained presentation of frequency and semantic both. This model attempts to represent an unknown word from a fixed frequency trained model from another vector semantic representation. The vector space of frequency and semantic are different. But to sustain the importance of an unknown word, we convert the semantic meaning from the semantic vector space to the vector space of frequency.

This is a reserach and development of Hishab.ltd

Installation:

pip install domain2vec

Usage of doamin2vec:

    from domain2vec import domain2vec
	k = 20000 ## k is the number of cluster or feature that want to extract 
	vec = domain2Vec(ft, k)
	train=k.fit_transform(X_train)
	text = k.transform(X_text)

Project details


Release history Release notifications | RSS feed

This version

0.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

domain2vec-0.1-py3-none-any.whl (4.0 kB view details)

Uploaded Python 3

File details

Details for the file domain2vec-0.1-py3-none-any.whl.

File metadata

  • Download URL: domain2vec-0.1-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for domain2vec-0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 386007857ed422fbe189a86dc00376459a41ac96a665cc63bb182b8b8feccdf9
MD5 f7775e883a4925633759b3c79356fcb2
BLAKE2b-256 d084cdcf6c3e68c010ad6418cb68d1b33c2c0851eab1148fe3d98726bcf5906c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page