Skip to main content

Package to create and use Simple Explainable Language Multiset Representations

Project description

This crate provides a library for generating and using simple text data structures that work like language models. The data structures do not use real-valued vector embeddings; instead they use the mathematical concept of multisets and are derived directly from plain text data.

The data structures are named Simple Explainable Language Multiset Representations (SELMRs) and consist of multisets created from all multi-word expressions and all multi-word-context combinations contained in a collection of documents given some contraints. The multisets can be used for downstream NLP tasks like text classifications and searching, in a similar manner as real-valued vector embeddings.

SELMRs produce explainable results without any randomness and enable explicit links with lexical, linguistical and terminological annotations. No model is trained and no dimensionality reduction is applied.

For information on how to use this package, please look here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

selmr-0.4.1.tar.gz (28.0 kB view details)

Uploaded Source

Built Distribution

selmr-0.4.1-cp310-none-win_amd64.whl (1.4 MB view details)

Uploaded CPython 3.10 Windows x86-64

File details

Details for the file selmr-0.4.1.tar.gz.

File metadata

  • Download URL: selmr-0.4.1.tar.gz
  • Upload date:
  • Size: 28.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.5.1

File hashes

Hashes for selmr-0.4.1.tar.gz
Algorithm Hash digest
SHA256 262e3456a2747e412a5eb4e08201315d6ee49c49937c75bbd80b36fcdc02e602
MD5 c0e0fabc89b9bbacd62622d6cb8bcf35
BLAKE2b-256 69ba8dbb761a6bb9f9521c40e9ad16739db1775ed6bd0f7dfaced4cf3a2e110d

See more details on using hashes here.

File details

Details for the file selmr-0.4.1-cp310-none-win_amd64.whl.

File metadata

  • Download URL: selmr-0.4.1-cp310-none-win_amd64.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: CPython 3.10, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.5.1

File hashes

Hashes for selmr-0.4.1-cp310-none-win_amd64.whl
Algorithm Hash digest
SHA256 945f4837043a3528047d03910657b6b3bd6bb58ec569d8a1effbbe45cb5e1490
MD5 4e71f9e7143877fe986a9ef19236a5e8
BLAKE2b-256 55951fa822c0fedbebc304148cd20c9635d9e463e22cea9ea4a8a51e94978aa4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page