Skip to main content

Package to create and use Simple Explainable Language Multiset Representations

Project description

This crate provides a library for generating and using simple text data structures that work like language models. The data structures do not use real-valued vector embeddings; instead they use the mathematical concept of multisets and are derived directly from plain text data.

The data structures are named Simple Explainable Language Multiset Representations (SELMRs) and consist of multisets created from all multi-word expressions and all multi-word-context combinations contained in a collection of documents given some contraints. The multisets can be used for downstream NLP tasks like text classifications and searching, in a similar manner as real-valued vector embeddings.

SELMRs produce explainable results without any randomness and enable explicit links with lexical, linguistical and terminological annotations. No model is trained and no dimensionality reduction is applied.

For information on how to use this package, please look here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

selmr-0.4.2.tar.gz (28.2 kB view details)

Uploaded Source

Built Distribution

selmr-0.4.2-cp38-abi3-win_amd64.whl (1.4 MB view details)

Uploaded CPython 3.8+ Windows x86-64

File details

Details for the file selmr-0.4.2.tar.gz.

File metadata

  • Download URL: selmr-0.4.2.tar.gz
  • Upload date:
  • Size: 28.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.5.1

File hashes

Hashes for selmr-0.4.2.tar.gz
Algorithm Hash digest
SHA256 212b1b09f0d14e7df1211c65afd6470b578a9486fda681004a7fa34a86287c31
MD5 41a87b7332355f584573b35e11d2e5cd
BLAKE2b-256 c0026b71fa86dcb66e18ed297a06fff44a94673557c6ab0dd0b9e6eef150ae1e

See more details on using hashes here.

File details

Details for the file selmr-0.4.2-cp38-abi3-win_amd64.whl.

File metadata

  • Download URL: selmr-0.4.2-cp38-abi3-win_amd64.whl
  • Upload date:
  • Size: 1.4 MB
  • Tags: CPython 3.8+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.5.1

File hashes

Hashes for selmr-0.4.2-cp38-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 69411ee1eb55d02af84168884e97a4fe7ee98dd9224cd4f13cd58fd86c877554
MD5 2205feb45db9e1ec059e6e922fbb346e
BLAKE2b-256 8d01271b5286b63a5a3969a354ed5bd8c582366e99928c5a36344f21c64eeb2d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page