Package to create and use Simple Explainable Language Multiset Representations
Project description
This crate provides a library for generating and using simple text data structures that work like language models. The data structures do not use real-valued vector embeddings; instead they use the mathematical concept of multisets and are derived directly from plain text data.
The data structures are named Simple Explainable Language Multiset Representations (SELMRs) and consist of multisets created from all multi-word expressions and all multi-word-context combinations contained in a collection of documents given some contraints. The multisets can be used for downstream NLP tasks like text classifications and searching, in a similar manner as real-valued vector embeddings.
SELMRs produce explainable results without any randomness and enable explicit links with lexical, linguistical and terminological annotations. No model is trained and no dimensionality reduction is applied.
For information on how to use this package, please look here.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file selmr-0.4.1.tar.gz
.
File metadata
- Download URL: selmr-0.4.1.tar.gz
- Upload date:
- Size: 28.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.5.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 262e3456a2747e412a5eb4e08201315d6ee49c49937c75bbd80b36fcdc02e602 |
|
MD5 | c0e0fabc89b9bbacd62622d6cb8bcf35 |
|
BLAKE2b-256 | 69ba8dbb761a6bb9f9521c40e9ad16739db1775ed6bd0f7dfaced4cf3a2e110d |
File details
Details for the file selmr-0.4.1-cp310-none-win_amd64.whl
.
File metadata
- Download URL: selmr-0.4.1-cp310-none-win_amd64.whl
- Upload date:
- Size: 1.4 MB
- Tags: CPython 3.10, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/1.5.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 945f4837043a3528047d03910657b6b3bd6bb58ec569d8a1effbbe45cb5e1490 |
|
MD5 | 4e71f9e7143877fe986a9ef19236a5e8 |
|
BLAKE2b-256 | 55951fa822c0fedbebc304148cd20c9635d9e463e22cea9ea4a8a51e94978aa4 |