Domains as words, genomes as documents.
Project description
nanotext
Name inspired by fastText
.
Run tests:
cd .../nanotext/
pytest # or python setup.py test
TODO: turn embedding in gensim fmt into something similar to glove (bin?)
would then be easier to train/ unfreeze and then save again, to be loaded w/ some model eg for PUL prediction
https://github.com/plasticityai/magnitude
nanotext compute
takes annotation (load domains) and our model and computes vector
nanotext train corpus model
nanotext compare ...
nanotext search ...
nanotext taxonomy (calculate a gtdb based taxonomy and get closest functional genome and use that or distance)
like sourmash really
check sourmash publication
nanotext predict model=medium
nanotext predict model=pul
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
nanotext-0.0.1.tar.gz
(5.7 kB
view details)
Built Distribution
File details
Details for the file nanotext-0.0.1.tar.gz
.
File metadata
- Download URL: nanotext-0.0.1.tar.gz
- Upload date:
- Size: 5.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a3da6978f2b1f71869130b719f018b3432207399e112e6e45e55d9ad596cbe59 |
|
MD5 | a948c01a129007045303abdd44d47761 |
|
BLAKE2b-256 | e7da90da39255f82c9544d8f6b6fc319b42a7768332f09b999005c01f1e426bd |
File details
Details for the file nanotext-0.0.1-py3-none-any.whl
.
File metadata
- Download URL: nanotext-0.0.1-py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ce4c958c3022e48cf838efe4bb31570600c8d1c98ef121b9453967f4357df2a4 |
|
MD5 | fdef67ca0bd2f0ad1742fe491128f055 |
|
BLAKE2b-256 | 61f7b4c6f2b9399a865488322001a968592ea4af5d85b70fb7e4c448c0d9c36f |