Domains as words, genomes as documents.

nanotext

Name inspired by fastText.

Run tests:

cd .../nanotext/
pytest  # or python setup.py test

TODO: convert the embedding from gensim format into something similar to GloVe (binary?). It would then be easier to train or unfreeze the embedding and save it again, so it can be loaded together with some model, e.g. for PUL prediction.

https://github.com/plasticityai/magnitude
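
As a rough illustration of the TODO above, the following sketch writes an embedding to a GloVe-style plain-text layout (one token per line, followed by its vector components) and reads it back. Everything here is an assumption made for illustration: the helper names `write_glove_text` and `read_glove_text` are hypothetical, the Pfam-style accessions are example tokens only, and a binary layout (as the TODO hints) may turn out to be the better choice.

```python
import io


def write_glove_text(vectors, handle):
    """Write {token: [floats]} as one 'token v1 v2 ...' line per token."""
    for token, vec in vectors.items():
        components = " ".join(repr(float(x)) for x in vec)
        handle.write(token + " " + components + "\n")


def read_glove_text(handle):
    """Parse the same layout back into a dict mapping token to float list."""
    vectors = {}
    for line in handle:
        parts = line.rstrip("\n").split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


# Hypothetical toy embedding; the accessions are example tokens only.
toy = {"PF00005": [0.1, -0.2, 0.3], "PF00072": [0.0, 0.5, -0.5]}

buffer = io.StringIO()  # stands in for a file on disk
write_glove_text(toy, buffer)
buffer.seek(0)
roundtrip = read_glove_text(buffer)
```

A plain-text layout is easy to inspect and to load from other frameworks, at the cost of larger files than a binary format.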

nanotext compute

Takes an annotation (loads its domains) and our model, and computes a vector for the genome.
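
To make the idea concrete, here is a minimal sketch of turning a genome's domain list into a single vector. It is not nanotext's actual method: averaging per-domain vectors is a deliberately simple stand-in for whatever the trained model does, and the function name `genome_vector`, the toy embedding, and the example accessions are all hypothetical.

```python
def genome_vector(domains, embedding):
    """Average the vectors of all known domains; unknown domains are skipped."""
    known = [embedding[d] for d in domains if d in embedding]
    if not known:
        raise ValueError("no domain in the annotation is present in the embedding")
    dim = len(known[0])
    return [sum(vec[i] for vec in known) / len(known) for i in range(dim)]


# Hypothetical two-dimensional embedding of three domains.
embedding = {
    "PF00005": [1.0, 0.0],
    "PF00072": [0.0, 1.0],
    "PF00528": [1.0, 1.0],
}

# The genome is treated as a "document" whose "words" are its domains.
# "PF99999" is not in the embedding and is ignored.
annotation = ["PF00005", "PF00072", "PF99999"]
vector = genome_vector(annotation, embedding)
```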

nanotext train corpus model

nanotext compare ...
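
The arguments of this command are not specified here. As a sketch of what comparing two genome vectors could look like, the snippet below uses cosine similarity. That choice of measure is an assumption, as is the function name `cosine_similarity`.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / norm


# Two hypothetical genome vectors pointing in the same direction,
# and one orthogonal pair.
same_direction = cosine_similarity([1.0, 2.0], [2.0, 4.0])
orthogonal = cosine_similarity([1.0, 0.0], [0.0, 1.0])
```

Cosine similarity ignores vector length, so two genomes with the same functional profile score close to 1 regardless of the magnitude of their vectors.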

nanotext search ...
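
Again, the command's arguments are left open. One plausible reading is a nearest-neighbour lookup: rank a collection of genome vectors by similarity to a query vector. The sketch below assumes cosine similarity and a small in-memory index; the names `search`, `index`, and `genome_a` to `genome_c` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero, equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def search(query, index, top_n=3):
    """Return the top_n (name, similarity) pairs, most similar first."""
    scored = [(name, cosine(query, vec)) for name, vec in index.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:top_n]


# Hypothetical index of precomputed genome vectors.
index = {
    "genome_a": [1.0, 0.0],
    "genome_b": [0.7, 0.7],
    "genome_c": [0.0, 1.0],
}

hits = search([1.0, 0.1], index, top_n=2)
```

A linear scan like this is fine for a few thousand genomes; larger collections would likely call for an approximate nearest-neighbour index.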

nanotext taxonomy (calculate a GTDB-based taxonomy, find the closest functional genome, and use either that genome or the distance to it)

Much like sourmash, really.

Check the sourmash publication.

nanotext predict model=medium
nanotext predict model=pul

Files for nanotext, version 0.0.1:

nanotext-0.0.1-py3-none-any.whl (7.3 kB), wheel, Python version py3
nanotext-0.0.1.tar.gz (5.7 kB), source