Skip to main content

Domains as words, genomes as documents.

Project description

nanotext

Name inspired by fastText.

Run tests:

cd .../nanotext/
pytest  # or python setup.py test
TODO: turn embedding in gensim fmt into something similar to glove (bin?)
would then be easier to train/ unfreeze and then save again, to be loaded w/ some model eg for PUL prediction

https://github.com/plasticityai/magnitude

nanotext compute

takes annotation (load domains) and our model and computes vector

nanotext train corpus model

nanotext compare ...

nanotext search ...

nanotext taxonomy (calculate a gtdb based taxonomy and get closest functional genome and use that or distance)

like sourmash really



check sourmash publication

nanotext predict model=medium
nanotext predict model=pul

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for nanotext, version 0.0.1
Filename, size File type Python version Upload date Hashes
Filename, size nanotext-0.0.1-py3-none-any.whl (7.3 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size nanotext-0.0.1.tar.gz (5.7 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page