Skip to main content

No project description provided

Project description

GeneVecTools

Reading in Variety of Genetic File Types

Vector Embedding Algorithms

Byte Array Encoders

Clustering and Preprocessing Steps for Compression

Similarity Search Tools for FASTA/FASTQ files

Installing

Tester files: https://tinyurl.com/cDNALibraryExampleFiles

.. code-block:: bash

pip install GeneVecTools

Usage

.. code-block:: bash

>>> from GeneVecTools import SimSearch

"""
file is location of the "small_cDNA_Sequences_pbmc_1k_v2_S1_L002_R2_001.fastq" 
that you downloaded from https://tinyurl.com/cDNALibraryExampleFiles
if it is in current directory, just use file name
"""
>>> file = "small_cDNA_Sequences_pbmc_1k_v2_S1_L002_R2_001.fastq"
>>> VECSS = SimSearch.VecSS(f=dir, length=10000, encoding="one-hot-encoding",bits=8)
>>> sequences = VECSS.readq()

>>> embedded = VECSS.embed(VECSS.s)
>>> print(embedded )

>>> D, I, time = VECSS.run_search()
>>> print(D,I,time)

>>> print(VECSS.unembed(VECSS.embed(VECSS.s)) == VECSS.s)

'True'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

GeneVecTools-1.25.tar.gz (7.3 kB view details)

Uploaded Source

File details

Details for the file GeneVecTools-1.25.tar.gz.

File metadata

  • Download URL: GeneVecTools-1.25.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.10.2

File hashes

Hashes for GeneVecTools-1.25.tar.gz
Algorithm Hash digest
SHA256 0509e69b5a1a47fb59c1f7a4ceffb1e0eb0b17fd4d2aa7565a1f9c7bb9d8a869
MD5 2a6dae463a14aeaff9ff275a3ddb22d0
BLAKE2b-256 ea041a5271e63d6176b1935325fe9306537993530798d30bc848960c6f8e7091

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page