No project description provided
Project description
GeneVecTools
Reading in Variety of Genetic File Types
Vector Embedding Algorithms
Byte Array Encoders
Clustering and Preprocessing Steps for Compression
Similarity Search Tools for FASTA/FASTQ files
Installing
Tester files: https://tinyurl.com/cDNALibraryExampleFiles
.. code-block:: bash
pip install GeneVecTools
Usage
.. code-block:: bash
>>> from GeneVecTools import SimSearch
"""
file is location of the "small_cDNA_Sequences_pbmc_1k_v2_S1_L002_R2_001.fastq"
that you downloaded from https://tinyurl.com/cDNALibraryExampleFiles
if it is in current directory, just use file name
"""
>>> file = "small_cDNA_Sequences_pbmc_1k_v2_S1_L002_R2_001.fastq"
>>> VECSS = SimSearch.VecSS(f=dir, length=10000, encoding="one-hot-encoding",bits=8)
>>> sequences = VECSS.readq()
>>> embedded = VECSS.embed(VECSS.s)
>>> print(embedded )
>>> D, I, time = VECSS.run_search()
>>> print(D,I,time)
>>> print(VECSS.unembed(VECSS.embed(VECSS.s)) == VECSS.s)
'True'
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
GeneVecTools-1.25.tar.gz
(7.3 kB
view details)
File details
Details for the file GeneVecTools-1.25.tar.gz.
File metadata
- Download URL: GeneVecTools-1.25.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.10.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0509e69b5a1a47fb59c1f7a4ceffb1e0eb0b17fd4d2aa7565a1f9c7bb9d8a869
|
|
| MD5 |
2a6dae463a14aeaff9ff275a3ddb22d0
|
|
| BLAKE2b-256 |
ea041a5271e63d6176b1935325fe9306537993530798d30bc848960c6f8e7091
|