Automate the download of DNA sequences from NCBI, sort them by taxonomy, and filter them by gene name (provided as a regular expression).
nsdpy
- Introduction
- Workflow
- Quick start
- Usage
- Authors and acknowledgment
- Support
- License
- More Documentation
Introduction
nsdpy (nucleotide or NCBI sequence downloader) aims to ease the download and sorting of large batches of DNA sequences from the NCBI database. It can also be useful for filtering sequences based on their annotations. With nsdpy the user can:
- Search the NCBI nucleotide database
- Download the fasta files or the cds_fasta files corresponding to the result of the search
- Sort the sequences based on their taxonomy
- Select coding sequences from cds files by gene name, using one or more regular expressions. This helps the user retrieve sequences whose gene name is annotated in a field other than the expected one.
- Retrieve the taxonomic information and add it to the output sequences.
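The regular-expression filtering described above can be illustrated with a minimal sketch. This is not nsdpy's own implementation: the helper names `parse_fasta` and `filter_by_patterns`, the sample records, and the patterns are all hypothetical. The sketch assumes headers in the bracketed `[gene=...] [protein=...]` style used in NCBI cds_fasta files, and it matches each pattern against the whole header so that a gene name annotated in another field is still found.

```python
import re
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def parse_fasta(text: str) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_patterns(records: Iterable[Record], patterns: Iterable[str]) -> List[Record]:
    """Keep records whose header matches at least one regular expression."""
    compiled = [re.compile(p, re.IGNORECASE) for p in patterns]
    return [(h, s) for h, s in records if any(rx.search(h) for rx in compiled)]


# Hypothetical sample data; the third record has no [gene=...] field.
SAMPLE = """\
>lcl|X_cds_1 [gene=COX1] [protein=cytochrome c oxidase subunit I]
ATGTTC
GCAAAC
>lcl|X_cds_2 [gene=ND2] [protein=NADH dehydrogenase subunit 2]
ATGAAT
>lcl|X_cds_3 [protein=cytochrome oxidase subunit 1]
ATGGCA
"""

# Hypothetical patterns: the gene symbol plus a protein-name variant.
PATTERNS = [r"COX1", r"cytochrome (c )?oxidase subunit (1|I)\b"]

matches = filter_by_patterns(parse_fasta(SAMPLE), PATTERNS)
for header, sequence in matches:
    print(header, len(sequence))
```

Using more than one pattern is what lets the third record through: it carries no gene field, but its protein name matches the second expression.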
Quick start
- Clone the repo from GitHub:
git clone https://github.com/RaphaelHebert/nsdpy.git
- pip:
Depending on your environment, you may need to use pip3 instead of pip.
pip install nsdpy
Minimum Python version for nsdpy: 3.8.2
- Google Colab: save a copy of this notebook in your drive.
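Before installing with pip, it may help to confirm that the interpreter meets the minimum Python version stated above (3.8.2). The following is a small sketch for that check; the function name `meets_minimum` is an illustration and is not part of nsdpy.

```python
import sys
from typing import Optional, Sequence, Tuple

# Minimum version taken from the installation notes above.
MIN_VERSION: Tuple[int, int, int] = (3, 8, 2)


def meets_minimum(version: Optional[Sequence[int]] = None,
                  minimum: Tuple[int, int, int] = MIN_VERSION) -> bool:
    """Return True if `version` (default: the running interpreter) is >= `minimum`."""
    current = tuple(version[:3]) if version is not None else tuple(sys.version_info[:3])
    return current >= minimum


if not meets_minimum():
    print("This Python is older than 3.8.2; nsdpy may not install or run.")
```

Tuple comparison is used on purpose: it compares major, minor, and micro in order, which avoids the pitfalls of comparing version strings.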
Workflow
Usage
Google Colab
Command line
nsdpy -r "USER'S REQUEST" [OPTIONS]
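For scripted use, the command above can be assembled and launched from Python. This is only a sketch under stated assumptions: `-r` is the sole option taken from this document, the helper names `build_command` and `run_nsdpy` are hypothetical, and the example request is an illustrative NCBI search string rather than one from the nsdpy documentation. Any further options should be looked up in the official documentation instead of guessed.

```python
import shlex
import shutil
import subprocess
from typing import List, Sequence


def build_command(request: str, extra_options: Sequence[str] = ()) -> List[str]:
    """Build the argument list for `nsdpy -r REQUEST [OPTIONS]`.

    Only `-r` comes from the README; `extra_options` is passed through untouched.
    """
    return ["nsdpy", "-r", request, *extra_options]


def run_nsdpy(request: str, extra_options: Sequence[str] = ()) -> subprocess.CompletedProcess:
    """Run nsdpy if it is on PATH; raise if it is missing or exits with an error."""
    if shutil.which("nsdpy") is None:
        raise FileNotFoundError("nsdpy not found on PATH; install it with pip first")
    return subprocess.run(build_command(request, extra_options), check=True)


# Illustrative request (an assumption, not from the README).
cmd = build_command("COX1[Title] AND Aves[Organism]")

# Show the shell-quoted form without running anything.
print(shlex.join(cmd))
```

Passing the request as one list element means the shell never sees it, so quotes, spaces, and brackets in the search string need no manual escaping.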
Authors and acknowledgment
Support
License
Code and documentation copyright 2021 the nsdpy authors. Code released under the MIT License.
More Documentation
Official documentation:
Readthedocs