Skip to main content

Automatize the download of DNA sequences from NCBI, sort them according to their taxonomy and filter them with a gene name (provided as a regular expression)

Project description

nsdpy

License: MIT pypi Python 3.8 Documentation Status

Introduction

nsdpy (nucleotide or NCBI sequence downloader) aims to ease the download and sort of big bacth of DNA sequences from the NCBI database. It can also be usefull to filter the sequences based on their annotations. Using nsdpy the user can:

  • Search NCBI nucleotide database
  • Download the fasta files or the cds_fasta files corresponding to the result of the search
  • Sort the sequences based on their taxonomy
  • Select coding sequences from cds files based on the gene names using one or more regular expressions. This can help the user retrieve some sequences for which the gene name is annotated in another field.
  • Retrieve the taxonomic information and add it to the output sequences.

Quick start

  • Clone the repo from Github:
git clone https://github.com/RaphaelHebert/nsdpy.git
  • pip:
    depending on the user environment pip may be replaced by pip3 if pip3 is used
pip install nsdpy

minimum python version for nsdpy: 3.8.2

Workflow

Usage

Google colab

nsdpy colab notebook

Command line

nsdpy -r "USER'S REQUEST" [OPTIONS] 

Authors and acknowledgment

Raphael Hebert
Emese Meglecz

Support

License

Code and documentation copyright 2021 the nsdpy authors. Code released under the MIT License.

More Documentation

Official documentation:
Readthedocs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nsdpy-0.3.5b0.tar.gz (15.5 kB view details)

Uploaded Source

Built Distribution

nsdpy-0.3.5b0-py3-none-any.whl (15.4 kB view details)

Uploaded Python 3

File details

Details for the file nsdpy-0.3.5b0.tar.gz.

File metadata

  • Download URL: nsdpy-0.3.5b0.tar.gz
  • Upload date:
  • Size: 15.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.11.2 Darwin/22.4.0

File hashes

Hashes for nsdpy-0.3.5b0.tar.gz
Algorithm Hash digest
SHA256 ea3570365dfcde84b0a798460a032272a8dc7f5c68d4ac43a19d6c8a68d0abcf
MD5 44f76fef57133654428a63e96aa4b6b6
BLAKE2b-256 2dd7c93df9e3abfe0d01904329c011bdf36b13ac7000a1cdb01ef4f24f3f8332

See more details on using hashes here.

File details

Details for the file nsdpy-0.3.5b0-py3-none-any.whl.

File metadata

  • Download URL: nsdpy-0.3.5b0-py3-none-any.whl
  • Upload date:
  • Size: 15.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.0 CPython/3.11.2 Darwin/22.4.0

File hashes

Hashes for nsdpy-0.3.5b0-py3-none-any.whl
Algorithm Hash digest
SHA256 fac2ad94fc16599336560e25cfed6e5361badd0b2f924e80ec3e12509250dcb7
MD5 664a66ce1de61523987b4ff246d94c2e
BLAKE2b-256 c11ddc4f539a14f21091cbb4e6e53cbc6bcc44776da375b2ea6993d6416d7a77

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page