Skip to main content

Automatize the donwload of DNA sequences from NCBI, sort them according to their taxonomy and filter them with a gene name (provided as a regular expression)

Project description

NucleoPy

Introduction

NucleoPy aims to ease the download and sort of big bacth of DNA sequences from the NCBI database. It can also be usefull to filter the sequences based on their annotations. Using NucleoPy the user can:

  • Search NCBI nucleotide database
  • Download the fasta files or the cds_fasta files corresponding to the result of the search
  • Sort the sequences based on their taxonomy
  • Filter the sequences based on the name of the gene by giving one or more regular expression as filter(s)

Workflow

workflow

Installation

The package will be available from PyPI soon.

exepected:

pip3 install nucleopy

Usage

On google colab:

Nucleopy colab notebook

Command line

      python3 main.py -r USER'S REQUEST [OPTIONS] 

Authors and acknowledgment

Support

License

to choose, probably: https://choosealicense.com/licenses/gpl-3.0/

More Documentation

For examples of usage and more detailed documentation check: [Users manual on google doc](https://docs.google.com/document/d/1CJQg2Cv3P0lgWZRYd9xJQfj8qwIY4a-wtXa4VERdH2c/edit?usp=sharing =100)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nsdpy-0.1.12.tar.gz (11.5 kB view hashes)

Uploaded Source

Built Distribution

nsdpy-0.1.12-py3-none-any.whl (11.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page