Skip to main content

A comprehensive bioinformatics package with 10 essential programs

Project description

NGD - Bioinformatics Programs Package

A comprehensive Python package containing 10 essential bioinformatics programs for DNA/RNA analysis, sequence manipulation, and protein structure visualization.

Installation

You can install the package using pip:

pip install ngd

Or install from source:

git clone https://github.com/yourusername/ngd.git
cd ngd
pip install -e .

Available Programs

The package includes 10 comprehensive bioinformatics programs:

1. DNA Manipulation and Translation

  • DNA sequence slicing and concatenation
  • DNA to RNA transcription
  • RNA to protein translation
  • Basic sequence operations using Biopython

2. Reading FASTA Files

  • Parse and read FASTA format files
  • Extract sequence descriptions and data
  • Handle multiple sequences in a single file

3. Writing and Reading GenBank Format

  • Create GenBank records with annotations
  • Write sequences to GenBank format
  • Read and parse GenBank files
  • Handle sequence metadata

4. Converting FASTA to GenBank with Annotations

  • Convert between sequence formats
  • Add annotations during conversion
  • Batch processing of multiple sequences
  • Format validation

5. Adding Features and Annotations to SeqRecord

  • Create detailed sequence records
  • Add gene features and annotations
  • Modify sequence metadata
  • Handle complex biological annotations

6. Fetching Sequences from NCBI using Entrez

  • Access NCBI databases programmatically
  • Retrieve GenBank records by ID
  • Extract sequence information
  • Handle NCBI API responses

7. Pairwise Sequence Alignment

  • Align two DNA sequences
  • Calculate alignment scores
  • Visualize sequence similarities
  • Handle alignment parameters

8. Multiple Sequence Alignment using MUSCLE

  • Perform multiple sequence alignment
  • Use external MUSCLE tool
  • Handle alignment output
  • Process alignment results

9. Constructing Phylogenetic Trees

  • Build phylogenetic trees from alignments
  • Calculate distance matrices
  • Create UPGMA trees
  • Visualize and save tree structures

10. PDB 3D Structure Visualization

  • Download protein structures from PDB
  • Parse mmCIF files
  • Extract atomic coordinates
  • Create 3D structure visualizations

Usage

To view the code for any program, use the print_program function:

from ngd.programs import print_program

# Print program 1 (DNA Manipulation)
print_program(1)

# Print program 2 (FASTA Reading)
print_program(2)

# Print program 3 (GenBank Operations)
print_program(3)

# And so on for programs 4-10...

Requirements

The package requires the following dependencies:

  • biopython >= 1.79 - Core bioinformatics functionality
  • matplotlib >= 3.5.0 - Plotting and visualization
  • numpy >= 1.21.0 - Numerical computations
  • pandas >= 1.3.0 - Data manipulation
  • requests >= 2.25.0 - HTTP requests for NCBI access

Features

  • Comprehensive Coverage: From basic DNA manipulation to advanced phylogenetic analysis
  • Educational: Well-commented code suitable for learning bioinformatics
  • Practical: Real-world applications using popular bioinformatics tools
  • Extensible: Easy to modify and extend for specific research needs
  • Cross-platform: Works on Windows, macOS, and Linux

Examples

Basic DNA Operations

from ngd.programs import print_program
print_program(1)  # DNA manipulation and translation

Sequence Analysis

from ngd.programs import print_program
print_program(7)  # Pairwise sequence alignment

Database Access

from ngd.programs import print_program
print_program(6)  # NCBI sequence fetching

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use this package in your research, please cite:

NGD Bioinformatics Package (2024). A comprehensive collection of bioinformatics programs.
Available at: https://pypi.org/project/ngd/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ngd-0.2.1.tar.gz (5.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ngd-0.2.1-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file ngd-0.2.1.tar.gz.

File metadata

  • Download URL: ngd-0.2.1.tar.gz
  • Upload date:
  • Size: 5.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ngd-0.2.1.tar.gz
Algorithm Hash digest
SHA256 9a9a3c510290513379d8e52738e9c002b20fbf1b6292a873550b70fa249ef94f
MD5 ad9cd99a46f94f07e3b2a36e1ca7d6c8
BLAKE2b-256 6fe42ebcbad5a57fdc71b21587140996ea633bb6aa07c3b9330ff25acc41847c

See more details on using hashes here.

File details

Details for the file ngd-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: ngd-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 6.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ngd-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b59f3f90bf98a65f4faa5fe36ffd4a889ecac8f3350c22bc4111612e1a4f0b03
MD5 ded8b9b22f61305cea8ebd1b47d75285
BLAKE2b-256 d7d0c4b18c12e1f5ad011545167e287c67c7e054e632ce234c449dd184356c63

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page