Skip to main content

A comprehensive bioinformatics package with 10 essential programs

Project description

NGD - Bioinformatics Programs Package

A comprehensive Python package containing 10 essential bioinformatics programs for DNA/RNA analysis, sequence manipulation, and protein structure visualization.

Installation

You can install the package using pip:

pip install ngd

Or install from source:

git clone https://github.com/yourusername/ngd.git
cd ngd
pip install -e .

Available Programs

The package includes 10 comprehensive bioinformatics programs:

1. DNA Manipulation and Translation

  • DNA sequence slicing and concatenation
  • DNA to RNA transcription
  • RNA to protein translation
  • Basic sequence operations using Biopython

2. Reading FASTA Files

  • Parse and read FASTA format files
  • Extract sequence descriptions and data
  • Handle multiple sequences in a single file

3. Writing and Reading GenBank Format

  • Create GenBank records with annotations
  • Write sequences to GenBank format
  • Read and parse GenBank files
  • Handle sequence metadata

4. Converting FASTA to GenBank with Annotations

  • Convert between sequence formats
  • Add annotations during conversion
  • Batch processing of multiple sequences
  • Format validation

5. Adding Features and Annotations to SeqRecord

  • Create detailed sequence records
  • Add gene features and annotations
  • Modify sequence metadata
  • Handle complex biological annotations

6. Fetching Sequences from NCBI using Entrez

  • Access NCBI databases programmatically
  • Retrieve GenBank records by ID
  • Extract sequence information
  • Handle NCBI API responses

7. Pairwise Sequence Alignment

  • Align two DNA sequences
  • Calculate alignment scores
  • Visualize sequence similarities
  • Handle alignment parameters

8. Multiple Sequence Alignment using MUSCLE

  • Perform multiple sequence alignment
  • Use external MUSCLE tool
  • Handle alignment output
  • Process alignment results

9. Constructing Phylogenetic Trees

  • Build phylogenetic trees from alignments
  • Calculate distance matrices
  • Create UPGMA trees
  • Visualize and save tree structures

10. PDB 3D Structure Visualization

  • Download protein structures from PDB
  • Parse mmCIF files
  • Extract atomic coordinates
  • Create 3D structure visualizations

Usage

To view the code for any program, use the print_program function:

from ngd.programs import print_program

# Print program 1 (DNA Manipulation)
print_program(1)

# Print program 2 (FASTA Reading)
print_program(2)

# Print program 3 (GenBank Operations)
print_program(3)

# And so on for programs 4-10...

Requirements

The package requires the following dependencies:

  • biopython >= 1.79 - Core bioinformatics functionality
  • matplotlib >= 3.5.0 - Plotting and visualization
  • numpy >= 1.21.0 - Numerical computations
  • pandas >= 1.3.0 - Data manipulation
  • requests >= 2.25.0 - HTTP requests for NCBI access

Features

  • Comprehensive Coverage: From basic DNA manipulation to advanced phylogenetic analysis
  • Educational: Well-commented code suitable for learning bioinformatics
  • Practical: Real-world applications using popular bioinformatics tools
  • Extensible: Easy to modify and extend for specific research needs
  • Cross-platform: Works on Windows, macOS, and Linux

Examples

Basic DNA Operations

from ngd.programs import print_program
print_program(1)  # DNA manipulation and translation

Sequence Analysis

from ngd.programs import print_program
print_program(7)  # Pairwise sequence alignment

Database Access

from ngd.programs import print_program
print_program(6)  # NCBI sequence fetching

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use this package in your research, please cite:

NGD Bioinformatics Package (2024). A comprehensive collection of bioinformatics programs.
Available at: https://pypi.org/project/ngd/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ngd-0.2.0.tar.gz (5.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ngd-0.2.0-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file ngd-0.2.0.tar.gz.

File metadata

  • Download URL: ngd-0.2.0.tar.gz
  • Upload date:
  • Size: 5.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ngd-0.2.0.tar.gz
Algorithm Hash digest
SHA256 4260ede83c4bd3e3c59c37836211a024a086c0917b87a1533dbb267714bb1aad
MD5 6f119209110b9b6a1a3f530cb351f2f8
BLAKE2b-256 a004e6a51d14dd2bced84e1679d8df0d364526fcb2ac61d8315ea7a44580d823

See more details on using hashes here.

File details

Details for the file ngd-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: ngd-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 6.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.9

File hashes

Hashes for ngd-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0ca847db809df68bceee2f2adacba11abb355f788bc9159e61e20b472174bc31
MD5 4308af3f208a7688656a61189aab6879
BLAKE2b-256 bdc43478e42096282bf483cb4a5ecac351df0f85df4a11a808ba121094f7ca47

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page