Skip to main content

PlasAnn - Plasmid sequence annotation tool with enhanced dependency management

Project description

PlasAnn - Plasmid Annotation Tool

This tool is designed for annotating plasmid sequences from FASTA or GenBank files. It provides comprehensive annotation including coding sequences (CDS), origins of replication (oriC/oriV), origins of transfer (oriT), transposons, replicons, and non-coding RNAs.

Features

  • Automatic Database Download: Databases are downloaded automatically on first run
  • Flexible Input: Single files or entire folders of FASTA/GenBank files
  • Gene Prediction: Uses Prodigal for accurate gene calling
  • Mobile Element Detection: Identifies transposons, insertion sequences, and origins
  • Visualizations: Generates circular plasmid maps
  • Batch Processing: Process multiple files simultaneously
  • Enhanced Annotation: Optional UniProt BLAST for comprehensive protein annotation

Installation

Dependencies

This tool requires the following external programs:

  • BLAST+ (makeblastdb, blastn, blastx, blastp)
  • Prodigal (CDS prediction)
  • Infernal (cmscan, cmpress - for ncRNA detection)

Install PlasAnn

pip install plasann

Installing External Dependencies

For macOS (Apple Silicon M1/M2):

brew tap brewsci/bio
brew install blast prodigal infernal

For Linux (Ubuntu/Debian):

sudo apt install ncbi-blast+ prodigal infernal

For other systems: Install from source following the official documentation for BLAST+, Prodigal, and Infernal.

Verify Installation

PlasAnn --check-deps

Usage

Basic Commands

PlasAnn -i <input_file_or_directory> -o <output_directory> -t <file_type>

Parameters

  • -i, --input: Path to input file or directory containing FASTA or GenBank files
  • -o, --output: Path to output directory where results will be stored
  • -t, --type: Type of input files: fasta, genbank, or auto

Examples

Single FASTA file:

PlasAnn -i plasmid.fasta -o results -t fasta

Folder of FASTA files:

PlasAnn -i fasta_folder/ -o results -t fasta

GenBank file (retain existing annotations):

PlasAnn -i plasmid.gb -o results -t genbank --retain

GenBank file (re-annotate with Prodigal):

PlasAnn -i plasmid.gb -o results -t genbank --overwrite

Auto-detect mixed file types:

PlasAnn -i mixed_folder/ -o results -t auto

Enhanced annotation with UniProt:

PlasAnn -i plasmid.fasta -o results -t fasta --uniprot-blast

GenBank Processing Options

When using GenBank files, you can choose:

  • --retain: Use existing CDS annotations in the GenBank file (default)
  • --overwrite: Ignore existing annotations and re-predict genes using Prodigal

Additional Options

  • --uniprot-blast: Run enhanced UniProt BLAST annotation (slower but more comprehensive)
  • --min-identity: Minimum identity percentage for UniProt BLAST hits (default: 50%)
  • --version: Show version information
  • --check-deps: Check external dependency status

Output Files

For each input file, PlasAnn generates:

  • CSV Table: Detailed annotation table with gene information
  • GenBank File: Annotated GenBank file with all features
  • Plasmid Map: Circular visualization of the annotated plasmid (PNG format)
  • Enhanced CSV: Additional file with UniProt annotations (if --uniprot-blast used)

Troubleshooting

Check dependencies:

PlasAnn --check-deps

Common issues:

  • Ensure BLAST+, Prodigal, and Infernal are properly installed
  • Check that input files are valid FASTA or GenBank format
  • For very short sequences (<100bp), CDS prediction may not work well

Contact

For questions or issues, contact: hislam2@ur.rochester.edu

Citation

If you use PlasAnn in your research, please cite:

PlasAnn: Comprehensive Plasmid Annotation Pipeline
Habibul Islam
University of Rochester

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

plasann-1.1.6.tar.gz (68.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

plasann-1.1.6-py3-none-any.whl (47.6 kB view details)

Uploaded Python 3

File details

Details for the file plasann-1.1.6.tar.gz.

File metadata

  • Download URL: plasann-1.1.6.tar.gz
  • Upload date:
  • Size: 68.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for plasann-1.1.6.tar.gz
Algorithm Hash digest
SHA256 0f86d7ce930e21bfa55530b2d1522a02de9e82f26289541c8fee19b1315b60f3
MD5 e067ce84cd3d26bb3b9872c1fcd11d1a
BLAKE2b-256 8706a43ba3bb5a2f07c99f3a848d2c873cad3f793a94836eb8fe016c7d23ee3d

See more details on using hashes here.

File details

Details for the file plasann-1.1.6-py3-none-any.whl.

File metadata

  • Download URL: plasann-1.1.6-py3-none-any.whl
  • Upload date:
  • Size: 47.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for plasann-1.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 5269836ef455b7a4d5a7136bb589e8bee882b6ae59eecda09b78303afa4b698e
MD5 3ab2fc673e6055eec7bcc02c5411310a
BLAKE2b-256 5b7a0eaefc55a63996d12d500ff8650287fe0a4f452b773c88334da4034bf4d7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page