Skip to main content

ANNOgesic - A tool for bacterial/archaeal RNA-Seq based genome annotations

Project description

https://img.shields.io/pypi/v/annogesic.svg https://img.shields.io/pypi/l/annogesic.svg https://zenodo.org/badge/34061246.svg

About ANNOgesic

ANNOgesic is the swiss army knife for RNA-Seq based annotation of bacterial/archaeal genomes.

It is a modular, command-line tool that can integrate different types of RNA-Seq data based on dRNA-Seq (differential RNA-Seq) or RNA-Seq protocols that inclusde transcript fragmentation to generate high quality genome annotations. It can detect genes, CDSs/tRNAs/rRNAs, transcription starting sites (TSS) and processing sites, transcripts, terminators, untranslated regions (UTR) as well as small RNAs (sRNA), small open reading frames (sORF), circular RNAs, CRISPR related RNAs, riboswitches and RNA-thermometers. It can also perform RNA-RNA and protein-protein interactions prediction. Furthermore, it groups genes into operons and sub-operons and reveal promoter motifs. It can also allocate GO term and subcellular localization to genes. Several of ANNOgesic features are new implementations while other build on well known third-party tools for which it offers adaptive parameter-optimizations. Additionally, numerous visualization and statistics help the user to quickly evaluat feature predictions resulting from an ANNOgesic analysis. The tool was heavily tested with several RNA-Seq data set from bacterial as well as archaeal samples.

Documentation

Documentation can be found on here.

Installation

Pip3

If you have all the requirements installed, installation can be done by using pip3.

With root permission:

$ pip3 install ANNOgesic

without root permission

$ pip3 install --user ANNOgesic

If you want to know the requirement, please refer to Documentation.

Docker and Singularity

In order to solve the issue of installing the dependencies, a Docker image of ANNOgesic is provided in Docker Hub. The image can be pulled by using Docker or Singularity. Moreover, the users can build an Docker image from the Dockerfile by themselves. For the details, please check the documentation.

Github

The alternative way for installing ANNOgesic is directly clone the Git repository.

$ git clone https://github.com/Sung-Huan/ANNOgesic.git

or

$ git clone git@github.com:Sung-Huan/ANNOgesic.git

In order to make ANNOgesic runnable, we should create a soft link of annogesiclib in bin.

$ cd ANNOgesic/bin
$ ln -s ../annogesiclib .

Then, you can run ANNOgesic via specifying the installed path if all the requirements are setup properly.

Arguments

usage: annogesic [-h] [--version]
                 {create,get_input_files,update_genome_fasta,annotation_transfer,
                  tss_ps,optimize_tss_ps,terminator,transcript,utr,srna,sorf,
                  promoter,operon,circrna,go_term,srna_target,snp,ppi_network,
                  localization,riboswitch_thermometer,crispr,merge_features,
                  screenshot,colorize_screenshot_tracks}
                 ...

positional arguments:
  {create,get_input_files,update_genome_fasta,annotation_transfer,tss_ps,
   optimize_tss_ps,terminator,transcript,utr,srna,sorf,promoter,operon,circrna,
   go_term,srna_target,snp,ppi_network,localization,riboswitch_thermometer,
   crispr,merge_features,screenshot,colorize_screenshot_tracks}
                        commands
    create              Create a project
    get_input_files     Get required files. (i.e. annotation files, fasta
                        files)
    update_genome_fasta
                        Get fasta files of query genomes if the query
                        sequences do not exist.
    annotation_transfer
                        Transfer the annotations from a closely related
                        species genome to a target genome.
    tss_ps              Detect TSSs or processing sites.
    optimize_tss_ps     Optimize TSSs or processing sites based on manual
                        detected ones.
    terminator          Detect rho-independent terminators.
    transcript          Detect transcripts based on coverage file.
    utr                 Detect 5'UTRs and 3'UTRs.
    srna                Detect intergenic, antisense and UTR-derived sRNAs.
    sorf                Detect expressed sORFs.
    promoter            Discover promoter motifs.
    operon              Detect operons and sub-operons.
    circrna             Detect circular RNAs.
    go_term             Extract GO terms from Uniprot.
    srna_target         Detect sRNA-mRNA interactions.
    snp                 Detect SNP/mutation and generate fasta file if
                        mutations were found.
    ppi_network         Detect protein-protein interactions suported by
                        literature.
    localization        Predict subcellular localization of proteins.
    riboswitch_thermometer
                        Predict riboswitches and RNA thermometers.
    crispr              Predict CRISPR related RNAs.
    merge_features      Merge all features to one gff file.
    screenshot          Generate screenshots for selected features using IGV.
    colorize_screenshot_tracks
                        Add color information to screenshots (e.g. useful for
                        dRNA-Seq based TSS and PS detection. It only works
                        after running "screenshot" (after running batch
                        script).

optional arguments:
  -h, --help            show this help message and exit
  --version, -v         show version

Citation

SH Yu, J. Vogel, KU Förstner. 2018, GigaScience, DOI:10.1093/gigascience/giy096, PMID:30169674.

License

ISC (Internet Systems Consortium license ~ simplified BSD license) - see LICENSE

Contact

If you have any questions, please contact Sung-Huan Yu

Project details


Release history Release notifications | RSS feed

This version

1.0.6

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ANNOgesic-1.0.6.linux-x86_64.tar.gz (581.1 kB view details)

Uploaded Source

Built Distribution

ANNOgesic-1.0.6-py3-none-any.whl (299.5 kB view details)

Uploaded Python 3

File details

Details for the file ANNOgesic-1.0.6.linux-x86_64.tar.gz.

File metadata

  • Download URL: ANNOgesic-1.0.6.linux-x86_64.tar.gz
  • Upload date:
  • Size: 581.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/40.0.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.7

File hashes

Hashes for ANNOgesic-1.0.6.linux-x86_64.tar.gz
Algorithm Hash digest
SHA256 bccc3f13bd0a3f472d2505780cce6921237bf1f57030cc56ffd983c0b0acb367
MD5 d10ea16deaef66cca49193241e0043f2
BLAKE2b-256 94b386b07e43164191c34c164a91960e1e260d2a8abc31a57eaa9f5af4561247

See more details on using hashes here.

File details

Details for the file ANNOgesic-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: ANNOgesic-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 299.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/40.0.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.7

File hashes

Hashes for ANNOgesic-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 a17d6bff6f733c3ce0940666ff9f10c3073d913ca4147b28d2edbb034283df9b
MD5 6a3524155557d1397b37e2566a0745b8
BLAKE2b-256 cf260dbbd369b3baaf29af5172f4459de98d2af6083742ed4ef034612d775f93

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page