Skip to main content

Annotation histories from Genbank files.

Project description

nebgbhist

Tools for building annotation histories from multiple Genbank files.

Example

$ mkdir gbhist
$ neb-rev-fetch -d gbhist NC_008512
$ neb-gbhist -d gbhist | neb-diff-apply NC_008512.git
$ git --git-dir=NC_008512.git gc --aggressive
$ neb-validate-history -p NC_008512.git -g gbhist/2009-04-29-04-04.gbk

Playing with files

If you clone the pack repository after building you can poke around at the contents on the file system. For larger genomes with lots of edits this may run afoul of directory entry limits until I rewrite the object storage. For the Carsonella (NC_008512) example I use there isn’t an issue.

$ git clone NC_008512.git
$ cd NC_008512
$ ls -1
accession  dblink     keywords   objects/   seq/       version
comment    definition locus      ref/       source

Files in the root directory represent keywords defined in the Genbank file. The ref and seq subdirectories contain information about references and the sequence hashes. The objects directory lists the features in the history. The files objects/known and objects/alive are dictionaries of hashes that point toward the current version of a feature at a given location. Active features are anything that is not deleted or replaced with a new version.

I recommend that you work with nebpack.Pack objects if you want to actually do anything with the objects.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nebgbhist-0.0.5.tar.gz (9.4 kB view details)

Uploaded Source

File details

Details for the file nebgbhist-0.0.5.tar.gz.

File metadata

  • Download URL: nebgbhist-0.0.5.tar.gz
  • Upload date:
  • Size: 9.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for nebgbhist-0.0.5.tar.gz
Algorithm Hash digest
SHA256 b2241788d8e7750baf41d8678d744c2a25c67214bd80228a05482d32f3a8f2c6
MD5 cbbd8b8a759e4eb72921c1215f9d5f16
BLAKE2b-256 65062667ac288e04c85b6d74ace3fbcd414e9444fba489d0533bda5f333ec498

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page