Skip to main content

CaPSID: Computational Pathogen Sequence Identification

Project description

CaPSID

CaPSID (Computational Pathogen Sequence IDentification) is a comprehensive open source platform which integrates a high-performance computational pipeline for pathogen sequence identification and characterization in human genomes and transcriptomes together with a scalable results database and a user-friendly web-based software application for managing, querying and visualizing results.

Project Leader Vincent Ferretti Development Team Ivan Borozan, Philippe Laflamme, Shane Wilson, Stuart Watt

Downloading and using CaPSID is free, if you use CaPSID or its code in your work please acknowledge CaPSID by referring to its github homepage https://github.com/capsid. This is important for us since obtaining grants is one significant way to fund planning and implementation for our project. Also if you find CaPSID useful in your research feel free to let us know.

Getting Started

You will need a MongoDB database, a Python 2.7 installation and a Java Web Server. For more details, read the wiki:

https://github.com/capsid/capsid/wiki/

Acknowledgement

This project is supported by the Ontario Institute for Cancer Research (OICR) through funding provided by the government of Ontario, Canada.

News

1.6.2

Release date: 16 July 2014

  • enforce the use of pymongo version==2.8

1.6.1

Release date: 16 September 2014

  • added taxon tables for viruses

  • added gra (genome relative abundance) calculations

  • metrics are now calculated for i) reads aligning to pathogen only (excluding those aligning simultaneously to the human ref (i.e digital subtraction)) and ii) reads aligning simultaneously to both pathogen and human refs

  • paired reads from the human ref pair-end BAM file are considered valid only if aligned as proper pairs

  • changed the filtering approach from collapsing of FASTQ and filtering to filtering

  • fixed a bug to do with the RAM usage when reading a xeno file during subtraction

1.4.3

Release date: 2 October 2012

  • Fixes Error message when running subtraction and alignment is not found

  • More work on fixing long running queries

1.4.2

Release date: 2 September 2012

  • Removes MongoKit dependency

1.4.1

Release date: 4 July 2012

  • Fixes cursor timeout for long running statistics queries

1.4.0

Release date: 1 June 2012

  • Adds statistics while they are being run rather than all at the end

  • Saves unique id for genes as uid

1.3.0

Release date: 28 Feb 2012

  • Adds a new field in mapped for AS tag

  • Links mapped genes to NCBI page

  • Uses max sample coverage for project max coverage

  • Adds timer to wait for file collapsing to finish when intersecting

  • Using a path/to/file input no longer breaks intersecting

1.2.6

Release date: 6 Feb 2012

  • Subtraction filters out unmapped when building mapped reads

1.2

Release date: 20 Jan 2012

  • Utility to create FastQ files from unmapped reads

  • Utility to return intersection of FastQ files

  • Utility to return filter FastQ files

  • Added mapq threshold flag to subtraction

  • Gbloader can now load bacteria and fungal genomes

  • Saves genome sequences to database using GridFS

  • Reduced memory footprint when calculating statistics

  • use subprocesses instead of os.system

  • mapq scores filter needs to include 0 <= mapq <=3 due to multiple mapping reads

1.1

Release date: 25 Nov 2011

  • Adds support for pair-end reads

1.0.1

Release date: 19 Oct 2011

  • Added support for MongoDB authentication

1.0

Release date: 12 Oct 2011

  • Initial release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

capsid-1.6.2.tar.gz (28.8 kB view details)

Uploaded Source

File details

Details for the file capsid-1.6.2.tar.gz.

File metadata

  • Download URL: capsid-1.6.2.tar.gz
  • Upload date:
  • Size: 28.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for capsid-1.6.2.tar.gz
Algorithm Hash digest
SHA256 33cfdf54ea737ad41c8902c2f6ccb0f9ac182a29414279236ae9cd45235fb41e
MD5 a6cdf7f7316e8b1a2534d4255c4b6ff3
BLAKE2b-256 bbbc94b0662d93cd6b1f9976e9c23530995731d14955c4c2c129a9a66e204850

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page