Skip to main content

Generate mutated sequence files from a reference genome.

Project description

https://img.shields.io/pypi/v/snp-mutator.svg

Generate mutated sequence files from a reference genome.

SNP Mutator was developed by the United States Food and Drug Administration, Center for Food Safety and Applied Nutrition.

Features

  • Reads a fasta file and generates any number of mutated fasta replicate files.

  • Mutations can be any number of single-base substitutions, insertions, and deletions at randomly chosen positions, uniformly distributed across the genome.

  • Mutations can be chosen from a subset (pool) of all possible positions.

  • Replicates can be partitioned into multiple groups with each group sharing a pool of eligible positions.

  • Generates a summary file listing the original base and the mutation for all mutated positions.

  • Mutations can be either monomorphic or polymorphic.

  • VCF file generation.

  • Various metrics can be saved to an output file.

  • The fasta defline sequence id can be customized.

Citing SNP Mutator

To cite SNP Mutator, please cite the publication below:

Davis S, Pettengill JB, Luo Y, Payne J, Shpuntoff A, Rand H, Strain E. (2015) CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Computer Science 1:e20 https://doi.org/10.7717/peerj-cs.20

License

See the LICENSE file included in the SNP Mutator distribution.

History

1.2.0 (2019-05-15)

  • Added support for Python 3.6 and 3.7.

  • Added a command line option to specify the output directory for the generated fasta files.

1.1.0 (2018-10-23)

  • Added the capability to specify the fasta defline sequence id in the output files.

  • The unmutated, but concatenated sequence reference can be written to a separate output file. All the replicates will be mutations of this file.

1.0.0 (2018-10-12)

  • Mutations can be chosen from a subset (pool) of all possible positions.

  • Replicates can be partitioned into multiple groups with each group sharing a pool of eligible positions.

  • Mutations can be either monomorphic or polymorphic.

  • Add the capability to generate VCF output files.

  • Add the capability to generate a metrics output file.

  • The installer creates an executable script called snpmutator.

  • Insertions are now placed after the original reference position, not before, as in prior versions of snpmutator. This means it will not be possible to exactly reproduce the results of prior versions of this software.

0.2.0 (2016-01-15)

  • Allow lowercase bases in the input fasta file.

  • Do not mutate gaps or ambiguous positions.

  • Add a command line switch to show the program version.

0.1.1 (2015-06-17)

  • Remove spaces from the summary file column headings. This will simplify downstream analysis in some scripting languages.

0.1.0 (2015-05-21)

  • First release on GitHub, Read the Docs, and PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

snp-mutator-1.2.0.tar.gz (27.8 kB view details)

Uploaded Source

File details

Details for the file snp-mutator-1.2.0.tar.gz.

File metadata

  • Download URL: snp-mutator-1.2.0.tar.gz
  • Upload date:
  • Size: 27.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Python-urllib/3.6

File hashes

Hashes for snp-mutator-1.2.0.tar.gz
Algorithm Hash digest
SHA256 ddf14bb955d6863a24c4a25700f51c3a76067a4f68843cfb4d0caa0c97b1c91e
MD5 f598b8864b19a5bac48f8a3eeba51c84
BLAKE2b-256 4d97b0d745de911f88a5ad381095aa08b615da9ae098f8a928e00e425c715334

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page