Generate synthetic molecular data
Project description
InSilicoDNA
Note: This package is still in a planning phase and is not intended for external usage
Why create synthetic molecular data when there is an ever-growing body of real data to use? Simply stated, control. For the Engineer or Data Engineer, it is critical to have data when building pipelines. For the Scientist, Data Scientist, or Machine Learning Engineer, it is critical to have data when modeling and testing assumptions. While there are now many sets of real molecular data to select from, there are few cases where all types of data exist for a population of individuals in an unrestricted manner.
Getting Started
From CLI
Review options with --help
% insilicodna --help
Print common usage
% insilicodna
Create .fasta and .gff3 synthetic data for 50 genes
% insilicodna --gene-count 50 --fasta --gff3
From python
import insilicodna
insilicodna.generate_contig(
output_prefix="MySyntheticData",
n_genes=50,
fasta_file=True,
gff3_file=True)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for insilicodna-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 78c31e2e0cba6b207407d81c3ab5c434b44474f0f7c930ad543cd4dd5a383555 |
|
MD5 | 1ad13b355aab7c4f665e8662e955f152 |
|
BLAKE2b-256 | db7627545aeecda415e9fa334b3eebe2da85fbd95562af32ad4b55fa504a403e |