Skip to main content

Install functions to simulate gene expression compendia

Project description

ponyo

Coverage Status

Alexandra Lee and Casey Greene 2020

University of Pennsylvania

This repository is named after the the character Ponyo, from Hayao Miyazaki's animated film Ponyo, as she uses her magic to simulate a human appearance after getting a sample of human blood. The method simulates a compendia of new gene expression data based on existing gene expression data to learn a representation of gene expression patterns.

Installation

This package can be installed using pip:

pip install ponyo

How to use

Example notebooks using ponyo on test data can be found in human_tests

Additionally, this method has been used in simulate-expression-compendia and generic-expression-patterns repositories.

Setting random seeds

To keep the VAE training deterministic, you will need to set multiple random seeds:

  1. numpy random
  2. python random
  3. tensorflow random

For an example of this, see human_tests

Configuration file

The tables lists the core parameters required to generate simulated data using modules from ponyo. Those marked with * indicate those parameters that will vary depending on the type of approach .

Name Description
local_dir str: Parent directory on local machine to store intermediate results
dataset_name str: Name for analysis directory containing notebooks using ponyo
raw_data_filename str: File storing raw gene expression data
normalized_data_filename str: File storing normalized gene expression data
metadata_filename* str: File containing metadata associated with data
experiment_ids_filename* str: File containing list of experiment ids that have gene expression data available
scaler_transform_filename str: File to store mapping from normalized to raw gene expression range
simulation_type str: Name of simulation approach directory to store results locally
NN_architecture str: Name of neural network architecture to use. Format 'NN__'
learning_rate float: Step size used for gradient descent. In other words, it's how quickly the methods is learning
batch_size str: Training is performed in batches. So this determines the number of samples to consider at a given time
epochs int: Number of times to train over the entire input dataset
kappa float: How fast to linearly ramp up KL loss
intermediate_dim int: Size of the hidden layer
latent_dim int: Size of the bottleneck layer
epsilon_std float: Standard deviation of Normal distribution to sample latent space
validation_frac float: Fraction of input samples to use to validate for VAE training
num_simulated_samples* int: If using random sampling approach, simulate a compendia with these many samples
num_simulated_experiments* int: If using latent-transformation approach, simulate a compendia with these many experiments
num_simulated* int: If using template-based approach, simulate these many experiments
metadata_delimiter* str: Delimiter to parse metadata file
metadata_experiment_colname* str: Column header that contains experiment id that maps expression data and metadata
metadata_sample_colname* str: Column header that contains sample id that maps expression data and metadata
project_id* int: If using template-based approach, experiment id to use as template experiment

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ponyo-0.4.tar.gz (15.0 kB view details)

Uploaded Source

Built Distribution

ponyo-0.4-py3-none-any.whl (17.1 kB view details)

Uploaded Python 3

File details

Details for the file ponyo-0.4.tar.gz.

File metadata

  • Download URL: ponyo-0.4.tar.gz
  • Upload date:
  • Size: 15.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.3

File hashes

Hashes for ponyo-0.4.tar.gz
Algorithm Hash digest
SHA256 32aed0c4a3a1962648259f77b03222ccfba62a6543f1fa7b0efea0b9a3db3672
MD5 06df40cb7a5ab3e5c3025354c68307a9
BLAKE2b-256 cfa83112918716c469cb3fdf8d432ddae20a545624497d878ca0eb1edb461944

See more details on using hashes here.

Provenance

File details

Details for the file ponyo-0.4-py3-none-any.whl.

File metadata

  • Download URL: ponyo-0.4-py3-none-any.whl
  • Upload date:
  • Size: 17.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.3

File hashes

Hashes for ponyo-0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 54ee46dd0525d28172e1888e421e1ba76cc123cacccaf93f0c7d4d2118c3c8f7
MD5 24a40d33ac95eb96172996c733d20eb6
BLAKE2b-256 6291c07c93f2022c8eff30d01a95c3cda19208863a0854579193050e84cfe270

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page