
Python package for retrieving conservation scores.

Project description

Pipeline to automatically retrieve and preprocess conservation scores

Installing the package

As with any Python package, install it with pip:

pip install conservation_scores_dataset

Usage examples

The following usage examples show how to run the pipeline and how to retrieve the pre-computed values stored in this GitHub repository.

Running the pipeline to retrieve the data

Some preprocessed data are already available in this repository. If you still want to run the pipeline yourself, note that the downloaded files will require a total of ADDSIZE of disk space.
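Since the download is large, it may be worth checking free disk space before starting. The following is a minimal sketch using only the standard library; the helper name `has_enough_space` and the 50 GiB figure are illustrative assumptions, not part of this package, and the real requirement should be substituted once known.

```python
import shutil


def has_enough_space(path: str, required_bytes: int) -> bool:
    """Return True if the filesystem containing `path` has at least `required_bytes` free."""
    free_bytes = shutil.disk_usage(path).free
    return free_bytes >= required_bytes


if __name__ == "__main__":
    # Hypothetical requirement: an illustrative placeholder only,
    # to be replaced with the actual download size.
    required = 50 * 1024 ** 3  # 50 GiB
    if has_enough_space(".", required):
        print("Enough free space to start the download.")
    else:
        print("Not enough free space in the current directory.")
```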

After installing the package, run the following to retrieve all data for the FANTOM5 and ROADMAP datasets for enhancers and promoters:

from conservation_scores_dataset import retrieve_all
retrieve_all()

While running, it displays loading bars showing what the pipeline is currently doing, such as the following:

https://github.com/LucaCappelletti94/conservation_scores_dataset/blob/main/example_screenshot.png?raw=true

Based on the main loop in the file that defines retrieve_all, similar automatic retrieval pipelines for other BED files can easily be built.
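As a rough illustration of what such a loop over other BED files could look like, here is a minimal sketch. It does not reproduce this package's internals: `iter_bed_regions`, `process_bed_files` and the `score_region` callback are hypothetical names introduced for the example, and the sketch only assumes the standard BED layout (chromosome, start and end in the first three whitespace-separated columns).

```python
from typing import Callable, Dict, Iterable, Iterator, List, Tuple

# A genomic region as (chromosome, start, end).
Region = Tuple[str, int, int]


def iter_bed_regions(lines: Iterable[str]) -> Iterator[Region]:
    """Yield (chrom, start, end) from BED-formatted lines, skipping headers and blanks."""
    for line in lines:
        line = line.strip()
        # Skip empty lines, comments, and UCSC "track"/"browser" header lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split()
        yield fields[0], int(fields[1]), int(fields[2])


def process_bed_files(
    paths: Iterable[str],
    score_region: Callable[[Region], float],
) -> Dict[str, List[float]]:
    """Apply `score_region` to every region of every BED file, one file at a time."""
    results: Dict[str, List[float]] = {}
    for path in paths:
        with open(path, "r") as handle:
            results[path] = [score_region(region) for region in iter_bed_regions(handle)]
    return results
```

Here `score_region` stands in for whatever per-region retrieval step a pipeline needs, for instance a function that looks up conservation scores for the region. Passing it as a callback keeps the file loop independent of the data source.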


Source Distribution

conservation_scores_dataset-0.0.1.tar.gz (5.0 kB)
