Skip to main content

Simple python tool to create dataset for arrps.

Project description

Simple python tool to render dataset that can be used for training models for active regulatory regions prediction.

How to get the dataset?

Just clone the repo.

How to get the package?

Just type into your terminal:

pip install arrp_dataset

Which genome does it use by default?

By default it uses hg19, as it is the genome used in the labeled data currently available from the Wasserman team.

Dependencies

This package will use the package bedtools to elaborate the bed files. A setup for the package is available here.

Rendering the dataset

Just type into your terminal:

python run.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for arrp-dataset, version 1.0.2
Filename, size File type Python version Upload date Hashes
Filename, size arrp_dataset-1.0.2-py3-none-any.whl (8.5 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size arrp_dataset-1.0.2.tar.gz (5.2 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page