Simple python tool to create dataset for arrps.
Project description
Simple python tool to render dataset that can be used for training models for active regulatory regions prediction.
How to get the dataset?
Just clone the repo.
How to get the package?
Just type into your terminal:
pip install arrp_dataset
Which genome does it use by default?
By default it uses hg19, as it is the genome used in the labeled data currently available from the Wasserman team.
Dependencies
This package will use the package bedtools to elaborate the bed files. A setup for the package is available here.
Rendering the dataset
Just type into your terminal:
python run.py
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
arrp_dataset-1.0.0.tar.gz
(5.2 kB
view hashes)
Built Distribution
Close
Hashes for arrp_dataset-1.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5c9841a96a400df9fa235e59716ec1dce5d8eece6f14b3637cf0e49a1ad6ed1c |
|
MD5 | 684976cdbfc221b7cd72989381940437 |
|
BLAKE2b-256 | 6b032dc284b2d7248b98b111f20c828c876e0f8fbb2addb19f18ecd05944da43 |