Pipelines for genomic region file to produce bed files, and it's statistics
Project description
bedboss
bedboss is a command-line pipeline that standardizes and calculates statistics for genomic interval data, and enters the results into a BEDbase database. It has 3 components: 1) bedmaker (bedboss make
); 2) bedqc (bedboss qc
); and 3) bedstat bedboss stat
. You may run all 3 pipelines separately, together (bedbase all
).
1) bedmaker
Converts supported file types into BED and bigBed format. Currently supported formats:
- bedGraph
- bigBed
- bigWig
- wig
2) bedqc
Assess QC of BED files and flag potential problems for further evaluation so you can determine whether they should be included in downstream analysis. Currently, it flags BED files that are larger than 2 GB, have over 5 milliom regions, or have mean region width less than 10 bp. These thresholds can be changed with pipeline arguments.
bedstat
Calculates statistics about BED files.
Documentation
Detailed information about each pipeline can be found in the bedboss Readme.
For the specific bedbase.org instance, see instructions in the bedbase.org repo.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for bedboss-0.1.0a3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4a50aaeeb01610bcc19fcdfa791a9741625a0858ad2c74309df74bc3208c772d |
|
MD5 | 035cbf01a48afb5fb486996994a83553 |
|
BLAKE2b-256 | 797d065375bd783a81cd001d1a8d3747c8ad745edad05d31dd37985a50a94533 |