Bootstrap JTK analysis for circadian rhythm detection

These details have not been verified by PyPI

Project links

Project description

BooteJTK

BooteJTK is an implementation of empirical JTK (eJTK) on parametrically bootstrapped resamplings of time series, used for detecting circadian rhythms in genomic data.

Based on BooteJTK by Alan Hutchison et al.; this fork improves Python 3 compatibility and integration with LIMBR.

References

Hutchison AL et al. (2016), "BooteJTK: Improved Rhythm Detection via Bootstrapping", bioRxiv.
Hutchison AL, Maienschein-Cline M, Chiang AH et al. "Improved statistical methods enable greater sensitivity in rhythm detection for genome-wide data." PLoS Computational Biology 2015 11(3): e1004094. doi:10.1371/journal.pcbi.1004094

Installation

pip install bootjtk

Requires Python 3.8 or later. All dependencies (numpy, scipy, pandas, matplotlib, statsmodels) are installed automatically. No R installation is required.

Quick start

Run 10 bootstrap resamplings on data with 2 replicates per timepoint:

bootejtk-calcp -f example/TestInput4.txt -x MYPREFIX -r 2 -z 10

The -p (period), -s (phases), and -a (asymmetries) ref-file arguments default to the standard 24 h files bundled with the package, so they can be omitted for typical circadian analyses.

Usage

`bootejtk-calcp` — full pipeline

This is the main entry point. It runs BooteJTK bootstrapping followed by p-value calculation.

bootejtk-calcp -f <input_file> -x <prefix> -r <replicates> -z <bootstraps> [options]

Option	Description	Default
`-f` / `--filename`	Input data file (tab-delimited, header row starting with `#` or `ID`)	required
`-x` / `--prefix`	Output file prefix	required
`-r` / `--reps`	Number of replicates per timepoint	`2`
`-z` / `--size`	Number of bootstrap resamplings	`50`
`-j` / `--workers`	Worker processes (`0` = all CPUs)	`1`
`-w` / `--waveform`	Reference waveform shape (see below)	`cosine`
`-p` / `--period`	Period reference file	bundled 24 h file
`-s` / `--phase`	Phase reference file	bundled 0–22 h by 2 file
`-a` / `--width`	Asymmetry reference file	bundled 2–22 h by 2 file
`-B` / `--basic`	Skip variance shrinkage preprocessing	off
`-L` / `--limma`	Use limma vooma variance estimation only (without vash imputation)	off
`--vash`	Use vash NA imputation before variance estimation	off
`-U` / `--noreps`	No replicates mode: estimate variance from arrhythmic genes	off
`-R` / `--rnaseq`	RNA-seq mode (passed to preprocessing)	off
`-W` / `--write`	Write pickle output files (`.pkl`) from BooteJTK	off

Run bootejtk-calcp --help to see all options and current defaults.

Preprocessing / variance shrinkage

By default, bootejtk-calcp applies vooma-style variance estimation with NA imputation and empirical Bayes shrinkage (the equivalent of --limma --vash). This preprocessing is implemented entirely in Python — no R installation is required.

Flag	Behaviour
(default, no flag)	Run limma vooma + vash NA imputation + eBayes shrinkage
`-L` / `--limma`	Run limma vooma + eBayes shrinkage (no NA imputation)
`--vash`	Enable vash NA imputation (implied by default)
`-B` / `--basic`	Skip all variance shrinkage; use raw values directly

`bootejtk` — core analysis only

Runs the BooteJTK analysis step without the CalcP p-value fitting step. Useful if you want to run CalcP separately or with custom settings.

bootejtk -f <input_file> -x <prefix> -r <replicates> -z <bootstraps> [options]

Waveform shapes

Value	Shape
`cosine` (default)	Smooth sinusoidal peak
`trough`	Triangular trough
`impulse`	Narrow spike
`step`	Rectangular step

Parallel processing

Use -j to speed up large datasets by distributing genes across CPUs:

bootejtk-calcp -f example/TestInput4.txt -r 2 -z 50 -j 8

`-j` value	Behaviour
`1` (default)	Sequential, single process
`N > 1`	Use N worker processes
`0`	Use all available CPUs

Input format

Tab-delimited text file. The header row must start with # or ID; subsequent columns are zeitgeber time labels. Each data row begins with a gene/feature identifier.

The preferred header format encodes both the timepoint and replicate number:

#	ZT00_1	ZT00_2	ZT02_1	ZT02_2	ZT04_1	ZT04_2	...
gene1	1.23	1.31	2.45	2.38	3.10	3.05	...
gene2	5.01	4.95	4.87	4.91	3.92	4.01	...

Column labels follow the pattern ZT{HH}_{rep} where HH is the zero-padded hour (e.g. 00, 02, 04) and rep is the replicate number. This format is shared with LIMBR and PIRS, so output from those tools can be passed directly to BooteJTK without reformatting.

The legacy formats ZT0, ZT2, CT0, CT2 (no replicate suffix) remain fully supported for backwards compatibility, as do decimal values (e.g. ZT14.7). Time labels do not need to be evenly spaced.

Output files

Running bootejtk-calcp produces output files prefixed with the value passed to -x:

File	Contents
`*_GammaP.txt`	BooteJTK output with Gamma-fitted p-values
`*.txt`	Main BooteJTK output (best-matching waveform per gene, feeds into CalcP)
`*_NULL1000.txt`	Randomly generated null time series used to fit the null tau distribution
`*_order_probs.pkl`	(requires `-W`) Pickle: per-gene `[means, stds, ns]` and rank-order bootstrap frequencies
`*_order_probs_vars.pkl`	(requires `-W`) Pickle: per-gene tau and phase probability distributions

Running the example command on an already-existing output directory appends _1 to output filenames.

FAQ

Can I use non-integer or uneven time intervals (e.g. ZT14.7)? Yes. The label just needs to start with ZT or CT; decimal values are read correctly.

Does BooteJTK handle uneven sampling intervals? Yes. All timepoints in the header are used as given.

Why does BooteJTK report phases like 14.4 that don't match my sampling intervals? BooteJTK runs bootstrap resamplings and reports the mean phase across those resamplings. For example, if 8 of 10 resamplings give phase 14 and 2 give phase 16, the reported mean phase is 14.4.

Do the phase/asymmetry search intervals need to match the sampling intervals? No. You can sample every hour but only search for phases every two hours, for example.

Development

git clone https://github.com/aleccrowell/BooteJTK-c
cd BooteJTK-c
pip install poetry
poetry install
poetry run pytest tests/ -v

License

Released under the MIT License. See LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.2.0

Apr 23, 2026

1.1.0

Apr 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bootjtk-1.2.0.tar.gz (58.3 kB view details)

Uploaded Apr 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bootjtk-1.2.0-py3-none-any.whl (60.1 kB view details)

Uploaded Apr 23, 2026 Python 3

File details

Details for the file bootjtk-1.2.0.tar.gz.

File metadata

Download URL: bootjtk-1.2.0.tar.gz
Upload date: Apr 23, 2026
Size: 58.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.3 CPython/3.13.7 Linux/6.17.0-1011-raspi

File hashes

Hashes for bootjtk-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`ba28cf718484137520b1427bd03b46855ef0c48bdbdfc707eca1c72e9cc2a690`
MD5	`a7250513a22e6fb5782a494436fe48a1`
BLAKE2b-256	`0df0e5de75e0a21e2a0ce5c34692035228af647357e72be8434d1806af9df80a`

See more details on using hashes here.

File details

Details for the file bootjtk-1.2.0-py3-none-any.whl.

File metadata

Download URL: bootjtk-1.2.0-py3-none-any.whl
Upload date: Apr 23, 2026
Size: 60.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.3 CPython/3.13.7 Linux/6.17.0-1011-raspi

File hashes

Hashes for bootjtk-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`44a8cab7a03667ccf6d21f860393bc358be13501a955b455cb5dac1acedf3d81`
MD5	`202c64d8d243fba5adfcb5c189057ba7`
BLAKE2b-256	`4420880758be1af92913439e39bffa895651b2a85ea7cf6f7ea8031e642df8b3`

See more details on using hashes here.

bootjtk 1.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BooteJTK

Installation

Quick start

Usage

`bootejtk-calcp` — full pipeline

Preprocessing / variance shrinkage

`bootejtk` — core analysis only

Waveform shapes

Parallel processing

Input format

Output files

FAQ

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

bootjtk 1.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BooteJTK

Installation

Quick start

Usage

bootejtk-calcp — full pipeline

Preprocessing / variance shrinkage

bootejtk — core analysis only

Waveform shapes

Parallel processing

Input format

Output files

FAQ

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`bootejtk-calcp` — full pipeline

`bootejtk` — core analysis only