a command executor with caching for data processing pipelines
Project description
Razel
A command executor with caching. It is:
- Fast: caching avoids repeated execution of commands which haven't changed
- Reliable: commands are executed in a sandbox to detect missing dependencies
- Easy to use: commands are specified using a high-level TypeScript or Python API and convenience functions/tasks are built-in
- Made for: data processing pipelines with executables working on files and many dependencies between those
Razel is not the best choice for building software, especially there's no built-in support for compiler setup and header dependencies.
Getting Started
Use rustup to install Rust. Install protobuf-compiler
. Clone and build Razel:
cargo install --locked --git https://github.com/reu-dev/razel.git
The native input format for Razel is a razel.jsonl
file, see the example test/razel.jsonl.
razel exec -f test/razel.jsonl
The preferred way to create a razel.jsonl
file is using one of the high-level APIs.
TypeScript API
Install Deno to use the TypeScript API.
Run the example Deno script to create test/razel.jsonl
and execute it with Razel:
deno run --allow-write=. --check test/deno.ts
razel exec -f test/razel.jsonl
Python API
The Python API requires Python >= 3.8.
Run the example Python script to create test/razel.jsonl
and execute it with Razel:
pip install razel
python3 test/python.py
razel exec -f test/razel.jsonl
Batch file
In addition to razel.jsonl
, Razel can directly execute a batch file containing commands.
Input and output files need to be specified, which is WIP.
Execute the example test/batch.sh with Razel:
razel exec -f test/batch.sh
Project Status
Razel is in active development and not ready for production. CLI and format of razel.jsonl
will likely change.
OS | Status | Note |
---|---|---|
Linux | ✓ | stable, main development platform |
Mac | ✓ | used and tested in CI |
Windows | (✓) | tested in CI only |
Feature | Status | Note |
---|---|---|
command execution in sandbox | ✓ | |
multithreaded execution | ✓ | |
local caching | ✓ | |
remote caching | ✘ | WIP |
remote execution | ✘ | TODO |
OOM handling: retry with less concurrency | ✓ Linux | requires sudo cgcreate -a $USER -t $USER -g memory:razel |
Why not ...?
- Bazel is a multi-language build tool. However, for the use case Razel targets, there are some
issues:
- additional launcher script required for some simple tasks
- using stdout of action as input for another action
- parsing measurements from stdout of action
- CTest features like FAIL_REGULAR_EXPRESSION, WILL_FAIL
- difficult to get command lines for debugging
- no automatic disk usage limit/cleanup for local cache - all temp output needs to fit on disk
- no native support for response files
- resources cannot be reserved to run real-time critical tests
- content of bazel-bin/out directories is not defined (contains mixture of current build and cache)
- additional launcher script required for some simple tasks
- CTest is nice for building C/C++ code and CTest can be used for testing, but it does not support caching and managing dependencies between tests is difficult.
Features
Measurements
Razel parses the stdout of executed commands to capture runtime measurements and writes them to razel-out/measurements.csv
.
Currently, the <CTestMeasurement>
and <DartMeasurement>
tags as used by CTest/CDash are supported:
<CTestMeasurement type="numeric/double" name="score">12.3</CTestMeasurement>
<CTestMeasurement type="text/string" name="result">ok</CTestMeasurement>
Supporting custom formats is planned.
Acknowledgements
The idea to build fast and correct is based on Bazel. Razel uses data structures from the Bazel Remote Execution API for caching.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.