Skip to main content

a command executor with caching for data processing pipelines

Project description

Razel

Rust MIT CI

Deno module Python module

A command executor with caching. It is:

  • Fast: caching avoids repeated execution of commands which haven't changed
  • Reliable: commands are executed in a sandbox to detect missing dependencies
  • Easy to use: commands are specified using a high-level TypeScript or Python API and convenience functions/tasks are built-in
  • Made for: data processing pipelines with executables working on files and many dependencies between those

Razel is not the best choice for building software, especially there's no built-in support for compiler setup and header dependencies.

Getting Started

Use rustup to install Rust. Install protobuf-compiler. Clone and build Razel:

cargo install --locked --git https://github.com/reu-dev/razel.git

The native input format for Razel is a razel.jsonl file, see the example test/razel.jsonl.

razel exec -f test/razel.jsonl

The preferred way to create a razel.jsonl file is using one of the high-level APIs.

TypeScript API

Install Deno to use the TypeScript API. Run the example Deno script to create test/razel.jsonl and execute it with Razel:

deno run --allow-write=. --check test/deno.ts
razel exec -f test/razel.jsonl

Python API

The Python API requires Python >= 3.8. Run the example Python script to create test/razel.jsonl and execute it with Razel:

pip install razel
python3 test/python.py
razel exec -f test/razel.jsonl

Batch file

In addition to razel.jsonl, Razel can directly execute a batch file containing commands. Input and output files need to be specified, which is WIP.

Execute the example test/batch.sh with Razel:

razel exec -f test/batch.sh

Project Status

Razel is in active development and not ready for production. CLI and format of razel.jsonl will likely change.

OS Status Note
Linux stable, main development platform
Mac used and tested in CI
Windows (✓) tested in CI only
Feature Status Note
command execution in sandbox
multithreaded execution
local caching
remote caching WIP
remote execution TODO
OOM handling: retry with less concurrency ✓ Linux requires sudo cgcreate -a $USER -t $USER -g memory:razel

Why not ...?

  • Bazel is a multi-language build tool. However, for the use case Razel targets, there are some issues:
    • additional launcher script required for some simple tasks
      • using stdout of action as input for another action
      • parsing measurements from stdout of action
      • CTest features like FAIL_REGULAR_EXPRESSION, WILL_FAIL
    • difficult to get command lines for debugging
    • no automatic disk usage limit/cleanup for local cache - all temp output needs to fit on disk
    • no native support for response files
    • resources cannot be reserved to run real-time critical tests
    • content of bazel-bin/out directories is not defined (contains mixture of current build and cache)
  • CTest is nice for building C/C++ code and CTest can be used for testing, but it does not support caching and managing dependencies between tests is difficult.

Features

Measurements

Razel parses the stdout of executed commands to capture runtime measurements and writes them to razel-out/measurements.csv. Currently, the <CTestMeasurement> and <DartMeasurement> tags as used by CTest/CDash are supported:

<CTestMeasurement type="numeric/double" name="score">12.3</CTestMeasurement>
<CTestMeasurement type="text/string" name="result">ok</CTestMeasurement>

Supporting custom formats is planned.

Acknowledgements

The idea to build fast and correct is based on Bazel. Razel uses data structures from the Bazel Remote Execution API for caching.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

razel-0.1.1.tar.gz (5.9 kB view hashes)

Uploaded Source

Built Distribution

razel-0.1.1-py3-none-any.whl (6.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page