Project description

Urgap is a cloud-native framework for file-based data engineering. It provides abstraction layers for data and metadata, extensive re-run skipping logic, and data versioning. Urgap can be integrated with any scheduling or pipelining tool, which keeps pipeline development independent of business logic and data storage, and its standardized logging and execution make monitoring and debugging easier.

Urgap provides the governance constraints required for decentralized data domain autonomy: it enforces shared, common data IO for storage, a common metadata capturing process in the form of an interface that can be plugged into any existing process, and global data lineage.
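To make the idea of re-run skipping more concrete, here is a minimal sketch in plain Python. It is not urgap's implementation and does not use urgap's API; the names `fingerprint` and `run_if_needed` and the `.fingerprint` marker file are hypothetical and chosen only for illustration. The assumption is that a step can be skipped when its input files and parameters hash to the same value as in the previous run.

```python
import hashlib
import json
from pathlib import Path


def fingerprint(input_paths, params):
    """Hash the input file names and contents together with the step parameters."""
    digest = hashlib.sha256()
    for path in sorted(Path(p) for p in input_paths):
        digest.update(path.name.encode("utf-8"))
        digest.update(path.read_bytes())
    # sort_keys makes the hash independent of dict ordering
    digest.update(json.dumps(params, sort_keys=True).encode("utf-8"))
    return digest.hexdigest()


def run_if_needed(step, input_paths, output_path, params):
    """Run `step` unless an earlier run with the same fingerprint exists.

    Returns True if the step ran and False if it was skipped.
    """
    output_path = Path(output_path)
    # Hypothetical convention: store the fingerprint next to the output file.
    marker = output_path.with_name(output_path.name + ".fingerprint")
    current = fingerprint(input_paths, params)
    if output_path.exists() and marker.exists() and marker.read_text() == current:
        return False  # inputs and parameters unchanged: skip the re-run
    step(input_paths, output_path, params)
    marker.write_text(current)
    return True
```

In this sketch, changing either an input file or a parameter changes the fingerprint and triggers a re-run; a real framework would typically also account for code versions and upstream lineage.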

Learn More

Watch our introduction talk, "urgap - unified resource governance and data provenance" by Christian Fufezan, for a comprehensive overview of urgap's design and capabilities:

Watch the video

How to Set Up

Prerequisites

We recommend using a virtual environment for Python projects. This guide uses uv for dependency management.

Installation

Basic Installation (local file system access only):

uv pip install -e .

With Cloud Storage Support:

uv pip install -e ".[cloud]"

With All Optional Dependencies:

uv pip install -e ".[all]"

Available extras include:

  • cloud: Azure and Google Cloud storage backends

  • all: All optional dependencies
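After installing, it can be useful to confirm what ended up in the environment. The sketch below uses only the standard-library `importlib.metadata` module; the helper name `describe_install` is hypothetical. Note that `Provides-Extra` lists the extras the package declares, not the ones you actually installed, so treat the output as a sanity check rather than proof that the cloud backends are available.

```python
from importlib import metadata


def describe_install(dist_name="urgap"):
    """Return version and declared extras of an installed distribution, or None."""
    try:
        dist = metadata.distribution(dist_name)
    except metadata.PackageNotFoundError:
        return None
    # Extras declared in the package metadata (e.g. "cloud", "all")
    extras = dist.metadata.get_all("Provides-Extra") or []
    return {"version": dist.version, "extras": sorted(extras)}


if __name__ == "__main__":
    info = describe_install()
    if info is None:
        print("urgap is not installed in this environment")
    else:
        print(f"urgap {info['version']}, declared extras: {info['extras']}")
```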

Running Tests

Install test dependencies:

uv pip install pytest

Run the test suite:

pytest tests

Quickstart: Writing Your First Pipeline

The best way to learn urgap is through a complete example. Check out the end-to-end filter CSV pipeline:

  • Location: tests/integrationtests/end2end/test_filter_csv_pipeline.py

  • What it demonstrates: Complete pipeline setup, node configuration, and execution

  • Requirements: Everything needed to run this example is included in the repository

This example can be run entirely on your local machine without any external dependencies.

To run the example:

pytest tests/integrationtests/end2end/test_filter_csv_pipeline.py

Documentation

The documentation is built with Sphinx; the sources are in the docs folder.

Download files

Source Distribution

urgap-3.3.10.tar.gz (330.9 kB)

Built Distribution

urgap-3.3.10-py2.py3-none-any.whl (398.6 kB, tagged Python 2 and Python 3)

File details

Details for the file urgap-3.3.10.tar.gz.

File metadata

  • Download URL: urgap-3.3.10.tar.gz
  • Size: 330.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for urgap-3.3.10.tar.gz:

  Algorithm    Hash digest
  SHA256       fba8e9bdb3c0f09c51389c30107a0a24774c31fcc31672d9ad53ce462b41a9b0
  MD5          d3b7d23a3d37d3d0513e3e627c0b3fb4
  BLAKE2b-256  a08776f719e4b09c7907d9100a8606fdf9a31f788fd96100ba2e89c77535d1c7
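
A downloaded file can be checked against the published digest before installing it. The following is a minimal sketch using the standard-library `hashlib` module; the helper names `sha256_of` and `matches` are hypothetical, and the local file path is an assumption about where you saved the download.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for urgap-3.3.10.tar.gz
EXPECTED_SHA256 = "fba8e9bdb3c0f09c51389c30107a0a24774c31fcc31672d9ad53ce462b41a9b0"


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected digest."""
    return sha256_of(path) == expected.lower()
```

For example, `matches("urgap-3.3.10.tar.gz")` should return True for an intact copy of the source distribution in the current directory.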

File details

Details for the file urgap-3.3.10-py2.py3-none-any.whl.

File metadata

  • Download URL: urgap-3.3.10-py2.py3-none-any.whl
  • Size: 398.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for urgap-3.3.10-py2.py3-none-any.whl:

  Algorithm    Hash digest
  SHA256       9859fae28bb04c06237177e2df929471c8890da57f2d7a4cdf074ef15773d812
  MD5          f6b1d72fb4c04f19aa904f00c7e6f6f3
  BLAKE2b-256  4a31e0a7ada032a5c4f1bb0eab852a0eef9f3fd44d9b82a662bc66bbf0e8d917
