A set of utilities for creating and managing ETL Pipelines with pyspark.
Project description
Jorvik
Jorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.
Contribute
The Jorvik project welcomes your expertise and enthusiasm!
Writing code isn’t the only way to contribute. You can also:
- review pull requests
- suggest improvements through issues
- let us know your pain-points and repetitive tasks
- help us stay on top of new and old issues
- develop tutorials, videos, presentations, and other educational materials
See How to Contribute for instructions on setting up your local machine and opening your first Pull Request.
Getting Started.
Jorvik is available in Pypi and can be installed with pip
pip install jorvik
Packages:
- Storage: Interact with the storage layer
- Pipelines: Build and test etl pipelines with ease
- Data Lineage: Track data lineage
Examples:
See the full power of jorvik when all the features come together in the examples bellow:
Databricks
- Transactions: A multi step pipeline that creates customer statistics from customers and transaction data.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file jorvik-1.2.3-py3-none-any.whl.
File metadata
- Download URL: jorvik-1.2.3-py3-none-any.whl
- Upload date:
- Size: 35.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0010572b9a5cecc44bda19bf760bcc95e67c7bcb539e9b44a0ab9dd14d3521df
|
|
| MD5 |
0478db80696c2f8adcc8e318c082a069
|
|
| BLAKE2b-256 |
affd38221743c0912d40313637509590ef6384031c5dba5643898fad5bb4bc30
|