Git-Managed Distributed Data Lake Framework
Project description
GitLake
GitLake is a distributed data lake management framework built on Git. It defines a file system optimized for ETLM tasks within a data lake environment. It also provides a CLI tool, gitlake, which offers users a Git-like experience for managing and sharing raw data files and running massively parallel compute tasks.
- Documentation: https://gitlake.readthedocs.io
- Website: https://www.gitlake.com
Download files
Filename | Size | File type | Python version |
---|---|---|---|
gitlake-0.0.7-py3-none-any.whl | 5.1 kB | Wheel | py3 |
gitlake-0.0.7.tar.gz | 2.6 kB | Source | None |