deterministic-zip wrapped for usage with pip and/or python.
Project description
deterministic-zip
Simple (almost drop-in) replacement for zip that produces deterministic files.
Features
- dropin for zip
- remove all metadata from files added
- immutable zip util
Installation
Automatic install
bash <(curl -sS https://raw.githubusercontent.com/timo-reymann/deterministic-zip/main/installer)
Manual
Linux (64-bit)
curl -LO https://github.com/timo-reymann/deterministic-zip/releases/download/$(curl -Lso /dev/null -w %{url_effective} https://github.com/timo-reymann/deterministic-zip/releases/latest | grep -o '[^/]*$')/deterministic-zip_linux-amd64 && \
chmod +x deterministic-zip_linux-amd64 && \
sudo mv deterministic-zip_linux-amd64 /usr/local/bin/deterministic-zip
Darwin (Intel)
brew
brew tap timo-reymann/deterministic-zip
brew install deterministic-zip
manual
curl -LO https://github.com/timo-reymann/deterministic-zip/releases/download/$(curl -Lso /dev/null -w %{url_effective} https://github.com/timo-reymann/deterministic-zip/releases/latest | grep -o '[^/]*$')/deterministic-zip_darwin-amd64 && \
chmod +x deterministic-zip_darwin-amd64 && \
sudo mv deterministic-zip_darwin-amd64 /usr/local/bin/deterministic-zip
Install with go
go install github.com/timo-reymann/deterministic-zip@latest
Install with pip(x)
Using pipx you can just use the following command use deterministic-zip as it is:
pipx install deterministic-zip-go
If you want to use it directly using the subprocess
module you can install it with pip:
pip install deterministic-zip-go
And use the package like this:
import subprocess
from deterministic_zip_go import exec
# Run process and prefix stdout and stderr
exec.exec_with_templated_output(["--help"])
# Create a subprocess, specifying how to handle stdout, stderr
exec.create_subprocess(["--help"], stdout=subprocess.PIPE, stderr=subprocess.PIPE)
# Perform command with suppressed output and return finished proces instance,
# on that one can also check if the call was successfully
exec.exec_silently(["--version"])
Docker
Please check the Containerized section in Usage for more details.
Supported platforms
The following platforms are supported (and have prebuilt binaries / ready to use integration):
- Linux
- 32-bit
- 64-bit
- ARM 64-bit
- ARM 32-bit
- Darwin
- 64-bit
- ARM (M1/M2)
- Windows
- ARM
- 32-bit
- 64-bit
- FreeBSD
- 32-bit
- 64-bit
- ARM 64-bit
- ARM 32-bit
- OpenBSD
- 32-bit
- 64-bit
- OCI compatible container engines (Docker, podman etc)
- ARM
- 64-bit
- CircleCI
- GitHub Actions
Where to find the latest release for your platform
Binaries
Binaries for all of these can be found on the latest release page.
Docker
For the docker image check the docker hub.
CI Provider
Usage
Command Line
If you installed the binary via Releases, Install-Script or using go you can just run deterministic-zip as a command.
deterministic-zip -h
Containerized
Please be aware that the image contains just the binary, no OS, libs or anything else. It also runs as root to be able to zip files no matter the ownership, feel free to build your own images based on that as well.
Using the container directly
If you want to use the tool on a platform not supported yet or dont want
to install the tool locally you can also mount your folder in
/workspace
which is the default working directory. Than you can just
execute commands as you want to.
docker run -v $PWD:/workspace timoreymann/deterministic-zip:latest
Integrating into your CI image
If you want to integrate the tool directly into your build image, you can also utilize the auto updates from tools like renovatebot or dependabot. Using docker built in features you can just get the binary directly from the image.
FROM base-image:tag
# do your customizations
COPY --from=timoreymann/deterministic-zip:latest /deterministic-zip /usr/bin/deterministic-zip
Motivation
Why another zip-tool? What is this deterministic stuff?!
When we are talking about deterministic it means that the hash of the zip file won't change unless the contents of the zip file changes.
This means only the content, no metadata. You can achieve this with zip, yes.
The problem that still remains is that the order is almost unpredictable and zip is very platform specific, so you will end up with a bunch of crazy shell pipelines. And I am not even talking about windows at this point.
So this is where this tool comes in, it is intended to be a drop-in replacement for zip in your build process.
The use cases for this are primary:
- Zipping serverless code
- Backups or other files that get rsynced
Want to know more about the topic of deterministic/reproducible builds?
I can recommend the following resources:
Documentation
How reliable is it?
Of course, it is not as reliable as the battle-proven and billions of times executed zip.
Even though I am heavily relying on the go stdlib this software can of course have bugs. And you are welcome to report them and help make this even more stable. Of course there will be tests to cover most use cases but at the end this is still starting from scratch, so if you need advanced features or just dont feel comfortable about using this tool don't do it!
Differences between zip and deterministic-zip
Please see docs/differences
Contributing
I love your input! I want to make contributing to this project as easy and transparent as possible, whether it's:
- Reporting a bug
- Discussing the current state of the configuration
- Submitting a fix
- Proposing new features
- Becoming a maintainer
To get started please read the Contribution Guidelines.
Development
Requirements
Test
make test-coverage-report
Build
make build
Alternatives
As far as I know the following (GitHub) projects exist:
- bboe/deterministic_zip (Python)
- You must list files explicitly
- Changed order -> changed zip
- You will need to install Python (no problem on Linux/Mac) and the package
- bitgenics/deterministic-zip (NodeJS/JavaScript)
- Support for globs and ignores order
- You need to install node.js, the package, and it has no cli interface
- orf/deterministic-zip (Rust)
- has prebuilt binaries for all relevant platforms (and other can be built easily)
- very basic, but you can customize compression (nice feature)
All in all they are just simply not what I needed. My favourite is Rust, because its just simply dropping in a binary. Something that's very convenient especially when it comes to Docker builds.
The main problem that all these solutions share is that it in my opinion cool things like excluding patterns, that I regularly use are simply not implemented, and i REALLY love glob patterns.
Credits
This whole project wouldnt be possible with the great work of the following libraries:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file deterministic_zip_go-3.0.1-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
.
File metadata
- Download URL: deterministic_zip_go-3.0.1-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
- Upload date:
- Size: 915.0 kB
- Tags: Python 3, manylinux: glibc 2.12+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e490214e144bd90cf8f421d74c536ad11a30bc36f21cd893ba7e252d92a0b602 |
|
MD5 | b4c7300f8890db484b811f01aadc7cf0 |
|
BLAKE2b-256 | 3b52d0125cf7b1ebcbbf6bbd63c978d98c5d15690829f68d03c8cacf52aeda0f |
File details
Details for the file deterministic_zip_go-3.0.1-py3-none-macosx_11_0_arm64.whl
.
File metadata
- Download URL: deterministic_zip_go-3.0.1-py3-none-macosx_11_0_arm64.whl
- Upload date:
- Size: 887.0 kB
- Tags: Python 3, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe0b5bbc33fe248d1864c8028ecf530d3d523fb60e196980c94a99a44a502598 |
|
MD5 | 1f09c801727f62961f3cad959c8a92cc |
|
BLAKE2b-256 | a1c50fd563f9c410acae13ede87314da5661211e10278d8d21a6049cf3ddae2d |
File details
Details for the file deterministic_zip_go-3.0.1-py3-none-macosx_10_9_x86_64.whl
.
File metadata
- Download URL: deterministic_zip_go-3.0.1-py3-none-macosx_10_9_x86_64.whl
- Upload date:
- Size: 929.4 kB
- Tags: Python 3, macOS 10.9+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ae6d5b86e9faf89fed8fc48d92b0e730336a552fdaca4063986d08f3f3954234 |
|
MD5 | bda38c58dd28c6e614288b70df94f893 |
|
BLAKE2b-256 | ab00f9d70538aa264990bf4cb8a49db006f61cb4551de5baa7719f923e87e56c |