A greedy Python standalone application bundler
Project description
shenzi
shenzi helps you create standalone Python applications from your development virtual environment. Using shenzi, you can create standalone folders which can be distributed to any machine, and the application will work (even when python is not installed on the target system).
The python packaging problem
Given a development environment (a virtual environment), we want to produce a single directory containing ALL the dependencies that the application needs. Other languages like rust and go provide easy way to create statically linked executables, which makes them very easy to distribute.
Python struggles in this area mainly because of how flexible it is when it comes to delegating work to C code (shared libraries on your system).
Out in the wild, python libraries regularly links to shared libraries in your system:
- C Extensions
- loading shared libraries using
dlopenand equivalents
Even creating a development environment for some pip package might require you to install some system dependencies (a good example is weasyprint)
It becomes difficult to ship applications if we need to install system dependencies in target machines. Docker solves this problem by packaging everything in a single docker image.
shenzi does not compete with docker, if you can use docker, you should. shenzi is useful for shipping desktop applications.
Getting Started
First install shenzi in your virtual environment.
pip install shenzi
Initializing the workspace
If you have a project run using poetry, run
# only poetry package manager is supported
shenzi init
It will ask you some questions and generate shenzi_workspace.toml file. The TOML file looks like this.
# shenzi_workspace.toml
# all relative paths are relative to the directory containing this file
# you can add a list of binaries that your application calls
# something like calling aws cli. Shenzi would try to find all these in your path and add them to the distribution
binaries = ["tesseract"]
[packaging]
kind = "poetry"
config_file = "<relative-path-to-poetry.lock>"
# you can add the dependency groups you want in the distribution (dev, or other custom groups)
groups = ["main"]
[execution]
main = "<relative-path-to-main-python-script>"
Intercepting
You need to first configure shenzi to listen to all the imports that your python application makes. You can either do this by running your application in your development environment and testing it. Or running tests.
Running an application
In you main script, add the following lines
import os
if os.environ.get("SHENZI_INIT_DISCOVERY", "False") == "True":
from shenzi.discovery import shenzi_init_discovery
shenzi_init_discovery()
In pytest
If you are running tests in pytest, you can add this function in your root conftest.py
# root conftest.py
# this function is run by pytest in the beginning
def pytest_configure():
from shenzi.discover import shenzi_init_discovery
shenzi_init_discovery()
Run your application as you normally do/or run tests. shenzi will start intercepting all shared libraries that your code is importing.
You should run as much of your application code as possible, like running all the tests. This allows shenzi to detect every dependency linked to your application at runtime.
Once you stop the application, a file shenzi.json (called the manifest) will be dumped in the current directory. This file contains all the shared library loads that shenzi detected. It also contains some information about your virtual environment.
Now run the shenzi CLI with this manifest file
Building the application
From the directory containing shenzi_workspace.toml (your project's root directory), run this command:
RUST_LOG=INFO shenzi build ./shenzi.json
This can take a moment, after it is done, your application would be packaged in a dist folder.
You can ship this dist folder to any target machine and it should work out of the box. The only required dependency is bash.
Note: by default
shenziwould try to validate if some warnings are actually errors. It needs to scan the whole file system to do that, it would print a log like this:shenzi will now validate if any of your warnings are errors, this can take time (it will scan your whole file system). You can skip this by passing --skip-warning-checks. If you feel its taking too long, you can skip it by passing--skip-warning-checks. You should however, at least have one successful build with all warnings validated.
Run dist/bootstrap.sh to run your application.
# bootstrap.sh is the entrypoint for your application
# you can run this from any directory generally
bash dist/bootstrap.sh
Note that if you don't specify main file in your shenzi_workspace.toml, shenzi would try to dynamically query that file, this can be annoying if you are running tests, so setting the file in workspace config is useful.
Next steps
You should at least read the doc which describes the structure of shenzi.json here.
If you use this, feel free to raise an issue on any problem, I need feedback for this :)
How is this different?
I will add a small comparison to PyInstaller, which I feel is the most mature tool in the ecosystem.
From what I've seen, PyInstaller statically analyses your python code (and does some imports too) to create the smallest possible packaged application. It is smarter than shenzi.
shenziis much simpler. It tries to intercept all linker activity during runtime.- During packaging,
shenziwill faithfully analyze all dependencies in the same order as done by the linker. Following the linker might solve a class of edge cases (not proved though, for all I know, this algorithm might end up performing very poorly)
- During packaging,
- It also packages everything in your python path (all data+code in your site-packages).
- This makes
shenzifaster in some cases (where you have complex applications, as we do not do any static analysis), but slower in others (mainly if your virtual environment is huge, and not all dependencies are used by your application normally)
- This makes
Apart from that, there are some other internal differences that may or may not matter
- The structure of the final application (described here). It's slightly similar to how
pnpmorganizesnode_modulesas far as I'm aware. - The bootstrap script in
shenziis pretty a simple bash script, it simply sets up the correct Python environment variables and starts the interpreter. PyInstaller has a very sophisticated bootstrapping CLI written in C
Supported Platforms
Currently only Mac and Linux are supported.
The project is very new right now, I've tested it on Ubuntu 20.04 and MacOS Sequoia with Python 3.9
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file shenzi-0.0.3.tar.gz.
File metadata
- Download URL: shenzi-0.0.3.tar.gz
- Upload date:
- Size: 4.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a1fedec390d37d6af83fee37481291559ac1e9c5ab03f418cdd45a7bde980267
|
|
| MD5 |
db264a384bf8dbede9f039ed0816c327
|
|
| BLAKE2b-256 |
35a7c18dbc092cdaf1ee4c40cd7cc51c5e9fde96d3927e8f4f1d6d02cf0a2243
|
File details
Details for the file shenzi-0.0.3-py3-none-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: shenzi-0.0.3-py3-none-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 4.6 MB
- Tags: Python 3, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b42558ab46238f6e31a9875a615e7e5baefbc36b1e56143e8243a7f5732ebf5d
|
|
| MD5 |
19932f56c8e5d546a10cab75560a2b7e
|
|
| BLAKE2b-256 |
e8c4d620458ff5c675209b9bf4bcc50759c16100dc50073240966eebf52f57ec
|
File details
Details for the file shenzi-0.0.3-py3-none-macosx_10_9_universal2.whl.
File metadata
- Download URL: shenzi-0.0.3-py3-none-macosx_10_9_universal2.whl
- Upload date:
- Size: 4.0 MB
- Tags: Python 3, macOS 10.9+ universal2 (ARM64, x86-64)
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
13af92d4c0ed504c31a04fa5566409be0f3692321e8400dd4ba28bb4241b71b2
|
|
| MD5 |
a6b299d1eec2c7e80420d78e7a2836b2
|
|
| BLAKE2b-256 |
4bbfd05217dd43a87a0f615193efab888bd319a0782ae066fb79f13ded32e2bc
|