PIE-Net: Probabilistic Intensity-Event Modeling for High Quality Event-Based Video Generation
Project description
PIE-Net
Probabilistic Intensity-Event Modeling for High-Quality Event-Based Video Generation
Turn asynchronous event camera streams into high-quality grayscale video in real time — with per-pixel uncertainty maps and a principled probabilistic formulation grounded in event camera physics.
Two models. One pip install. Ready for research and deployment.
Highlights
| Feature | Description |
|---|---|
| Probabilistic reconstruction | PIEM maps polarity events to intensity via a closed-form lognormal model |
| Uncertainty-aware | Every pixel gets a confidence map — useful for downstream robotics & vision |
| Real-time capable | 30+ FPS on modern GPUs; Lite variant for edge devices |
| Tiny footprint | 154K params (full) / 79K params (lite) — orders of magnitude smaller than competitors |
| Plug & play | Pretrained weights ship with the package — no manual download |
| Benchmark-ready | EVREAL configs included for ECD, MVSEC, and HQF |
| PIEM representation export | Full 5-channel learned representation for downstream tasks (following the idea of EvRepSL) |
Model Zoo
Two pretrained variants are included:
| PIE-Net | PIE-Net-Lite | |
|---|---|---|
| Encoder depth | 3 layers | 2 layers |
| Parameters | 154K | 79K |
| FLOPs @ 240×180 | 1.59G | 1.58G |
| Best for | Highest quality | Speed & edge deployment |
Benchmark performance (EVREAL eval)
Metrics from the shipped checkpoints on standard benchmarks:
PIE-Net
| Dataset | MSE ↓ | SSIM ↑ | LPIPS ↓ |
|---|---|---|---|
| IJRR (ECD) | 0.0257 | 0.6122 | 0.1957 |
| MVSEC | 0.0484 | 0.3798 | 0.4356 |
| HQF | 0.0204 | 0.6302 | 0.2248 |
PIE-Net-Lite
| Dataset | MSE ↓ | SSIM ↑ | LPIPS ↓ |
|---|---|---|---|
| IJRR (ECD) | 0.0221 | 0.6197 | 0.2079 |
| MVSEC | 0.0428 | 0.3889 | 0.4418 |
| HQF | 0.0267 | 0.5993 | 0.2494 |
PIE-Net leads on perceptual quality (LPIPS) and HQF. PIE-Net-Lite wins on IJRR MSE/SSIM with half the parameters — ideal when latency matters.
Installation
From PyPI (recommended)
pip install event-pienet
With optional dependencies
# Real-time event camera demo (DVS / DAVIS)
pip install event-pienet[realtime]
# Benchmark evaluation helpers
pip install event-pienet[eval]
# Everything
pip install event-pienet[all]
From source
git clone https://github.com/VincentQQu/pie-net.git
cd pie-net
pip install -e .
CUDA PyTorch
Install PyTorch with CUDA support first if you have a GPU:
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121
pip install event-pienet
Quick Start
Python API
import torch
from pie_net import load_model, load_model_lite, stack_piem_representation
# PIE-Net (full model — default)
model = load_model(pretrained=True, device="cuda")
model.eval()
# PIE-Net-Lite (faster, smaller)
lite = load_model_lite(pretrained=True, device="cuda")
# or: lite = load_model(variant="pie-net-lite", device="cuda")
events = torch.randn(1, 5, 180, 240).cuda() # [B, bins, H, W]
with torch.no_grad():
output = model(events)
# Reconstruction (backward-compatible keys)
frame = output["image"] # alias for mean_f1 — [1, 1, H, W]
uncertainty = output["var"] # alias for var_f1 — [1, 1, H, W]
# Full PIEM latent maps (for representation / downstream tasks)
mean_z = output["mean_exp_z"] # expected log-intensity change
var_z = output["var_exp_z"] # uncertainty of Z
k = output["k"] # PIEM scaling parameter
mean_f1 = output["mean_f1"] # reconstructed intensity
var_f1 = output["var_f1"] # frame uncertainty
# Stack into a 5-channel PIEM representation [1, 5, H, W]
piem_rep = stack_piem_representation(output)
model.reset_states() # call between sequences
Output keys (since v1.1.2):
| Key | Shape | Description |
|---|---|---|
mean_exp_z |
[B, 1, H, W] |
Expected log-intensity change (Z mean) |
var_exp_z |
[B, 1, H, W] |
Uncertainty of Z |
k |
[B, 1, H, W] |
Learned PIEM scaling parameter |
mean_f1 / image |
[B, 1, H, W] |
Reconstructed intensity frame |
var_f1 / var |
[B, 1, H, W] |
Per-pixel frame uncertainty |
Use stack_piem_representation(output) to combine the five PIEM maps into a single [B, 5, H, W] tensor — compatible with EvRepSL representation pipelines.
Real-time demo (event camera)
Connect a DVS/DAVIS camera and run:
# PIE-Net (default, best quality)
pie-net-demo
# PIE-Net-Lite (faster)
pie-net-demo --variant pie-net-lite
# Options
pie-net-demo --variant pie-net --no-visualize-voxel --use-amp --frame-interval 16
Or via the script:
python -m pie_net.demo --variant pie-net-lite
Press q to quit.
Method Overview
Event cameras do not capture full frames at fixed intervals. They asynchronously report pixel-level brightness changes as events (x, y, polarity, t). Given a previous intensity frame and the events that follow, PIE-Net reconstructs the next frame using Probabilistic Intensity-Event Mapping (PIEM).
Previous frame + Event stream → PIE-Net → Reconstructed next frame + uncertainty
Core idea
Events describe intensity changes: positive events mean brightness increased, negative events mean it decreased. By accumulating polarity-weighted events over time, we estimate how much each pixel's intensity has changed.
Real event data is noisy — thresholds vary across pixels and some events are unreliable. PIEM therefore models intensity change probabilistically, estimating both the reconstructed image and a per-pixel uncertainty map.
Probabilistic Intensity-Event Mapping (PIEM)
PIEM links events to frame reconstruction in three steps:
- Accumulate events — count positive and negative events per pixel to estimate log-intensity change
- Model uncertainty — treat event counts and thresholds as uncertain, yielding a latent change variable
Zwith meanμZand varianceσZ² - Reconstruct the next frame — apply the probabilistic intensity change to the previous (or refined) frame:
next frame ≈ previous frame × event-based intensity change
PIE-Net architecture
PIE-Net estimates the probabilistic variables required by PIEM. It has two main parts:
Probabilistic Event Priors Estimator (PEPE) — a dual-branch encoder that takes a voxel-grid event tensor and the previous intensity frame, fuses motion/change and appearance features, and outputs μZ, σZ², and a refined previous-frame representation.
Probabilistic Intensity-Event Mapper — applies PIEM to map μZ, σZ², and the refined frame to the final reconstruction.
Event Voxel Grid [B, 5, H, W] + Previous Frame [B, 1, H, W]
↓
Dual Stem (Event + Intensity) → Recurrent Encoder + MCSE
↓
Decoder + UGSG (uncertainty-guided skip gating)
↓
PIEM Head → Mean Intensity [B, 1, H, W] + Variance [B, 1, H, W]
Key components:
- MCSE — Modality-Conditioned Shared Encoder adapts to event vs. frame reliability
- UGSG — Uncertainty-Guided Skip Gating routes features by predicted confidence
- PUAR loss — Probabilistic Uncertainty-Aware Reconstruction penalizes confident wrong predictions more strongly than uncertain ones
Pipeline summary
1. Encode asynchronous events as voxel grids
2. Combine event data with the previous intensity frame
3. Estimate probabilistic intensity-change priors (PEPE)
4. Reconstruct the next frame via PIEM
5. Train with an uncertainty-aware reconstruction loss (PUAR)
Evaluation on Benchmarks
We recommend EVREAL for standardized evaluation.
git clone https://github.com/ercanburak/EVREAL.git && cd EVREAL
pip install event-pienet
cp /path/to/pie-net/config/method/PIENet.json config/method/
cp /path/to/pie-net/config/method/PIENetLite.json config/method/
cp /path/to/pie-net/pie_net/evreal_wrapper.py model/PIENet.py
# Evaluate both variants
python eval.py -m PIENet -c std -d ECD MVSEC HQF -qm mse ssim lpips
python eval.py -m PIENetLite -c std -d ECD MVSEC HQF -qm mse ssim lpips
Project Structure
pie-net/
├── pie_net/
│ ├── model.py # Architecture + load_model()
│ ├── demo.py # Real-time camera demo (CLI entry point)
│ ├── evreal_wrapper.py # EVREAL integration
│ └── pretrained/
│ ├── model.pth # PIE-Net (full)
│ └── model_lite.pth # PIE-Net-Lite
├── config/method/ # EVREAL method configs
├── examples/ # Usage examples
├── scripts/ # Legacy script aliases
├── pyproject.toml
└── README.md
Citation
PIE-Net is the next generation of E2HQV. If you use PIE-Net in your research, please cite:
@inproceedings{qu2024e2hqv,
title={E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep Learning},
author={Qu, Qiang and Shen, Yiran and Chen, Xiaoming and Chung, Yuk Ying and Liu, Tongliang},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
volume={38},
number={5},
pages={4632--4640},
year={2024}
}
@article{qu2024evrepsl,
title={EvRepSL: Event-Stream Representation via Self-Supervised Learning for Event-Based Vision},
author={Qu, Qiang and Chen, Xiaoming and Chung, Yuk Ying and Shen, Yiran},
journal={IEEE Transactions on Image Processing},
year={2024},
publisher={IEEE}
}
Acknowledgments
- EvRepSL — event representation toolkit (PIEM export)
- EVREAL — evaluation framework
- dv-processing — event camera I/O
License
MIT License — see LICENSE for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file event_pienet-1.1.3.tar.gz.
File metadata
- Download URL: event_pienet-1.1.3.tar.gz
- Upload date:
- Size: 964.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bdf3a1ad688086bf80123dc33006435f55fa8056d0dcf942b807816d6e9139fb
|
|
| MD5 |
74750c22e8f8ffe21ae700556848009d
|
|
| BLAKE2b-256 |
0b3683f5c7b9b744958ca12e480ac7aff1a0c55c04232e31c91ea542f5b46406
|
File details
Details for the file event_pienet-1.1.3-py3-none-any.whl.
File metadata
- Download URL: event_pienet-1.1.3-py3-none-any.whl
- Upload date:
- Size: 961.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.8.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
833e384708f2eb01b29ecf712c32ba49e0de69235ca2d5ead85e836cbfbf94e3
|
|
| MD5 |
3542cd26c5d63de894d193fc3899dd69
|
|
| BLAKE2b-256 |
0479e5122e1258c85df798517f6f7d3f2590d9a5b3eb3961f390e2230ef15988
|