A package for spectral-domain ghost imaging
Project description
Spooktroscopy: Spectral-Domain Ghost Imaging
Target Problem
Solve for $X$ from $(A \otimes G)X = B$ in the least-square manner, under regularizations. The full-index representation of the objective to minimize is $\sum_{iq}|\sum_{w,b}A_{iw}G_{bq}X_{wb} - B_{iq}|^2$ .
| Index | $i$ | $w$ | $q$ | $b$ |
|---|---|---|---|---|
| What does it number? | shot | photon energy bin | observed property | interested property |
| Example 0 | shot | photon energy bin | kinetic energy | kinetic energy |
| Example 1 | shot | photon energy bin | projected momentum | coefficient on a basis function |
G is an optional linear operator from a function of the interested property (discretized by index $b$) to a function of the observed property (discretized by index $q$), e.g. from a KE spectrum to a KE spectrum.
By default G=None, $G=I$ is the identity matrix. This is the most common use case of spooktroscopy, i.e. to solve $AX=B$ without any extra linear mapping.
When G accommodates a linear operation that's not identity mapping, the two linear inversions are solved in a single step.
With mode='raw', pass in matrix $G_{bq}$ to G, and with mode='contracted', pass in matrix $\sum_qG_{bq}G_{b'q}$. For example, for Abel transform in Velocity Map Imaging, pBasex offers this G with loadG.
Key Advantages
The key advantages of this package are
- Efficient optimization: Contraction over shots is decoupled from optimization. It is recommended to input precontracted results when instantiating, or to save the caches using method
save_prectrof a solver created with raw inputs. Once instantiated, the precontracted results are cached to be reused every time solving with a different hyperparameter. Seespook.contraction_utils.adaptive_contraction. - Support dimension reduction on the dependent variable B, through basis functions in G. In that case, it is also recommended to contract (B,G) over the dependent variable space (index q) prior to instantiating a spook solver.
- Support multiple combinations of regularizations. See Solvers .
- Support time-dependent measurement (Beta): when each entry in the raw input A is a pair of (photon spectrum, delay bin), index w is the flattened axis of ($\omega, \tau$). In this case, the third smoothness hyperparameter is for the delay axis.
At the very bottom level, this package depends on either OSQP to solve a quadratic programming or LAPACK gesv through numpy.linalg.solve .
Installation
The stable version is on PyPI. Unfortunately in a different name.
pip install FDGI
Solvers
Different combinations of regularizations can lead to different forms of objective function. Solvers in package always formalize the specific problem into either a Quadratic Programming or a linear equation. Examples can be found in unit tests
| Nonnegativity | Sparsity | Smoothness | Solver | Notes |
|---|---|---|---|---|
| True | L1 | Quadratic | SpookPosL1 |
This solver can serve tasks like in Li et al |
| True | L2 squared | Quadratic | SpookPosL2 |
|
| False | L2 squared | Quadratic | SpookLinSolve |
This solver is so far the work-horse for SpookVMI |
| False | L1 | Quadratic | SpookL1 |
A family tree of solvers is in docs/figs/famtree.svg.
Quadratic Programming
For cases where it can be formalized into a Quadratic Programming , OSQP is the weight-lifter. The root numerical method is the alternating direction method of multipliers (ADMM). Looking into the solver settings of OSQP is always encouraged, but the default settings usually work fine for spook . If one needs to pass in settings, use spk._prob.update_settings(**osqp_kwargs) on your solver instance spk.
Linear Equation
A rare case that it can be formalized into a linear equation is the third line in the table above: no nonnegativity constraint, and the sparsity is L2 norm squared. This is implemented in SpookLinSolve , which calls numpy.linalg.solve or scipy.sparse.linalg.spsolve .
Regularizations
Common regularizations are the following three types, all of which optional, depending on what a prior knowledge one wants to enforce on the problem solving.
- Nonnegativity: To constrain $X\geq 0$ everywhere
- Sparsity: To penalize on $|X|_1$ or $|X|_2$
- Smoothness: To penalize on roughness of X , along the two dimensions indexed by $w$ and $b$ independently. For the dimension indexed by $w$, the form is fixed to the laplacian-square $|(L\otimes I) X|^2_2$ where $L$ is the laplacian. Roughness along the second dimension of X is customizable with input parameter
Bsmoother, which by default is laplacian squared too.
Sparsity and Smoothness are enforced through penalties in the total obejctive function, and the penalties are weighted by hyperparameters lsparse and lsmooth. lsmooth is a 2-tuple that weight roughness penalty along the two axes of X respectively. The hyperparameters can be passed in during instantiation and also updated afterwards. It is recommended to call method getXopt with the hyperparameter(s) to be updated, because it will update, solve, and return the optimal X in one step. Calling solve with the hyperparameter(s) to be updated and then calling getXopt() without input is effectively the same, and the problem will be solved once as long as there is no update.
Evaluating Terms of Objective Function
| Term | Method | Notes |
|---|---|---|
| $|(A \otimes G)X-B|_2$ | residueL2 |
For full docstring, check help(spk.residueL2), where spk is a solver instance. |
| $|L X|_2$ | smoothness |
By default, $L$ is a 2nd-order finite difference operator along dimension specified by input argument dim. For full docstring, check help(spk.smoothness). |
| $|X|_1$ or $|X|_2$ | sparsity |
Depends on solver class. |
- Method
residueL2is based on functionutils.calcL2fromContracted. This function evaluates the residual term from the precontracted results. Also seeXValidation.calc_residualdefined inxval.py. - Function
utils.calcL2fromContractedis also used in the cross-validation classXValidationto evaluate the residual on validation sets. Seexval.pyfor more details.
Normalization Convention
The entries in $A^TA, G^TG$ are preferred to be on the order of unity, because regularization-related quadratic form matrices have their entries around unity. The scale factors are set as
$$s_a=\sqrt{\frac{1}{N_w}\mathrm{tr}(A^TA)},s_g=\sqrt{\frac{1}{N_q}\mathrm{tr}(G^TG)}$$
where $N_w, N_q$ are the dimensions along the respective indices. $s_as_g$ is an accessible property of the solver AGscale. To normalize or not is controlled by parameter normalize in creating a solver.
By default normalize=True, i.e. self._AtA =$A^TA/s_a^2$, self._GtG =$G^TG/s_g^2$, and self._Bcontracted = $(A^T \otimes G^T)B/(s_as_g)$, and the normalization is not in-place starting from version 0.9.4 . In this case, the direct solution self.res is scaled as $s_as_g X_\mathrm{opt}$ , but the getXopt method returns the unscaled result of $X_\mathrm{opt}$. In v0.9.3 and before, the normalization is in-place, which means the first two arguments are modified in-place. In-place normalization saves memory but causes confusion when people reuses the precontracted results for other purposes after passing them to create a solver in the contracted mode. Therefore starting from v0.9.4, in-place normalization is only done when requested with normalize='inplace' in creating a solver.
With this normalization convention, the hyperparmeters whose associated regularization function is not quadratic in $X$ need scaling when changing the size of dataset. If $h(s X) = s^p h(X)$, then the scaling power of $h(X)$ is $p$, e.g. $p=2$ for quadratic terms. When duplicating the same dataset to $m$ times its original size, in order to keep the optimal point unchanged, we need to set $\lambda\mapsto m^{1-p/2} \lambda$. Therefore for $p=1$ regularization terms, the scaling is $\sqrt{m}$, and for $p=2$ terms they do not scale.
Citing
Please cite Wang et al 2023 when using this package in your publishable work. If SpookPosL1, SpookPosL2 or SpookL1 is used, we also strongly recommend to cite the original OSQP paper as suggested here.
Unit Tests
unittest/testPosL1.pyis a good example to play withSpookPosL1.unittest/testL1L2.ipynbincludes good examples to play withSpookPosL1,SpookPosL2, andSpookL1.unittest/test_resid_reg_eval.pyshowcases how to evaluate the residual and regularization terms of the objective function.
Dependencies
pip installation should manage the dependencies automatically. If not, check requirements.txt .
Acknowledgement
This work was supported by the U.S. Department of Energy (DOE), Office of Science, Office of Basic Energy Sciences (BES), Chemical Sciences, Geosciences, and Biosciences Division (CSGB).
Documentation
Full documentation is here. Alternatively, clone this repo, switch to gh-pages, and open docs/index.html in a browser.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file fdgi-1.0.3.tar.gz.
File metadata
- Download URL: fdgi-1.0.3.tar.gz
- Upload date:
- Size: 20.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
820a93f265fd774613fe90d74725f7f65d9afb4704b22b7bcc71f84e676c159b
|
|
| MD5 |
2f1a4d917e8cbcbefcecb4844b4baf75
|
|
| BLAKE2b-256 |
f0f228bd060baa159970addf05bf261e52480ceca8e4130335ec5d57684932f2
|
File details
Details for the file fdgi-1.0.3-py3-none-any.whl.
File metadata
- Download URL: fdgi-1.0.3-py3-none-any.whl
- Upload date:
- Size: 23.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
66273e197c05613c1afe9c08ea37a9610f21ac897e71e44af698fa3305f6be62
|
|
| MD5 |
879faf0933114128d60de677024cffda
|
|
| BLAKE2b-256 |
8ba6d86a06b871477e1ff96ac75f7c8a9df606bf096505b9773cae4d0a3e3e70
|