A Bayesian Model of the Diffuse Neutral Interstellar Medium
Project description
caribou_hi
A Bayesian Model of the Diffuse Neutral Interstellar Medium
caribou_hi is a Bayesian model of the diffuse neutral interstellar medium written in the bayes_spec spectral line modeling framework, which enables inference from observations of neutral hydrogen (HI) 21-cm emission and absorption spectra.
Read below to get started, and check out the tutorials and guides here: https://caribou-hi.readthedocs.io.
- Installation
- Notes on Physics & Radiative Transfer
- Models
- Syntax & Examples
- Issues and Contributing
- License and Copyright
Installation
Basic Installation
Install with pip in a conda virtual environment:
conda create --name caribou_hi -c conda-forge pymc cxx-compiler pip
conda activate caribou_hi
# Due to a bug in arviz, this fork is temporarily necessary
# See: https://github.com/arviz-devs/arviz/issues/2437
pip install git+https://github.com/tvwenger/arviz.git@plot_pair_reference_labels
pip install caribou_hi
Development Installation
Alternatively, download and unpack the latest release, or fork the repository and contribute to the development of caribou_hi!
Install in a conda virtual environment:
cd /path/to/caribou_hi
conda env create -f environment.yml
conda activate caribou_hi-dev
pip install -e .
Notes on Physics & Radiative Transfer
All models in caribou_hi apply the same physics and equations of radiative transfer.
The 21-cm excitation temperature (also called the spin temperature) is derived from the gas kinetic temperature, gas density, and Lyα photon density following Kim et al. (2014) equation 4.
Clouds are assumed to be homogenous and isothermal. The ratio of the column density to the volume density, both free parameters, thus determines the path length through the cloud. The non-thermal line broadening assumes a Larson law relationship.
The optical depth and radiative transfer prescriptions follow that of Marchal et al. (2019). By default, the clouds are ordered from nearest to farthest, so optical depth effects (i.e., self-absorption) may be present.
Notably, since these are forward models, we do not make assumptions regarding the optical depth. These effects are predicted by the model. There is one exception: the ordered argument, described below.
Models
The models provided by caribou_hi are implemented in the bayes_spec framework. bayes_spec assumes that the source of spectral line emission can be decomposed into a series of "clouds", each of which is defined by a set of model parameters. Here we define the models available in caribou_hi.
AbsorptionModel
Note that this model struggles to fit absorption-only data. We recommend sticking to EmissionModel and EmissionAbsorptionModel, so hopefully you have some emission data too!
AbsorptionModel is a model that predicts 21-cm absorption spectra. The SpecData key for this model must be absorption. The following diagram demonstrates the relationship between the free parameters (empty ellipses), deterministic quantities (rectangles), model predictions (filled ellipses), and observations (filled, round rectangles). Many of the parameters are internally normalized (and thus have names like _norm). The subsequent tables describe the model parameters in more detail.
Cloud Parametervariable |
Parameter | Units | Prior, where ($p_0, p_1, \dots$) = prior_{variable} |
Defaultprior_{variable} |
|---|---|---|---|---|
log10_NHI |
log10 HI column density | cm-2 |
$\log_{10}N_{\rm HI} \sim {\rm Normal}(\mu=p_0, \sigma=p_1)$ | [20.0, 1.0] |
log10_depth |
log10 line-of-sight depth | pc |
$\log_{10}d \sim {\rm Normal}(\mu=p_0, \sigma=p_1)$ | [1.0, 1.0] |
log10_pressure |
log10 pressure | K cm-3 |
$\log_{10}P_{\rm th} \sim {\rm Normal}(\mu=p_0, \sigma=p_1)$ | [3.0, 1.0] |
velocity |
Velocity (same reference frame as data) | km s-1 |
$V \sim {\rm Normal}(\mu=p_0, \sigma=p_1)$ | [0.0, 10.0] |
log10_nth_fwhm_1pc |
Non-thermal FWHM at 1 pc | km s-1 |
$\log_{10}\Delta V_{\rm 1 pc} \sim {\rm Normal}(\mu=p_0, \sigma=p_1)$ | [0.2, 0.1] |
depth_nth_fwhm_power |
Nonthermal size-linewidth power law index | unitless | $\alpha \sim {\rm Gamma}(\mu=p_0, \sigma=p_1)$ | [0.3, 0.1] |
Hyper Parametervariable |
Parameter | Units | Prior, where ($p_0, p_1, \dots$) = prior_{variable} |
Defaultprior_{variable} |
|---|---|---|---|---|
log10_n_alpha |
log10 Lyα photon density | cm-3 |
$\log_{10}n_\alpha \sim {\rm Normal}(\mu=p_0, \sigma=p_1)$ | [-6.0, 1.0] |
fwhm_L |
Lorentzian FWHM line width | km s-1 |
$\Delta V_{L} \sim {\rm HalfNormal}(\sigma=p)$ | None |
The parameters describing the non-thermal line broadening, log10_nth_fwhm_1pc and depth_nth_fwhm_power, are treated as cloud parameters by default and with hyper_depth_linewidth = False. With hyper_depth_linewidth = True, however, these parameters are treated as hyper parameters (and thus shared between all clouds). The later imposes a bias by assuming that all clouds share the same size-linewidth relationship.
EmissionModel
EmissionModel is similar to AbsorptionModel, except it predicts 21-cm emission brightness temperature spectra. The SpecData key for this model must be emission. EmissionModel takes an additional initialization argument, bg_temp, which is the assumed background brightness temperature (by default it is bg_temp=3.77, an estimate for the cosmic microwave background and Galactic synchrotron emission at 21-cm). The following diagram demonstrates the model.
EmissionAbsorptionModel
EmissionAbsorptionModel predicts both 21-cm emission (brightness temperature) and absorption (1-exp(-tau)) spectra. The SpecData keys must be emission and absorption. EmissionAbsorptionModel takes an additional initialization argument, bg_temp, which is the assumed background brightness temperature (by default it is bg_temp=3.77, an estimate for the cosmic microwave background and Galactic synchrotron emission at 21-cm). The following diagram demonstrates the model, and the subsequent table describe the additional model parameters.
Note that the filling factor, $f$, is unconstrained, so the reported column densities are really $N_{\rm HI}/f$.
Cloud Parametervariable |
Parameter | Units | Prior, where ($p_0, p_1, \dots$) = prior_{variable} |
Defaultprior_{variable} |
|---|---|---|---|---|
filling_factor |
Filling Factor | `` | $f \sim {\rm Uniform}(0, 1)$ | `` |
absorption_weight |
Absorption weight | `` | $w_\tau \sim {\rm Beta}(\alpha=0, \beta=1-f)$ | `` |
The filling_factor parameter accounts for beam dilution in the emission spectrum. The expected brightness temperature contribution from a cloud is multiplied by filling_factor, which takes values between zero and one.
The absorption_weight parameter accounts for the probability that a cloud that appears in the emission beam is not traced by the absorption sightline. The absorption optical depth of a cloud is multiplied by absorption_weight, which takes values between zero and one. The prior is chosen to require absorption_weight=1 when filling_factor=1.
ordered
An additional parameter to set_priors for these models is ordered. By default, this parameter is False, in which case the order of the clouds is from nearest to farthest. Sampling from these models can be challenging due to the labeling degeneracy: if the order of clouds does not matter (i.e., the emission is optically thin), then each Markov chain could decide on a different, equally-valid order of clouds.
If we assume that the emission is optically thin, then we can set ordered=True, in which case the order of clouds is restricted to be increasing with velocity. This assumption can drastically improve sampling efficiency. When ordered=True, the velocity prior is defined differently:
Cloud Parametervariable |
Parameter | Units | Prior, where ($p_0, p_1, \dots$) = prior_{variable} |
Defaultprior_{variable} |
|---|---|---|---|---|
velocity |
Velocity | km s-1 |
$V_i \sim p_0 + \sum_0^{i-1} V_i + {\rm Gamma}(\alpha=2, \beta=1.0/p_1)$ | [0.0, 1.0] |
fwhm_L
The velocity of a cloud can be challenging to identify when spectral lines are narrow and widely separated. We overcome this limitation by modeling the line profiles as a "pseudo-Voight" profile, which is a linear combination of a Gaussian and Lorentzian profile. The parameter fwhm_L is a latent hyper-parameter (shared among all clouds) that characterizes the width of the Lorentzian part of the line profile. When fwhm_L is zero, the line is perfectly Gaussian. This parameter produces line profile wings that may not be physical but nonetheless enable the optimization algorithms (i.e, MCMC) to converge more reliably and efficiently. Model solutions with non-zero fwhm_L should be scrutinized carefully. This feature can be turned off by supplying None (default) to prior_fwhm_L, in which case the line profiles are assumed Gaussian.
It can also be useful to give Variational Inference a head-start by initializing it evenly spread over the velocity prior. Supply start = {"velocity_norm", np.linspace(-3.0, 3.0, model.n_clouds)} to model.fit() or in init_kwargs to model.sample(). Check out the bayes_spec tutorial for an example.
Syntax & Examples
See the various tutorial notebooks under docs/source/notebooks. Tutorials and the full API are available here: https://caribou-hi.readthedocs.io.
Issues and Contributing
Anyone is welcome to submit issues or contribute to the development of this software via Github.
License and Copyright
Copyright (c) 2024 Trey Wenger
GNU General Public License v3 (GNU GPLv3)
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file caribou_hi-2.0.0.tar.gz.
File metadata
- Download URL: caribou_hi-2.0.0.tar.gz
- Upload date:
- Size: 38.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d37573a7cfc9c005213914480fb72e74bc0acb9bf67bf62a165b4ad3f7d4039a
|
|
| MD5 |
a0a4b5d68fc79b110f344f4f48d8be3f
|
|
| BLAKE2b-256 |
492f15152d1a8b36a02ddfddd5506472d42c00294d354375e0de6c7b52c5ec4a
|
Provenance
The following attestation bundles were made for caribou_hi-2.0.0.tar.gz:
Publisher:
publish.yml on tvwenger/caribou_hi
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
caribou_hi-2.0.0.tar.gz -
Subject digest:
d37573a7cfc9c005213914480fb72e74bc0acb9bf67bf62a165b4ad3f7d4039a - Sigstore transparency entry: 194949579
- Sigstore integration time:
-
Permalink:
tvwenger/caribou_hi@80b423eb8bdac03009774d3e10f6b0f59cbb03ba -
Branch / Tag:
refs/tags/v2.0.0 - Owner: https://github.com/tvwenger
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@80b423eb8bdac03009774d3e10f6b0f59cbb03ba -
Trigger Event:
release
-
Statement type:
File details
Details for the file caribou_hi-2.0.0-py3-none-any.whl.
File metadata
- Download URL: caribou_hi-2.0.0-py3-none-any.whl
- Upload date:
- Size: 16.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
72bec428dc9b4a02808f05f29b48265ead91bf5367e0da09c97e5016fae16259
|
|
| MD5 |
2f8386079033aa10296f2d769697ba96
|
|
| BLAKE2b-256 |
b0ee76adb704faa4688df736656d5bf7ec64792ca38b72aa85c2c63f03865b29
|
Provenance
The following attestation bundles were made for caribou_hi-2.0.0-py3-none-any.whl:
Publisher:
publish.yml on tvwenger/caribou_hi
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
caribou_hi-2.0.0-py3-none-any.whl -
Subject digest:
72bec428dc9b4a02808f05f29b48265ead91bf5367e0da09c97e5016fae16259 - Sigstore transparency entry: 194949585
- Sigstore integration time:
-
Permalink:
tvwenger/caribou_hi@80b423eb8bdac03009774d3e10f6b0f59cbb03ba -
Branch / Tag:
refs/tags/v2.0.0 - Owner: https://github.com/tvwenger
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@80b423eb8bdac03009774d3e10f6b0f59cbb03ba -
Trigger Event:
release
-
Statement type: