Tools to create matrices from data packages

These details have not been verified by PyPI

Project links

Project description

matrix_utils

Library for building matrices from data packages from bw_processing. Designed for use with the Brightway life cycle assessment framework.

Background
Install
Usage
API
Contributing
Maintainers
License

Background

The calculation library of the Brightway LCA framework has traditionally include matrix-building functionality. As the new capabilities in bw_processing have increased matrix-building complexity, this library is a refactoring to split matrix utilities from the LCA classes, which will remain in the calculation library.

matrix_utils supports all the features made available in bw_processing: static and dynamic resources, data package policies, vector and array resources. It also improves on the previous matrix building code by speeding up the mapping from data source ids to row and column ids.

Backwards compatibility

This library presents a completely different API than the functions previously present in bw2calc. Most ideas become easier, or even possible; however, some things are more complicated. In particular, the notion that we have a single array that defines a matrix is no longer true - a matrix can be defined by many input arrays, and they can interact with each other (either adding to existing matrix values or replacing them altogether).

Install

Install using pip or conda (channel cmutel).

Depends on numpy, scipy, pandas, bw_processing, stats_arrays.

Usage

`MappedMatrix` class

The primary use case for matrix_utils is the MappedMatrix class:

In [1]: from matrix_utils import MappedMatrix

In [2]: mm = MappedMatrix(packages=[some_datapackage], matrix="foo")

In [3]: mm.matrix
Out[3]:
<8x8 sparse matrix of type '<class 'numpy.float32'>'
    with 11 stored elements in Compressed Sparse Row format>

MappedMatrix takes the following arguments. Note that all arguments must be keyword arguments:

packages: list, required. List of bw_processing data packages. The packages must be instantiated, you can't give file locations of PyFilesystem2 file systems.
matrix: str, required. Label of matrix to build. Used to filter data in packages, so must be identical to the matrix value in the package(s).
use_vectors: bool, default True. Include vector data resources when building matrices.
use_arrays: bool, default True. Include array data resources when building matrices. Note that each data package resource group can only provide either vector or array data.
use_distributions: bool, default False. Include probability distribution data resources when building matrices.
row_mapper: matrix_utils.ArrayMapper, default None. Optional mapping class used to translate data source ids to matrix row indices. In LCA, one would reuse this mapping class to make sure the dimensions of multiple matrices align.
col_mapper: matrix_utils.ArrayMapper, default None. Optional mapping class used to translate data source ids to matrix column indices. In LCA, one would reuse this mapping class to make sure the dimensions of multiple matrices align.
seed_override: int, default None. Override the random seed given in the data package. Note that this is ignored if the data package is combinatorial.

MappedMatrix is iterable; calling next() will draw new samples from all included stochastic resources, and rebuild the matrix.

Indexers

Each datapackage is assigned exactly one indexer, shared across all its resource groups. This indexer determines which column of an array resource is used on each iteration. The design assumption is one indexer per datapackage — MappedMatrix.indexers returns a {name: indexer} dict at that level. MappedMatrix.local_indexers returns a {group_label: indexer} dict for what each resource group is actually using, which will normally be the same object (or a Proxy wrapping it for combinatorial packages). You may replace group.indexer on any individual group after construction, but you are then responsible for the consequences — in particular, next() only advances the package-level indexers, so any custom group-level indexer must be advanced manually.

NaN as a sentinel value

A NaN value in a data vector or array is treated as "no data insertion" — that element is skipped when the matrix is built or rebuilt, leaving the matrix cell at its current value. This is useful for scenario or override packages: place NaN at positions you want to inherit from an earlier package, and a real number where you want to override. Non-NaN elements behave normally. ResourceGroup.data_current still carries the raw NaN so input_data_vector() reflects the actual datapackage contents.

You may also find it useful to iterate through MappedMatrix.groups, which are instances of ResourceGroup, documented below.

`ResourceGroup` class

A bw_processing data package is essentially a metadata file and a bag of data resources. These resources are grouped, for multiple resources are needed to build one matrix, or one component of one matrix. For example, one needs not only the data vector, but also the row and column indices to build a simple matrix. One could also have a flip vector (to negate signs before insertion) or a scale vector (to apply a per-exchange multiplicative factor before insertion, e.g. for allocation or unit conversion).

The ResourceGroup class provides a single interface to these data files and their metadata. ResourceGroup instances are created automatically by MappedMatrix, and available via MappedMatrix.groups. The source code is pretty readable, and in general you probably don't need to worry about this low-level class, but the following could be useful:

ResourceGroup.data_original: The data as it is present in the datapackage, before masking (i.e. the Numpy data vector or array, or the data interface). This is the raw input data, duplicate elements are not aggregated (if applicable).
ResourceGroup.data_current: The data sample used (before aggregation) to build the matrix. It is masked, flipped (if a flip array is present), and scaled (if a scale array is present).
ResourceGroup.scale: The scale array with all masks applied. Raises KeyError if no scale array is present.
ResourceGroup.row|col_mapped: Mapped row and column indices. Has the same length as the datapackage resource, but uses -1 for values which weren't mapped.
ResourceGroup.row|col_masked: The data after the custom filter and mapping mask have been applied.
ResourceGroup.row|col_matrix: Row and column indices (but not data) for insertion into the matrix. These indices are after aggregation within the resource group (if any).
ResourceGroup.calculate(vector=None): Function to recalculate matrix row, column, and data vectors. Uses the current state of the indexers, but re-draws values from data iterators. If vector is given, use this instead of the given data source.
ResourceGroup.indexer: The instance of the Indexer class applicable for this ResourceGroup. Only used for data arrays. By design each datapackage has one indexer shared across all its resource groups; this is set up automatically by MappedMatrix. You may assign a different indexer to individual groups after construction, but you are then responsible for keeping them consistent (e.g. ensuring they advance together when next() is called).
ResourceGroup.ncols: The integer number of columns in a data array. Returns None if a data vector is present.

Contributing

Your contribution is welcome! Please follow the pull request workflow, even for minor changes.

When contributing to this repository with a major change, please first discuss the change you wish to make via issue, email, or any other method with the owners of this repository.

Please note we have a code of conduct, please follow it in all your interactions with the project.

Documentation and coding standards

Maintainers

Chris Mutel

License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.9

Jul 12, 2026

0.8

Jun 4, 2026

0.7.1

Mar 20, 2026

0.7.0

Feb 27, 2026

0.6.3

Feb 23, 2026

0.6.2

Aug 6, 2025

0.6

Nov 1, 2024

0.5

Aug 20, 2024

0.4.3

Jun 4, 2024

0.4.2

Jun 4, 2024

0.4.1

Dec 18, 2023

0.4

Dec 7, 2023

0.3

Oct 25, 2023

0.2.5

May 21, 2022

0.2.4

May 15, 2022

0.2.3

Feb 2, 2022

0.2.2

Nov 26, 2021

0.2.1

Nov 26, 2021

0.2

Nov 25, 2021

0.1.4

Oct 20, 2021

0.1.3

Jun 16, 2021

0.1.2

Jun 15, 2021

0.1.1

Jun 2, 2021

0.1

May 19, 2021

0.1.dev0 pre-release

Feb 3, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

matrix_utils-0.9.tar.gz (1.0 MB view details)

Uploaded Jul 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

matrix_utils-0.9-py3-none-any.whl (23.7 kB view details)

Uploaded Jul 12, 2026 Python 3

File details

Details for the file matrix_utils-0.9.tar.gz.

File metadata

Download URL: matrix_utils-0.9.tar.gz
Upload date: Jul 12, 2026
Size: 1.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for matrix_utils-0.9.tar.gz
Algorithm	Hash digest
SHA256	`a785b5c113a371e64c2fdc6ab7b719ac80dd4ec5388e74ed3eaebd941d27d28d`
MD5	`c42fd8004a8ba545c7da499df5493cfd`
BLAKE2b-256	`892fb6dd64767e7f0c276e05a1898664caee969aabc31792d62738dad06fed96`

See more details on using hashes here.

File details

Details for the file matrix_utils-0.9-py3-none-any.whl.

File metadata

Download URL: matrix_utils-0.9-py3-none-any.whl
Upload date: Jul 12, 2026
Size: 23.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for matrix_utils-0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3a41d86675766dd67744128564e066b6c8161edd5ceff51e707064e5a975fe3d`
MD5	`9b030279396b5ccaef488580f521d76e`
BLAKE2b-256	`a892af9471c3494183123eea85e6518d4b717eaa3bf80f17d3771b479962628b`

See more details on using hashes here.

matrix-utils 0.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

matrix_utils

Table of Contents

Background

Backwards compatibility

Install

Usage

`MappedMatrix` class

Indexers

NaN as a sentinel value

`ResourceGroup` class

Contributing

Documentation and coding standards

Maintainers

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

matrix-utils 0.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

matrix_utils

Table of Contents

Background

Backwards compatibility

Install

Usage

MappedMatrix class

Indexers

NaN as a sentinel value

ResourceGroup class

Contributing

Documentation and coding standards

Maintainers

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`MappedMatrix` class

`ResourceGroup` class