Redistricting ensembles
Project description
rdaensemble
Redistricting ensembles
Methods
This project supports several methods for generating ensembles of redistricting plans:
- Random maps from random spanning trees (RMfRST)
- Random maps from random starting points (RMfRSP)
- ReCom -- TODO
- Sequential Monte Carlo (SMC) -- TODO
Input Files
The inputs for generating & scoring ensembles are:
from rdascore import load_data, load_shapes, load_graph, load_metadata
data: Dict[str, Dict[str, int | str]] = load_data(data_path)
shapes: Dict[str, Any] = load_shapes(shapes_path)
graph: Dict[str, List[str]] = load_graph(graph_path)
metadata: Dict[str, Any] = load_metadata(state_code, data_path)
The precinct data, shapes, and graphs are all available in the companion repository
rdabase
in the data
directory by state.
They are named NC_2020_data.csv
, NC_2020_shapes_simplified.json
, and NC_2020_graph.json
,
for example.
Theoretically, these inputs can come from any source, but for simplicity, reproducibility, and apples-to-apples comparisons,
it's best to use the input files in rdabase
.
Output Files
Ensembles are saved as JSON files.
A file contains metadata about the ensemble, including the method used to generate it,
and then a plans
key with a list of plans:
plans: List[Dict[str, str | float | Dict[str, int | str]]]
Each plan item has a name
(str
), an optional weight
(float
), and a
plan
(Dict[str, int | str]]
) which represents the assignments as
geoid: district_id key: value pairs.
Scores for the plans in an ensemble are saved as a CSV file, with one row per plan and one column per metric. The metrics are the same as those produced by rdatools/rdascore, except they also include the energy of the plan. The metric names are descriptive.
There is also a companion JSON file with metadata about the scores.
Naming Conventions
You can name ensemble and score files anything you want. To facilitate understanding the contents of these files without having to open them, we recommend the following the convention:
- Ensemble example:
NC20C_RMfRST_1000_plans.json
- Scores example:
NC20C_RMfRST_1000_scores.csv
where "NC" is the state code, "20" stands for the 2020 census cycle, "C" abbreviates "Congress" (as opposed to state upper or lower house), "RMfRST" is the method, 1000 is the number of plans in the ensemble, and "plans" and "scores" distinguish between the two types of files.
Note: The scores metadata file will be named the same as the scores file,
except it will end _metadata.json
instead of .csv
,
for example, NC20C_RMfRST_1000_scores_metadata.json
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file rdaensemble-1.0.3.tar.gz
.
File metadata
- Download URL: rdaensemble-1.0.3.tar.gz
- Upload date:
- Size: 5.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 111ddfe606340224bf48b7757c74509521694a142ad9dfbf0536b7c06385c333 |
|
MD5 | cbe90d027ab8725d9d94d41275dcb813 |
|
BLAKE2b-256 | 344eadeab32f7eede1844c477f0583ccb2b71c8291cd279d099ff79f2c75760c |
File details
Details for the file rdaensemble-1.0.3-py3-none-any.whl
.
File metadata
- Download URL: rdaensemble-1.0.3-py3-none-any.whl
- Upload date:
- Size: 5.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | eb3a1a6e40df2a1205d41fa8fe8e4fec006d391d0eab4dadfdaa345c1c52387c |
|
MD5 | 5d97b224cf60f6fe98108128a1d7c21f |
|
BLAKE2b-256 | bad88741014d472330f2bed1e089b12f3ba8898e02625ddb9f22f6d7f2961fcf |