Redistricting ensembles

rdaensemble

Methods

The code in this repository supports several methods for generating ensembles of redistricting plans (maps):

Random maps from random spanning trees (RMfRST)
Random maps from random starting points (RMfRSP)
Ensemble of maps using MCMC/ReCom (ReCom)
Ensemble of maps using Sequential Monte Carlo (SMC) <<< TODO

Input Files

The inputs for generating and scoring ensembles are loaded as follows:

from typing import Any, Dict, List

from rdascore import load_data, load_shapes, load_graph, load_metadata

# data_path, shapes_path, graph_path, and state_code (e.g., "NC") are supplied by the caller
data: Dict[str, Dict[str, int | str]] = load_data(data_path)
shapes: Dict[str, Any] = load_shapes(shapes_path)
graph: Dict[str, List[str]] = load_graph(graph_path)
metadata: Dict[str, Any] = load_metadata(state_code, data_path)

The precinct data, shapes, and graphs are all available in the companion repository rdatools/rdabase, in the data directory, organized by state. For North Carolina, for example, the files are named NC_2020_data.csv, NC_2020_shapes_simplified.json, and NC_2020_graph.json.

Theoretically, these inputs can come from any source, but for simplicity, reproducibility, and apples-to-apples comparisons, it's best to use the input files in rdabase.

Output Files

Ensembles are saved as JSON files. A file contains metadata about the ensemble, including the method used to generate it, and then a plans key with a list of plans:

plans: List[Dict[str, str | float | Dict[str, int | str]]]

Each plan item has a name (str), an optional weight (float), and a plan (Dict[str, int | str]), which represents the district assignments as geoid: district_id (key: value) pairs.
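As a minimal sketch of the layout described above, the following writes and re-reads a tiny ensemble file. It assumes the item keys are literally name, weight, and plan; the metadata keys (method, size), the geoids, and the file name are made up for illustration and may not match what the scripts in this repository actually emit.

```python
import json
import os
import tempfile
from typing import Any, Dict, List

# Hypothetical two-plan ensemble; geoids and metadata keys are illustrative only.
ensemble: Dict[str, Any] = {
    "method": "RMfRST",  # assumed metadata key
    "size": 2,  # assumed metadata key
    "plans": [
        {"name": "000", "weight": 1.0, "plan": {"A": 1, "B": 1, "C": 2}},
        {"name": "001", "plan": {"A": 1, "B": 2, "C": 2}},  # weight is optional
    ],
}

# Write the ensemble to a temporary file, then read it back.
path = os.path.join(tempfile.mkdtemp(), "XX20C_RMfRST_2_plans.json")
with open(path, "w") as f:
    json.dump(ensemble, f)

with open(path) as f:
    loaded: Dict[str, Any] = json.load(f)

plans: List[Dict[str, Any]] = loaded["plans"]


def districts_of(plan_item: Dict[str, Any]) -> Dict[Any, List[str]]:
    """Invert geoid -> district_id assignments into district_id -> sorted geoids."""
    by_district: Dict[Any, List[str]] = {}
    for geoid, district in plan_item["plan"].items():
        by_district.setdefault(district, []).append(geoid)
    return {d: sorted(g) for d, g in by_district.items()}


for item in plans:
    # item.get("weight") is None when the optional weight is absent
    print(item["name"], item.get("weight"), districts_of(item))
```

Inverting the assignments this way is only one convenient view; the file itself stores each plan as a flat geoid-to-district mapping.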

Scores for the plans in an ensemble are saved as a CSV file, with one row per plan and one column per metric. The metrics are the same as those produced by rdatools/rdascore, except they also include the energy of the plan. The metric names are descriptive.

When a scores CSV file is produced, a companion JSON file with metadata about the scoring is also generated.
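Because a scores file has one row per plan and one column per metric, it can be read with Python's standard csv module. The sketch below uses made-up in-memory data; the column names (map, energy, metric_a) and the helper functions are hypothetical, so check the header row of a real scores file for the actual metric names.

```python
import csv
import io
from statistics import mean
from typing import Dict, Iterable, List

# Made-up scores for three plans; the column names are hypothetical.
sample_csv = """map,energy,metric_a
000,1200.5,0.10
001,1100.0,0.20
002,1300.5,0.30
"""


def load_scores(text_stream: Iterable[str]) -> List[Dict[str, str]]:
    """Read a scores CSV into a list of rows, one dict per plan."""
    return list(csv.DictReader(text_stream))


def column_mean(rows: List[Dict[str, str]], column: str) -> float:
    """Average one metric column across all plans in the ensemble."""
    return mean(float(row[column]) for row in rows)


rows = load_scores(io.StringIO(sample_csv))
print(len(rows), "plans; mean energy:", column_mean(rows, "energy"))
```

For a file on disk, pass an open file handle (opened with newline="") to load_scores instead of the StringIO object.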

Naming Conventions

You can name ensemble and score files anything you want. To make the contents of these files clear without having to open them, we recommend the following convention:

Ensemble example: NC20C_RMfRST_1000_plans.json
Scores example: NC20C_RMfRST_1000_scores.csv

where "NC" is the state code, "20" stands for the 2020 census cycle, "C" abbreviates "Congress" (as opposed to state upper or lower house), "RMfRST" is the method, 1000 is the number of plans in the ensemble, and "plans" and "scores" distinguish between the two types of files.

Note: The scores metadata file has the same name as the scores file, except that it ends in _metadata.json instead of .csv, for example, NC20C_RMfRST_1000_scores_metadata.json.
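The convention above can be captured in a small helper. This is only an illustrative sketch: ensemble_file_names and its parameters are hypothetical and are not part of rdaensemble.

```python
from typing import Dict


def ensemble_file_names(
    state: str, cycle: str, chamber: str, method: str, size: int
) -> Dict[str, str]:
    """Build the recommended plans, scores, and scores-metadata file names."""
    base = f"{state}{cycle}{chamber}_{method}_{size}"
    scores = f"{base}_scores.csv"
    return {
        "plans": f"{base}_plans.json",
        "scores": scores,
        # Same name as the scores file, with ".csv" replaced by "_metadata.json"
        "scores_metadata": scores[: -len(".csv")] + "_metadata.json",
    }


names = ensemble_file_names("NC", "20", "C", "RMfRST", 1000)
for kind, file_name in names.items():
    print(f"{kind}: {file_name}")
```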

Usage

To generate an ensemble of 1,000 plans using the random maps from random spanning trees method (RMfRST), run:

scripts/rmfrst_ensemble.py \
--state NC \
--data ../rdabase/data/NC/NC_2020_data.csv \
--shapes ../rdabase/data/NC/NC_2020_shapes_simplified.json \
--graph ../rdabase/data/NC/NC_2020_graph.json \
--size 1000 \
--plans ~/iCloud/fileout/ensembles/NC20C_RMfRST_1000_plans.json \
--log ~/iCloud/fileout/ensembles/NC20C_RMfRST_1000_log.txt \
--no-debug

To score the resulting ensemble, run:

scripts/score_ensemble.py \
--state NC \
--plans ~/iCloud/fileout/ensembles/NC20C_RMfRST_1000_plans.json \
--data ../rdabase/data/NC/NC_2020_data.csv \
--shapes ../rdabase/data/NC/NC_2020_shapes_simplified.json \
--graph ../rdabase/data/NC/NC_2020_graph.json \
--scores ~/iCloud/fileout/ensembles/NC20C_RMfRST_1000_scores.csv \
--no-debug

To generate random maps from random starting points (RMfRSP) instead, use the rmfrsp_ensemble.py script. For ReCom, use the recom_ensemble.py script.

Note: Ensemble JSON files can be quite large and can exceed GitHub's 100 MB file size limit, so we recommend writing them to the ensembles directory, which is ignored by Git.
