Skip to main content

GeoPops

Project description

GeoPops

Full documentation and tutorials coming soon!

GeoPops is a package for generating geographically and demographically realistic synthetic populations for any US Census location using publically available data. GeoPops is in development, and we welcome feedback in the issues folder. Population generation includes three steps:

  1. Generate individuals within households using combinatorial optimization (CO)
  2. Assign individuals to schools and workplace locations using enrollment data and commute flows
  3. Connect individuals within locations using graph algorithms

Resulting files include a list of agents with attributes (e.g., age, gender, income) and networks detailing their connections within home, school, workplace, and group quarters (e.g., correctional facilities, nursing homes) locations. GeoPops is meant to produce reasonable approximations of state and county population characteristics with granularity down to the Census Block Group (CBG). GeoPops builds on a previous package, GREASYPOP-CO, and incorporates the following changes:

  • All code wrapped in convenient Python package that can be pip installed
  • Compatibility with Census data beyond 2019
  • Automated data downloading
  • Users can adjust all config parameters from the front-end
  • Class for exporting files compatible with the agent-based modeling software Starsim

How to use

First, create a Julia environment with the dependencies listed below. It may be easiest to store the environment in the same folder you will use for output files. While called with Python commands, combinatorial optimization, school and workplace assignment, and network generation steps occur in Julia to decrease run time. Try running the following in the terminal.

curl -fsSL https://install.julialang.org | sh
juliaup add 1.9.0        # Install Julia 1.9.0
juliaup default 1.9.0    # Make 1.9.0 the default (optional)
julia +1.9.0 --version   # Run that version once
juliaup update           # Update installed versions
julia                    # Launch Julia and see version
Base.active_project()    # Get path where environment is located. Copy this - will need later
]                        # Enter package mode. prompt changes to "(@v1.9) pkg>"
add CSV@0.10.10          # Add required package versions
add DataFrames@1.5.0
add Graphs@1.8.0 
add InlineStrings@1.4.0 
add JSON@0.21.4
add StatsBase@0.33.21
add Distributions@0.25.112
add MatrixMarket@0.4.0
add ProportionalFitting@0.3.0
status                   # View list of packages

You'll also need a Python environment with the dependencies listed in the GeoPops pyproject.toml. Install GeoPops from PyPI.

pip install geopops

Next, obtain a Census API key here, which will be used for pulling Census data.

Now in a Python or Notebook script, create a dictionary of parameters. Default parameters are stored in a package file called config.json. Pass your dictionary into WriteConfig() to overwrite config.json with the parameters for your population of interest. Here's an example to for Howard County, MD.

pars_geopops = {'path': 'YOUR_OUTPUT_DIR', # designate folder for output files
                'census_api_key': "YOUR_CENSUS_API_KEY", 
                'julia_env_path': "YOUR_JULIA_ENV_PATH",
                'main_year': 2019, # year of data
                'geos': ["24027"], # state or county fips code of main geographical area
                'commute_states': ["24"], # fips of commute states to use
                'use_pums': ["24"]} # Same as commute_states
                
geopops.WriteConfig(**pars_geopops) # Overwrite config.json with your parameters

The commands below will create your popoulation and store files in the output directory defined above. Downloaded raw data files are stored in the subfolders census, geo, pums, school, and work. Files created in the preprocessing step are stored in the subfolder called processed. The population in jlse format is stored in the subfolder jlse. Export() outputs csv versions into the subfolder pop_export. ForStarsim() outputs files formated for use with Starsim into the subfolder pop_export/starsim.

geopops.DownloadData()          # Download all Census and other data sources
geopops.ProcessData()           # Preprocessing for next steps
geopops.RunJulia().run_all()    # Run Julia scripts (much faster than Python)
geopops.ForStarsim()            # Format people and networks for Starsim

Tutorials

For more detailed descriptions of each step, see the tutorials folder.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geopops-0.1.1.post4.tar.gz (3.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geopops-0.1.1.post4-py3-none-any.whl (3.2 MB view details)

Uploaded Python 3

File details

Details for the file geopops-0.1.1.post4.tar.gz.

File metadata

  • Download URL: geopops-0.1.1.post4.tar.gz
  • Upload date:
  • Size: 3.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.22

File hashes

Hashes for geopops-0.1.1.post4.tar.gz
Algorithm Hash digest
SHA256 646dad6eb4cdbad0bc2b0907d008b55392a7c569de80791b0cfbaa86242fa1ce
MD5 d4e6e1b516b202a84ae844dddb09b912
BLAKE2b-256 f3ac6ad75da74878b0bbdcc9b36abc8b1b31e2ef3ec1e1cea9735f556f8320cc

See more details on using hashes here.

File details

Details for the file geopops-0.1.1.post4-py3-none-any.whl.

File metadata

File hashes

Hashes for geopops-0.1.1.post4-py3-none-any.whl
Algorithm Hash digest
SHA256 e7282962db4d5e9e68cb31eef0dd2cc7c2e716d3880fe849cc3094712db018ec
MD5 40851827e3c1741f317e129bcef5fbf8
BLAKE2b-256 93c8c16ab61a6166998e5f94ce947c000f4e1505eee6446f1aaac54b377b6964

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page