Skip to main content

Add your description here

Project description

CensusForge

[!WARNING]
This project is still in development and may change very quickly. I will add more functionality in the future but this contains the bare minimum to support the needs of the project.

CensusForge is a Python toolkit for retrieving data from the U.S. Census API while also leveraging a local SQLite metadata database for fast lookups, the SQLite database creation repo can be found in this link in GitHub. It simplifies working with Census datasets by providing a unified interface for:

  • Downloading and caching geographic files
  • Querying the Census API
  • Looking up dataset, variable, year, and geography metadata
  • Returning results as Polars or GeoPandas objects

CensusForge consists of two main classes:

  • DataPull – Handles local metadata queries and file downloads
  • CensusAPI – Extends DataPull and adds direct Census API querying

Installation

pip install CensusForge

Quick Start Example

The following example shows how to query the Census API using the CensusAPI class.

from CensusForge import CensusAPI

def main():
    ca = CensusAPI()
    print(
        ca.query(
            dataset="acs-acs1-pumspr",
            year=2019,
            params_list=["AGEP", "SCH", "SCHL", "HINCP", "PWGTP", "PUMA"],
        )
    )

if __name__ == "__main__":
    main()

Running the above will:

  1. Look up the dataset in the local metadata database
  2. Construct the correct Census API URL
  3. Fetch the API response
  4. Convert it to a Polars DataFrame

Project Structure

CensusForge/
│
├── CensusAPI.py       # CensusAPI and DataPull classes
├── database.db        # Local SQLite metadata database
├── jp_tools/          # Utility functions (e.g., file download helper)
│
├── data/              # Output directory for downloaded/cached files
└── README.md          # Project documentation

API Overview

CensusAPI

query(dataset, params_list, year, extra="") → pl.DataFrame

Query a Census dataset using any set of variables or geography parameters.

Example

ca.query(
    dataset="acs-acs1-pumspr",
    year=2019,
    params_list=["AGEP", "HINCP", "PUMA"],
    extra="&for=state:*"
)

Metadata Helpers (inherited from DataPull)

Method Description
get_database(id) Returns dataset name for ID
get_database_id(name) Returns dataset ID
get_year(id) Returns year for ID
get_year_id(year) Returns year ID
get_variable_id(name) Returns variable ID
get_geo_id(name) Returns geography type ID
get_geo_years(dataset_id, geo_id) Returns valid years for a dataset+geography

Geospatial Tools

pull_geos(url, filename) → gpd.GeoDataFrame

Downloads a geographic file (if missing), caches it as Parquet, and returns a GeoDataFrame.


Requirements

  • Python 3.9+
  • DuckDB
  • GeoPandas
  • Polars
  • Requests
  • jp_tools (for download helper)

Install dependencies:

pip install -r requirements.txt

Development

To run tests or modify the project:

git clone https://github.com/yourusername/CensusForge.git
cd CensusForge
pip install -e .

Cite

@software{ouslan2026censusforge,
    author       = {Ouslan, Alejandro},
    title        = {CensusForge},
    month        = jan,
    year         = 2026,
    publisher    = {Zenodo},
    version      = {0.5.0},
    doi          = {10.5281/zenodo.18121581},
    url          = {https://doi.org/10.5281/zenodo.18121581}
}

License

This project is licensed under the GNU General Public License v3.0 (GPL-3.0).

You may copy, modify, and distribute this software only under the terms of the GPL-3.0 license.

Full license text: https://www.gnu.org/licenses/gpl-3.0.en.html

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

censusforge-0.6.1.tar.gz (23.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

censusforge-0.6.1-py3-none-any.whl (23.3 MB view details)

Uploaded Python 3

File details

Details for the file censusforge-0.6.1.tar.gz.

File metadata

  • Download URL: censusforge-0.6.1.tar.gz
  • Upload date:
  • Size: 23.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.3 {"installer":{"name":"uv","version":"0.10.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Rocky Linux","version":"9.7","id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for censusforge-0.6.1.tar.gz
Algorithm Hash digest
SHA256 234f1c51affcb44944dc188ddaa65ac67bc15ca50a57855ef03c242d6a1053c7
MD5 a46bfa5bb378e71a0a2c3099074ecd86
BLAKE2b-256 e96e1f3ec296cf695cc48d775a434ab2a0e1f7c3bbe4aa1bb73dd217f0baca6d

See more details on using hashes here.

File details

Details for the file censusforge-0.6.1-py3-none-any.whl.

File metadata

  • Download URL: censusforge-0.6.1-py3-none-any.whl
  • Upload date:
  • Size: 23.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.10.3 {"installer":{"name":"uv","version":"0.10.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Rocky Linux","version":"9.7","id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for censusforge-0.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0acfc0e9a86513bc9358096520facee2e2ed281a0881597f5f2525bc8311224d
MD5 71279cb1a471389bd93be8a9a929cf57
BLAKE2b-256 56286fd983cd1dd68fd50b5f9be2c08ad0bce6d48dc6b2034e3b787bf732d8fe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page