Skip to main content

A Python client for accessing sub-national variation data through the Space2Stats API

Project description

Space2Stats Python Client

A Python client for accessing the Space2Stats API, providing easy access to consistent, comparable, and authoritative sub-national variation data from the World Bank.

Installation

Core package only

pip install space2stats-client

With optional interactive widgets

pip install space2stats-client[widgets]

The widgets provide interactive Jupyter notebook components for data exploration and area selection. They require additional dependencies (ipywidgets, ipyleaflet, IPython) that are not needed for core API functionality.

API Methods

get_topics()

Returns a DataFrame containing available dataset themes/topics from the STAC catalog.


get_fields()

Returns a list of all available fields that can be used with the API.


get_properties(item_id: str)

Returns a DataFrame with descriptions of variables for a specific dataset.


fetch_admin_boundaries(iso3: str, adm: str)

Fetches administrative boundaries from GeoBoundaries API for a given country and admin level.


get_summary(gdf, spatial_join_method, fields)

Extracts H3 level data for areas of interest.

  • Parameters:
    • gdf: GeoDataFrame containing areas of interest
    • spatial_join_method: "touches", "centroid", or "within"
    • fields: List of field names to retrieve
    • geometry: Optional "polygon" or "point" to include H3 geometries
    • verbose: Optional boolean to display progress messages

get_aggregate(gdf, spatial_join_method, fields, aggregation_type)

Extracts summary statistics from H3 data.

  • Parameters:
    • gdf: GeoDataFrame containing areas of interest
    • spatial_join_method: "touches", "centroid", or "within"
    • fields: List of field names to retrieve
    • aggregation_type: "sum", "avg", "count", "max", or "min"
    • verbose: Optional boolean to display progress messages

get_summary_by_hexids(hex_ids, fields, geometry)

Retrieves statistics for specific H3 hexagon IDs.

  • Parameters:
    • hex_ids: List of H3 hexagon IDs to query
    • fields: List of field names to retrieve
    • geometry: Optional; specifies if H3 geometries should be included ("polygon" or "point")
    • verbose: Optional boolean to display progress messages

get_aggregate_by_hexids(hex_ids, fields, aggregation_type)

Aggregates statistics for specific H3 hexagon IDs.

  • Parameters:
    • hex_ids: List of H3 hexagon IDs to aggregate
    • fields: List of field names to aggregate
    • aggregation_type: Type of aggregation ("sum", "avg", "count", "max", "min")
    • verbose: Optional boolean to display progress messages

get_timeseries_fields()

Returns a list of available fields from the timeseries table.


get_timeseries(gdf, spatial_join_method, fields, start_date=None, end_date=None)

Gets timeseries data for areas of interest.

  • Parameters:
    • gdf: GeoDataFrame containing areas of interest
    • spatial_join_method: "touches", "centroid", or "within"
    • fields: List of field names to retrieve
    • start_date: Optional start date (format: 'YYYY-MM-DD')
    • end_date: Optional end date (format: 'YYYY-MM-DD')
    • geometry: Optional "polygon" or "point" to include H3 geometries
    • verbose: Optional boolean to display progress messages

get_timeseries_by_hexids(hex_ids, fields, start_date=None, end_date=None)

Gets timeseries data for specific H3 hexagon IDs.

  • Parameters:
    • hex_ids: List of H3 hexagon IDs to query
    • fields: List of field names to retrieve
    • start_date: Optional start date (format: 'YYYY-MM-DD')
    • end_date: Optional end date (format: 'YYYY-MM-DD')
    • geometry: Optional "polygon" or "point" to include H3 geometries
    • verbose: Optional boolean to display progress messages

ADM2 Summaries

Access pre-computed administrative level 2 (ADM2) summaries from the World Bank Development Data Hub.

get_adm2_dataset_info()

Returns information about available ADM2 datasets.

  • Returns: DataFrame with dataset names, resource IDs, descriptions, and URLs

get_adm2_summaries(dataset, iso3_filter=None)

Retrieves ADM2-level summary statistics from the World Bank DDH API.

  • Parameters:
    • dataset: Dataset to retrieve. Options:
      • "urbanization": Urban and rural settlement data (GHS settlement model)
      • "nighttimelights": Nighttime lights intensity data (satellite-derived luminosity)
      • "population": Population statistics (demographic data)
      • "flood_exposure": Flood exposure risk data (flood hazard and exposure metrics)
    • iso3_filter: Optional ISO3 country code to filter by (e.g., 'USA', 'BRA', 'IND')
    • verbose: Optional boolean to display progress messages
  • Returns: DataFrame containing ADM2-level statistics records

Interactive Widgets

Note: Widgets are optional components that require additional dependencies. Install with pip install space2stats-client[widgets]

Space2Stats provides interactive widgets that make it easy to explore, select data fields, and define areas of interest in Jupyter notebooks.

AOISelector

This widget provides an interactive map interface for selecting Areas of Interest (AOI) by drawing polygons or rectangles directly on the map. The AOI is automatically captured in a GeoDataFrame that can be used directly with Space2Stats API calls.

from space2stats_client.widgets.aoi_selector import AOISelector
from space2stats_client import Space2StatsClient

# Create and display the AOI selector
aoi_selector = AOISelector(center=(27.0, 29.7), zoom=6)
aoi_selector.display()

# After drawing on the map, access the AOI data
if aoi_selector.aoi:
    print(f"AOI selected with {len(aoi_selector.aoi.gdf)} polygon(s)")
    print(aoi_selector.aoi.gdf)
    
    # Use the AOI directly in API calls
    client = Space2StatsClient()
    summary = client.get_summary(
        gdf=aoi_selector.aoi.gdf,
        spatial_join_method="centroid",
        fields=["population", "gdp"]
    )

# Programmatically clear the AOI if needed
aoi_selector.clear_aoi()

Key Features:

  • Interactive map with drawing tools (polygon and rectangle)
  • Real-time feedback on selected area with approximate area calculation
  • Support for multiple polygons in a single AOI
  • Clear button to reset the selection
  • Collapsible instructions panel
  • Automatic population of GeoDataFrame for immediate use
  • Consistent API with other widgets using .display() method

Area Calculation: The approximate area is calculated by projecting the geometry from WGS84 (EPSG:4326) to Web Mercator (EPSG:3857) and computing the planar area. This provides a quick estimation but has inherent accuracy limitations:

  • Equatorial regions: Generally accurate within 5-10%
  • Mid-latitudes (30°-60°): May have 10-40% error
  • High latitudes (>60°): May have >50% error

For precise area calculations, consider using appropriate local coordinate reference systems or geodesic calculations.

CrossSectionFieldSelector

This widget helps users interactively select fields from the Space2Stats API for cross-sectional data. Fields are organized by their source STAC items for easier navigation.

from space2stats_client.widgets.cross_section_field_selector import CrossSectionFieldSelector
from space2stats_client import Space2StatsClient
import geopandas as gpd

# Initialize the client
client = Space2StatsClient()

# Create the field selector widget
selector = CrossSectionFieldSelector(client)

# Display the interactive widget in your notebook
selector.display()

# Later, retrieve the selected fields
selected_fields = selector.get_selected_fields()

# Use the selected fields in an API call
gdf = gpd.read_file("path/to/your/area.geojson")
summary = client.get_summary(
    gdf=gdf,
    spatial_join_method="centroid",
    fields=selected_fields
)

TimeSeriesFieldSelector

This widget allows users to interactively select fields for time series data and specify a valid time period based on the available data range.

from space2stats_client.widgets.time_series_field_selector import TimeSeriesFieldSelector
from space2stats_client import Space2StatsClient
import geopandas as gpd

# Initialize the client
client = Space2StatsClient()

# Create the time series field selector widget
ts_selector = TimeSeriesFieldSelector(client)

# Display the interactive widget in your notebook
ts_selector.display()

# Later, retrieve the selected fields and time period
selections = ts_selector.get_selections()

# Use the selections in a time series API call
gdf = gpd.read_file("path/to/your/area.geojson")
timeseries_data = client.get_timeseries(
    gdf=gdf,
    spatial_join_method="centroid",
    fields=selections['fields'],
    start_date=selections['time_period']['start_date'].strftime('%Y-%m-%d'),
    end_date=selections['time_period']['end_date'].strftime('%Y-%m-%d')
)

Quick Start

pip install space2stats-client
from space2stats_client import Space2StatsClient
import geopandas as gpd

# Initialize the client
client = Space2StatsClient()

# Get available topics/datasets
topics = client.get_topics()
print(topics)

# Get fields for all datasets
fields = client.get_fields()
print(fields)

# Get data for an area of interest
gdf = gpd.read_file("path/to/your/area.geojson")
summary = client.get_summary(
    gdf=gdf,
    spatial_join_method="centroid",  # Options: "touches", "centroid", "within"
    fields=["population", "gdp"],
    geometry="polygon"  # Optional: "polygon" or "point"
)

# Get aggregated statistics
aggregates = client.get_aggregate(
    gdf=gdf,
    spatial_join_method="centroid",  # Options: "touches", "centroid", "within"
    fields=["population", "gdp"],
    aggregation_type="sum"  # Options: "sum", "avg", "count", "max", "min"
)

# Get timeseries data
timeseries = client.get_timeseries(
    gdf=gdf,
    spatial_join_method="centroid",
    fields=["spi"],
    start_date="2020-01-01",  # Optional
    end_date="2020-12-31",    # Optional
    geometry="polygon"         # Optional
)

# Get ADM2 summaries for a specific country
adm2_pop = client.get_adm2_summaries(
    dataset="population",
    iso3_filter="USA"  # Optional country filter
)

# Get all available ADM2 datasets info
adm2_info = client.get_adm2_dataset_info()
print(adm2_info)

Documentation

For full documentation, visit Space2Stats Documentation.

License

This project is licensed under the World Bank Master Community License Agreement. See the LICENSE file for details.

Disclaimer

The World Bank makes no warranties regarding the accuracy, reliability, or completeness of the results and content. The World Bank disclaims any liability arising from the use of this software.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

space2stats_client-1.4.0.tar.gz (31.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

space2stats_client-1.4.0-py3-none-any.whl (28.5 kB view details)

Uploaded Python 3

File details

Details for the file space2stats_client-1.4.0.tar.gz.

File metadata

  • Download URL: space2stats_client-1.4.0.tar.gz
  • Upload date:
  • Size: 31.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for space2stats_client-1.4.0.tar.gz
Algorithm Hash digest
SHA256 9e617ebe10e0bd48a10f108044484ea74c1c0591f89f581c031d94cd5a72e2c2
MD5 34ada04a8c8ca611704fea4f50dc0c9e
BLAKE2b-256 2172e6c832f64d5e711ca4794d8f6a8bd7f3d977ed625aca5150b86a350c3cc3

See more details on using hashes here.

Provenance

The following attestation bundles were made for space2stats_client-1.4.0.tar.gz:

Publisher: client-release.yml on worldbank/DECAT_Space2Stats

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file space2stats_client-1.4.0-py3-none-any.whl.

File metadata

File hashes

Hashes for space2stats_client-1.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6712ee19760f0b9a805c70f557a8ee7dbf3ed1f43f500a19a044b0cbbb0831b3
MD5 509c3d884467612a9231add0a5a97df0
BLAKE2b-256 9e18abd08e1c8e00f0edd1b9cf2d7b83686b156174819a97689af8aa9ceb50fc

See more details on using hashes here.

Provenance

The following attestation bundles were made for space2stats_client-1.4.0-py3-none-any.whl:

Publisher: client-release.yml on worldbank/DECAT_Space2Stats

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page