A Python client for accessing sub-national variation data through the Space2Stats API
Project description
Space2Stats Python Client
A Python client for accessing the Space2Stats API, providing easy access to consistent, comparable, and authoritative sub-national variation data from the World Bank.
API Methods
get_topics()
Returns a DataFrame containing available dataset themes/topics from the STAC catalog.
get_fields()
Returns a list of all available fields that can be used with the API.
get_properties(item_id: str)
Returns a DataFrame with descriptions of variables for a specific dataset.
fetch_admin_boundaries(iso3: str, adm: str)
Fetches administrative boundaries from GeoBoundaries API for a given country and admin level.
get_summary(gdf, spatial_join_method, fields)
Extracts H3 level data for areas of interest.
- Parameters:
gdf: GeoDataFrame containing areas of interestspatial_join_method: "touches", "centroid", or "within"fields: List of field names to retrievegeometry: Optional "polygon" or "point" to include H3 geometriesverbose: Optional boolean to display progress messages
get_aggregate(gdf, spatial_join_method, fields, aggregation_type)
Extracts summary statistics from H3 data.
- Parameters:
gdf: GeoDataFrame containing areas of interestspatial_join_method: "touches", "centroid", or "within"fields: List of field names to retrieveaggregation_type: "sum", "avg", "count", "max", or "min"verbose: Optional boolean to display progress messages
get_summary_by_hexids(hex_ids, fields, geometry)
Retrieves statistics for specific H3 hexagon IDs.
- Parameters:
hex_ids: List of H3 hexagon IDs to queryfields: List of field names to retrievegeometry: Optional; specifies if H3 geometries should be included ("polygon" or "point")verbose: Optional boolean to display progress messages
get_aggregate_by_hexids(hex_ids, fields, aggregation_type)
Aggregates statistics for specific H3 hexagon IDs.
- Parameters:
hex_ids: List of H3 hexagon IDs to aggregatefields: List of field names to aggregateaggregation_type: Type of aggregation ("sum", "avg", "count", "max", "min")verbose: Optional boolean to display progress messages
get_timeseries_fields()
Returns a list of available fields from the timeseries table.
get_timeseries(gdf, spatial_join_method, fields, start_date=None, end_date=None)
Gets timeseries data for areas of interest.
- Parameters:
gdf: GeoDataFrame containing areas of interestspatial_join_method: "touches", "centroid", or "within"fields: List of field names to retrievestart_date: Optional start date (format: 'YYYY-MM-DD')end_date: Optional end date (format: 'YYYY-MM-DD')geometry: Optional "polygon" or "point" to include H3 geometriesverbose: Optional boolean to display progress messages
get_timeseries_by_hexids(hex_ids, fields, start_date=None, end_date=None)
Gets timeseries data for specific H3 hexagon IDs.
- Parameters:
hex_ids: List of H3 hexagon IDs to queryfields: List of field names to retrievestart_date: Optional start date (format: 'YYYY-MM-DD')end_date: Optional end date (format: 'YYYY-MM-DD')geometry: Optional "polygon" or "point" to include H3 geometriesverbose: Optional boolean to display progress messages
Interactive Widgets
Space2Stats provides interactive widgets that make it easy to explore and select data fields in Jupyter notebooks.
CrossSectionFieldSelector
This widget helps users interactively select fields from the Space2Stats API for cross-sectional data. Fields are organized by their source STAC items for easier navigation.
from space2stats_client import Space2StatsClient, CrossSectionFieldSelector
# Initialize the client
client = Space2StatsClient()
# Create the field selector widget
selector = CrossSectionFieldSelector(client)
# Display the interactive widget in your notebook
selector.display()
# Later, retrieve the selected fields
selected_fields = selector.get_selected_fields()
# Use the selected fields in an API call
gdf = gpd.read_file("path/to/your/area.geojson")
summary = client.get_summary(
gdf=gdf,
spatial_join_method="centroid",
fields=selected_fields
)
TimeSeriesFieldSelector
This widget allows users to interactively select fields for time series data and specify a valid time period based on the available data range.
from space2stats_client import Space2StatsClient, TimeSeriesFieldSelector
# Initialize the client
client = Space2StatsClient()
# Create the time series field selector widget
ts_selector = TimeSeriesFieldSelector(client)
# Display the interactive widget in your notebook
ts_selector.display()
# Later, retrieve the selected fields and time period
selections = ts_selector.get_selections()
# Use the selections in a time series API call
gdf = gpd.read_file("path/to/your/area.geojson")
timeseries_data = client.get_timeseries(
gdf=gdf,
spatial_join_method="centroid",
fields=selections['fields'],
start_date=selections['time_period']['start_date'].strftime('%Y-%m-%d'),
end_date=selections['time_period']['end_date'].strftime('%Y-%m-%d')
)
Quick Start
pip install space2stats-client
from space2stats_client import Space2StatsClient
import geopandas as gpd
# Initialize the client
client = Space2StatsClient()
# Get available topics/datasets
topics = client.get_topics()
print(topics)
<<<<<<< HEAD
# Get fields for all datasets
=======
# Get fields for a specific dataset
>>>>>>> main
fields = client.get_fields()
print(fields)
# Get data for an area of interest
gdf = gpd.read_file("path/to/your/area.geojson")
summary = client.get_summary(
gdf=gdf,
spatial_join_method="centroid", # Options: "touches", "centroid", "within"
fields=["population", "gdp"],
geometry="polygon" # Optional: "polygon" or "point"
)
# Get aggregated statistics
aggregates = client.get_aggregate(
gdf=gdf,
spatial_join_method="centroid", # Options: "touches", "centroid", "within"
fields=["population", "gdp"],
aggregation_type="sum" # Options: "sum", "avg", "count", "max", "min"
)
# Get timeseries data
timeseries = client.get_timeseries(
gdf=gdf,
spatial_join_method="centroid",
fields=["spi"],
start_date="2020-01-01", # Optional
end_date="2020-12-31", # Optional
geometry="polygon" # Optional
)
# Get time series data
timeseries = client.get_timeseries(
gdf=gdf,
spatial_join_method="centroid",
fields=["spi"],
start_date="2020-01-01",
end_date="2020-12-31"
)
Documentation
For full documentation, visit Space2Stats Documentation.
License
This project is licensed under the World Bank Master Community License Agreement. See the LICENSE file for details.
Disclaimer
The World Bank makes no warranties regarding the accuracy, reliability, or completeness of the results and content. The World Bank disclaims any liability arising from the use of this software.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file space2stats_client-1.3.0.tar.gz.
File metadata
- Download URL: space2stats_client-1.3.0.tar.gz
- Upload date:
- Size: 24.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4b46918967cb43b0fc97145cc4437a92a2b6acc3dcbc34a30234e513ed7b665d
|
|
| MD5 |
4d5b6aa12aaa30029af11d1ecb044471
|
|
| BLAKE2b-256 |
93de083d3f5657e06f40d666224c875e77e5afd9ff141d3663a2fa9f3b0952e5
|
Provenance
The following attestation bundles were made for space2stats_client-1.3.0.tar.gz:
Publisher:
client-release.yml on worldbank/DECAT_Space2Stats
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
space2stats_client-1.3.0.tar.gz -
Subject digest:
4b46918967cb43b0fc97145cc4437a92a2b6acc3dcbc34a30234e513ed7b665d - Sigstore transparency entry: 227073297
- Sigstore integration time:
-
Permalink:
worldbank/DECAT_Space2Stats@9e9c3b19f73e9b769d8ab18afc2e7095e8df47e9 -
Branch / Tag:
refs/tags/v1.3.0 - Owner: https://github.com/worldbank
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
client-release.yml@9e9c3b19f73e9b769d8ab18afc2e7095e8df47e9 -
Trigger Event:
release
-
Statement type:
File details
Details for the file space2stats_client-1.3.0-py3-none-any.whl.
File metadata
- Download URL: space2stats_client-1.3.0-py3-none-any.whl
- Upload date:
- Size: 22.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ef874df025fee564eff5397fafe4a6861715af28900858d234c7a39eea5cbe68
|
|
| MD5 |
848486c3c221489f0d9e388dad1cb1b6
|
|
| BLAKE2b-256 |
b5a08d39df2a2be4797311b9bc9abadb057beb9409e40a843764457cb5993284
|
Provenance
The following attestation bundles were made for space2stats_client-1.3.0-py3-none-any.whl:
Publisher:
client-release.yml on worldbank/DECAT_Space2Stats
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
space2stats_client-1.3.0-py3-none-any.whl -
Subject digest:
ef874df025fee564eff5397fafe4a6861715af28900858d234c7a39eea5cbe68 - Sigstore transparency entry: 227073301
- Sigstore integration time:
-
Permalink:
worldbank/DECAT_Space2Stats@9e9c3b19f73e9b769d8ab18afc2e7095e8df47e9 -
Branch / Tag:
refs/tags/v1.3.0 - Owner: https://github.com/worldbank
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
client-release.yml@9e9c3b19f73e9b769d8ab18afc2e7095e8df47e9 -
Trigger Event:
release
-
Statement type: