A Python package for Korean census data processing

pycensuskr

A Python package for Korean district-level census and geographic data.

Installation

PyPI

pip install pycensuskr

From source

git clone https://github.com/sigmafelix/pycensuskr.git
cd pycensuskr
pip install -e .

# Alternative (legacy setuptools workflow): python setup.py install

For development

git clone https://github.com/sigmafelix/pycensuskr.git
cd pycensuskr
pip install -e ".[dev]"

Usage

from pycensuskr import __version__
from pycensuskr import CensusKR
from matplotlib import pyplot as plt
import geopandas as gpd

# Print the installed package version
print(__version__)

# Create a CensusKR instance
census = CensusKR()

# Load the census data for a specific year
data_2020 = census.load_data(2020)
print(data_2020)

# Load the district boundaries for a specific year
districts_2020 = census.load_districts(2020)

# Build a grouping key from the first four digits of adm2_code
districts_2020["adm2_re"] = districts_2020["adm2_code"].astype(str).str.slice(0, 4)
# Merge (dissolve) the geometries that share the same adm2_re key
districts_2020 = districts_2020.dissolve(by="adm2_re", as_index=False)
# Rebuild an integer adm2_code by appending "0" to the four-digit key
districts_2020["adm2_code"] = (districts_2020["adm2_re"] + "0").astype(int)

# Load cleaned tax data with typed variables, aggregated by sum
df_tax_2020 = census.anycensus(year=2020, type="tax", aggregator="sum")

# Join the tax data to the district geometries on adm2_code
districts_tax_2020 = districts_2020.merge(df_tax_2020, on="adm2_code")
print(districts_tax_2020)

# Draw a map colored by the income_labor_mil column
districts_tax_2020.plot("income_labor_mil")
plt.show()
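
The code-normalization step in the usage example (keep the first four digits of adm2_code, then append "0") can be illustrated without any geographic data. The following is a minimal sketch in plain Python; the helper name parent_code and the example codes are hypothetical and chosen only to show how several district codes collapse onto one key before the dissolve and merge.

```python
def parent_code(adm2_code: int) -> int:
    """Keep the first four digits of the code and append "0".

    Mirrors the string operations in the usage example above;
    this helper is illustrative and not part of pycensuskr.
    """
    return int(str(adm2_code)[:4] + "0")


# Hypothetical codes, used only to demonstrate the grouping
codes = [31011, 31012, 31013, 11010]

# Group the original codes by their normalized key
groups = {}
for code in codes:
    groups.setdefault(parent_code(code), []).append(code)

print(groups)
# {31010: [31011, 31012, 31013], 11010: [11010]}
```

Codes that already end in "0" map to themselves, so only districts that share a four-digit prefix are combined.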

Notes on data updates

Our data cleaning and processing pipeline is based on the original R package tidycensuskr. We periodically synchronize the datasets bundled with this package with those in the R packages tidycensuskr and tidycensussfkr, so that both stay consistent. Please refer to the tidycensuskr webpage for details on the latest data updates and changes.

Development

Running tests

pytest

License

This project is licensed under the MIT License - see the LICENSE file for details.

Download files

Source Distribution

pycensuskr-0.2.6.tar.gz (9.7 MB)

Uploaded Source

Built Distribution

pycensuskr-0.2.6-py3-none-any.whl (9.7 MB)

Uploaded Python 3

File details

Details for the file pycensuskr-0.2.6.tar.gz.

File metadata

  • Download URL: pycensuskr-0.2.6.tar.gz
  • Upload date:
  • Size: 9.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pycensuskr-0.2.6.tar.gz
Algorithm    Hash digest
SHA256       4bf87d8c31e1819482e3cbfa9f8b0d370d593b0ad86005aabd6edc687c20f360
MD5          e1ed903842df2f6d6ce80335e00e99de
BLAKE2b-256  df1226beff8ba1fc4d3231fa1645e200549fd429c4e9ded6d5e4d78c3b619a7c
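
A downloaded archive can be checked against the published SHA256 digest. The following is a minimal sketch using only the Python standard library; it assumes the source distribution was saved in the current directory under its published file name, and the helper name sha256_of_file is hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large archives are not loaded at once
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# SHA256 digest listed for pycensuskr-0.2.6.tar.gz
EXPECTED = "4bf87d8c31e1819482e3cbfa9f8b0d370d593b0ad86005aabd6edc687c20f360"

# Assumed download location; adjust to where the file was saved
archive = Path("pycensuskr-0.2.6.tar.gz")
if archive.exists():
    ok = sha256_of_file(archive) == EXPECTED
    print("hash matches" if ok else "hash mismatch")
else:
    print(f"{archive} not found; download it first")
```

A mismatch means the file differs from the one that was uploaded and should not be installed.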

Provenance

The following attestation bundles were made for pycensuskr-0.2.6.tar.gz:

Publisher: build-and-release.yml on sigmafelix/pycensuskr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pycensuskr-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: pycensuskr-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 9.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pycensuskr-0.2.6-py3-none-any.whl
Algorithm    Hash digest
SHA256       4211b53700f65b96a68fcf153e0fff25f2dfdffcb9ae92864f6a31f8afac1eb1
MD5          842680f83a59863f14bff5dc65398893
BLAKE2b-256  c68c2f9cbaa654717b107367f8b3a81ee56dc93f9495adf3007b045cee2c4f2f

Provenance

The following attestation bundles were made for pycensuskr-0.2.6-py3-none-any.whl:

Publisher: build-and-release.yml on sigmafelix/pycensuskr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
