Skip to main content

An ETL for fetching/cleaning St. Louis crime data sets.

Project description

geocrime-stl

License: MIT Python 3.10+

A Python utility for cleaning and standardizing St. Louis Metropolitan Police Department (SLMPD) crime data for spatial analysis.

✨ Features

  • End-to-End ETL: Fetch, clean, and standardize monthly SLMPD crime data files automatically.
  • Spatial Formats: Convert cleaned DataFrames seamlessly into GeoJSON, GeoPackage (GPKG), or CSV.
  • Analysis Ready: Access built-in methods for minimal data summarization and visualization.

🌐 Data Source & Disclaimer

This utility programmatically fetches publicly available crime data directly from the St. Louis Metropolitan Police Department (SLMPD) Stats Page.

⚠️ Data Disclaimer & Limitations:

  • Availability: This tool relies entirely on the upstream availability and hosting structure of the SLMPD website. If their servers are down or their URL layout changes, the extraction pipeline may fail.
  • Data Integrity & Spatial Boundaries: The data processed is strictly "as-is" from the published monthly files. To ensure high data quality for spatial analysis, this utility applies two strict filtering constraints:
    1. Temporal Filtering: Because the SLMPD frequently updates historical records retroactively, this utility isolates and preserves only the records belonging to the target month.
    2. Spatial Clipping: Records containing invalid, malformed, or erroneous latitude/longitude coordinates that plot completely outside the official St. Louis city boundaries are automatically dropped.
  • Project Status: This is an independent, open-source utility. It is not affiliated with, endorsed by, or officially maintained by the SLMPD or the City of St. Louis.

🚀 Quick Start

Install

pip install geocrime-stl

Fetch/Clean/Visualize Data

import geocrime_stl as gc

data_pkg = gc.run_pipeline(4, 2026)

gc.generate_monthly_metrics(data_pkg)

gc.plot_monthly_maps(data_pkg)

🛠️ Local Setup (Alternative)

If you want to run the demo notebook locally or explore the source code, you can clone the repository directly:

git clone https://github.com/lweber89/geocrime-stl.git
cd geocrime-stl
pip install -e .

📓 Demo & Tutorial

To see a step-by-step walkthrough of the entire ETL process, check out the interactive Jupyter notebook included in this package:

👉 View the Demo Notebook

Note: If you have cloned the repository locally, you can open this file using Jupyter Lab or Jupyter Notebook to run the cells yourself.

🤝 Contributing & Pull Requests

Thank you for your interest in the project!

⚠️ Please Note: This is a personal project I developed and maintain in an effort to better understand the implementation of Python in geospatial data engineering / solutions architecture. I am not accepting pull requests or code contributions at this time.

If you find a bug or have an idea, feel free to open an Issue to let me know, or feel free to fork the repository and adapt it for your own personal needs!

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🔗 Project Links

GitHub Repository: https://github.com/lweber89/geocrime-stl

PyPI Home: https://pypi.org/project/geocrime-stl/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geocrime_stl-0.1.3.tar.gz (64.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geocrime_stl-0.1.3-py3-none-any.whl (64.2 kB view details)

Uploaded Python 3

File details

Details for the file geocrime_stl-0.1.3.tar.gz.

File metadata

  • Download URL: geocrime_stl-0.1.3.tar.gz
  • Upload date:
  • Size: 64.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for geocrime_stl-0.1.3.tar.gz
Algorithm Hash digest
SHA256 95a2ad9d2da46884e24de7edcf1519906c1a0981ddb0a94fc755aef71113f997
MD5 f802014672098a76cfc0e40056b2b58f
BLAKE2b-256 5e9f74d99fdf3ff11685548f9f0a9347bc997fbff8f86f3fe28110b64f7ad54d

See more details on using hashes here.

File details

Details for the file geocrime_stl-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: geocrime_stl-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 64.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for geocrime_stl-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6fecbf94530306a25a4bf4c13ee1e67294ebe5510d2137f4b758b070d41646a2
MD5 6ba3f48c87d35a9212fd34833753d7d4
BLAKE2b-256 e8d9a41894a86b0050e5ba686e417d849af0028f0c224ca82405ad832f85f98c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page