OpenPoliceData
The OpenPoliceData (OPD) Python library is the most comprehensive centralized public access point for incident-level police data in the United States. OPD provides easy access to 550+ incident-level datasets from 236 police agencies and 11 entire states. Types of data include traffic stops, use of force, officer-involved shootings, and complaints.
Users request data by department name and type of data, and the data is returned as a pandas DataFrame. There is no need to manually find the data online or to know how to work with open data APIs (ArcGIS, Socrata, etc.). When data is loaded by OPD, the returned data is unmodified (with the exception of formatting known date fields) from what appears on the source's site, and OPD provides links to the original data for transparency.
OpenPoliceData can be installed from the Python Package Index (PyPI):
pip install openpolicedata
After the import, OpenPoliceData provides access to police data with 2 simple lines of code:
> import openpolicedata as opd
> src = opd.Source("New Orleans")
> data = src.load(table_type="USE OF FORCE", year=2022)
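The same two calls can be reused to fetch several years of one table. The following is a minimal sketch, not part of OPD itself: the helper name `load_years` and its `source_factory` parameter are hypothetical, and only `opd.Source` and `src.load(table_type=..., year=...)` from the example above are assumed to exist.

```python
def load_years(source_name, table_type, years, source_factory=None):
    """Load one table type for several years, returning {year: loaded data}.

    `source_factory` is a hypothetical hook (not an OPD feature) that lets
    callers substitute a stand-in for opd.Source, e.g. in offline tests.
    """
    if source_factory is None:
        # Imported lazily so the helper can be defined without the package installed.
        import openpolicedata as opd
        source_factory = opd.Source

    src = source_factory(source_name)
    # One request per year, using the same call as the example above.
    return {year: src.load(table_type=table_type, year=year) for year in years}
```

Calling `load_years("New Orleans", "USE OF FORCE", [2021, 2022])` would issue the request shown above once per year, assuming both years are available from the source.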
- Documentation: https://openpolicedata.readthedocs.io/
- Source Code: https://github.com/openpolicedata/openpolicedata
- Bug Tracker: https://github.com/openpolicedata/openpolicedata/issues
- Latest Datasets
- Release Notes
- Contributing
Latest Datasets Added to OPD
We've added a huge number of datasets recently!
- 2025 datasets: Phoenix and San Jose Calls for Service; Cedar Lake, IN Arrests, Calls for Service, Traffic Stops, Traffic Citations, Traffic Warnings, and Crashes; Griffith, IN Arrests, Calls for Service, Traffic Stops, Traffic Citations, Traffic Warnings, and Crashes; St. John, IN Arrests, Calls for Service, Traffic Stops, Traffic Citations, Traffic Warnings, and Crashes; Wichita Crashes; Louisville Incidents; New Orleans Calls for Service and Incidents; Minneapolis Incidents; Portland, OR Calls for Service
- 2024 datasets: Oakland Use of Force; Washington D.C. Incidents and Lawsuits; Chicago Pedestrian Stops; Louisville Incidents; New Orleans Calls for Service and Incidents; Baltimore Calls for Service; Pittsfield, MA Arrests and Crashes; Lincoln Calls for Service, Incidents, Traffic Stops, Vehicle Pursuits, and Use of Force; New York City Pedestrian Stops; Asheville Calls for Service; Durham Calls for Service; Norman Arrests, Complaints, Crashes, Incidents, Traffic Stops, and Use of Force; Charleston Citations; Nashville Calls for Service; Albemarle County, VA Stops; Bremerton, WA Arrests, Citations, and Incidents
- California: Stops data for all departments submitted for RIPA for 2018-2023
- Cincinnati Officer Involved Shootings and Use of Force
- Memphis Traffic Citations and Traffic Stops
- New Jersey State Police 2021 Traffic Stops
- Los Angeles Calls for Service and Incidents
- Seattle Arrests
- Portland, OR Arrests and Incidents
- Richmond, CA Arrests
- San Diego Incidents
- Pittsburg Incidents
- Austin Calls for Service and Crashes
- Tucson Crashes and Incidents
- Long Beach Stops
- Salinas, CA Crashes
- Cary, NC Crashes and Incidents
- Morrisville, NC Incidents
v0.11 - 2025-06-06
Added
- Added fuzzy searching for source name in datasets query
- Added rapidfuzz as required dependency
- Added data loader for Opendatasoft API
- Added to_feather and load_feather functions to export and re-import tables from feather files
- Added to_parquet and load_parquet functions to export and re-import tables from parquet files
- Added load_csv function to replace load_from_csv in the next release (v1.0)
Changed
- Data loaders moved into separate modules
- Changed enum for pedestrian stops table from PEDESTRIAN to PEDESTRIAN_STOPS
- Minimum geopandas version is now 0.8
- rapidfuzz is now a required dependency
Deprecated
- Deprecated load_from_csv function. load_csv should be used instead.
- Added note that all deprecated functionality will be removed in the next release (v1.0)
Fixed
- Fixed bug when requested date range contains a date and a year
- Now handles strings that are valid date strings once stripped
- Fixed bug in pandas deprecation handler class if the length of the table is 0
Complete change log available at: https://github.com/openpolicedata/openpolicedata/blob/main/CHANGELOG.md
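To illustrate the idea behind the fuzzy source-name search listed under Added, here is a minimal sketch of approximate name matching. It is not OPD's implementation: OPD depends on rapidfuzz, while this sketch swaps in the standard library's `difflib` so it runs without extra dependencies. The function name `fuzzy_find_source` and the list `known_sources` are hypothetical.

```python
import difflib


def fuzzy_find_source(query, source_names, cutoff=0.6):
    """Return known source names that approximately match `query`, best match first."""
    # Compare case-insensitively, but return names in their original spelling.
    lowered = {name.lower(): name for name in source_names}
    matches = difflib.get_close_matches(
        query.lower(), list(lowered), n=3, cutoff=cutoff
    )
    return [lowered[m] for m in matches]


# Hypothetical list of source names, used for illustration only.
known_sources = ["New Orleans", "Louisville", "Minneapolis", "Phoenix"]
print(fuzzy_find_source("New Orleens", known_sources))
```

A misspelled query such as "New Orleens" still resolves to the intended source, while a query with no close match returns an empty list. The `cutoff` value controls how tolerant the match is.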
Contributing
All contributions are welcome, including code enhancements, bug fixes, bug reports, documentation updates, and locating new datasets. If you're interested in helping out, see our Contributing Guide or reach out by email.