Scripts for sampling Geo data sets by the specific region name

Project Description

Geo sampling

Say you want to estimate the average number of potholes per kilometer of street in a city, or some similar quantity. To do so, you need to sample locations on the streets. This package helps you sample those locations. In particular, the package implements the following sampling strategy:

  1. Sampling Frame: Get all the streets in the region of interest from OpenStreetMap. To accomplish that, the package first downloads administrative boundary data, in ESRI format, for the country in which the region is located. The administrative data is organized in multiple levels: for instance, cities are nested in states, which are nested in countries. The user can choose a city or a state, but not a portion of a city. The package then uses the pyshp package to build a URL for the site from which the OSM data can be downloaded.

  2. Sampling Design:

    • For each street (or road), starting from one end, we split the street into 0.5 km segments until we reach the other end. (The last segment, or the only segment if the street is shorter than 0.5 km, can be shorter than 0.5 km.)
    • Get the lat/long of the starting and ending points of each segment, and assume that the street is a straight line between those two points.
    • Next, create a database of all the segments.
    • Sample rows from the database and produce a CSV of the sampled segments.
    • Plot the lat/long, filling all the area within the segment. These shaded regions are the regions for which data needs to be collected.
  3. Data Collection: Collect data on the highlighted segments.
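
The segmentation and sampling steps above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the package's own code: the function names (haversine_km, interpolate, split_street, sample_to_csv) and the CSV column names are hypothetical, the sketch targets Python 3 with only the standard library, and each street is assumed to be an ordered list of (lat, long) points.

```python
import csv
import io
import math
import random

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(p, q):
    """Great-circle distance in km between two (lat, long) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (p[0], p[1], q[0], q[1]))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def interpolate(p, q, t):
    """Point a fraction t of the way from p to q (linear in lat/long)."""
    return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))


def split_street(points, seg_km=0.5):
    """Cut a street polyline into (start, end) pairs, each seg_km long.

    Hypothetical helper: only the end points of each segment are kept,
    so a segment is treated as a straight line between them.
    """
    segments = []
    start = points[0]
    remaining = seg_km  # distance still needed to finish the current segment
    for p, q in zip(points, points[1:]):
        edge = haversine_km(p, q)
        pos = 0.0  # distance already consumed along this edge
        while edge - pos >= remaining:
            pos += remaining
            cut = interpolate(p, q, pos / edge)
            segments.append((start, cut))
            start, remaining = cut, seg_km
        remaining -= edge - pos
    if seg_km - remaining > 1e-9:  # leftover piece shorter than seg_km
        segments.append((start, points[-1]))
    return segments


def sample_to_csv(segments, n, seed=None):
    """Draw a simple random sample of n segments and return it as CSV text."""
    rng = random.Random(seed)
    chosen = rng.sample(list(enumerate(segments)), min(n, len(segments)))
    buf = io.StringIO()
    writer = csv.writer(buf)
    # Column names are illustrative, not the package's actual output schema.
    writer.writerow(["segment_id", "start_lat", "start_long", "end_lat", "end_long"])
    for seg_id, (start, end) in sorted(chosen):
        writer.writerow([seg_id, start[0], start[1], end[0], end[1]])
    return buf.getvalue()


# A made-up street of roughly 1.2 km along the equator, with one bend point.
street = [(0.0, 0.0), (0.0, 0.006), (0.0, 0.0108)]
segments = split_street(street)
print(sample_to_csv(segments, n=2, seed=1))
```

Simple random sampling without replacement is only one possible design; the sketch uses it because the description above says only that rows are sampled from the database.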


A couple of dependencies need to be built from source on Windows, so you may need to install the Microsoft Visual C++ Compiler for Python 2.7.


Prepare the working directory. We recommend installing in a Python virtual environment.

mkdir geo_sampling
cd geo_sampling
virtualenv -p python2.7 venv
. venv/bin/activate

Upgrade the Python packages pip and setuptools to the latest version.

pip install --upgrade pip setuptools

Install the geo-sampling package from PyPI.

pip install geo-sampling


For more information please visit the project documentation page.


Suriyan Laohaprapanon and Gaurav Sood


Scripts are released under the MIT License.

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name                              File Type    Upload Date
geo-sampling-0.0.7.tar.gz (10.4 kB)    Source       Oct 20, 2016
