GeoFeatureKit transforms simple coordinates into powerful geospatial insights. Analyze street networks, POI diversity, and spatial patterns with professional progress tracking – no paid APIs or complex setup required.

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: GIS

Project description

GeoFeatureKit

GeoFeatureKit turns raw coordinates into rich, structured geospatial features – instantly.

🎯 What You Get

Input: Just latitude and longitude coordinates
Output: Comprehensive geospatial intelligence including:

40+ POI categories: restaurants, hospitals, subway stations, benches, toilets, and more
Street network metrics: connectivity, total street length, segment distributions, pattern entropy
Spatial intelligence: POI diversity indices (Shannon, Simpson) and clustering patterns

🚀 Use Cases

Domain	Application	Key Features
🤖 Machine Learning	Price prediction, exposure analysis	Rich feature vectors, contextual embeddings
📊 Research	Propensity score matching	Urban covariates, accessibility metrics
🏙️ Urban Planning	Accessibility research, zoning analysis	Spatial patterns, connectivity measures
🧠 AI/ML	Neural networks, spatial clustering	Environmental context, amenity features

🔧 Recent Updates

✅ Fixed spatial distribution bug: Mean nearest neighbor distance now correctly calculates distances to ALL other points, not just subsequent ones
✅ Improved coordinate detection: Better handling of meter vs degree coordinate systems
✅ Enhanced precision: Cleaner formatting with appropriate decimal places
✅ Fixed network metrics: Corrected dead end and intersection counting logic
✅ Robust testing: Replaced flaky tests with deterministic grid-based validation
✅ Python 3.9+ compatibility: Full support across Python versions ✅ Automated releases: GitHub Actions now automatically publishes to PyPI on version tags

✨ Why GeoFeatureKit?

Advantage	Benefit
✅ Simple	Just coordinates in – structured features out
✅ Powerful	Dozens of geospatial metrics in one function call
✅ User-friendly	Optional progress bars and verbose modes
✅ Open Data	Built entirely on OSM and public geospatial libraries

🚀 Quick Start

Installation

pip install geofeaturekit

Basic Usage

from geofeaturekit import features_from_location

# Example: Analyze Times Square with progress bar
features = features_from_location({
    'latitude': 40.7580,
    'longitude': -73.9855,
    'radius_meters': 500
}, show_progress=True)

print(features)

📝 Example Output

Times Square Analysis (500m radius):

{
  "network_metrics": {
    "basic_metrics": {
      "total_nodes": 777,
      "total_street_segments": 2313,
      "total_intersections": 731,
      "total_dead_ends": 0,
      "total_street_length_meters": 80044.7
    },
    "density_metrics": {
      "intersections_per_sqkm": 930.74,
      "street_length_per_sqkm": 101.92
    },
    "connectivity_metrics": {
              "streets_to_nodes_ratio": 1.488,
        "average_connections_per_node": {
          "value": 5.954,
          "confidence_interval_95": {
            "lower": 5.837,
            "upper": 6.071
          }
        }
    },
    "street_pattern_metrics": {
      "street_segment_length_distribution": {
        "minimum_meters": 0.5,
        "maximum_meters": 286.6,
        "mean_meters": 34.6,
        "median_meters": 12.0,
        "std_dev_meters": 50.7
      },
      "street_bearing_distribution": {
        "mean_degrees": 163.3,
        "std_dev_degrees": 101.5
      },
      "ninety_degree_intersection_ratio": 0.0,
      "bearing_entropy": 2.056
    }
  },
  "poi_metrics": {
    "absolute_counts": {
      "total_points_of_interest": 1076,
      "counts_by_category": {
        "total_restaurant_places": {
          "count": 173,
          "percentage": 16.1
        },
        "total_fast_food_places": {
          "count": 77,
          "percentage": 7.2
        },
        "total_cafe_places": {
          "count": 74,
          "percentage": 6.9
        },
        "total_bicycle_parking_places": {
          "count": 71,
          "percentage": 6.6
        },
        "total_bench_places": {
          "count": 27,
          "percentage": 2.5
        },
        "total_bar_places": {
          "count": 26,
          "percentage": 2.4
        },
        "total_bank_places": {
          "count": 24,
          "percentage": 2.2
        },
        "total_pub_places": {
          "count": 19,
          "percentage": 1.8
        },
        "total_bicycle_rental_places": {
          "count": 15,
          "percentage": 1.4
        },
        "total_theatre_places": {
          "count": 12,
          "percentage": 1.1
        },
        "total_pharmacy_places": {
          "count": 6,
          "percentage": 0.6
        },
        "total_atm_places": {
          "count": 4,
          "percentage": 0.4
        }
      }
    },
    "density_metrics": {
      "points_of_interest_per_sqkm": 1370.700637,
      "density_by_category": {
        "restaurant_places_per_sqkm": 220.382166,
        "fast_food_places_per_sqkm": 98.089172,
        "cafe_places_per_sqkm": 94.267516,
        "bicycle_parking_places_per_sqkm": 90.44586,
        "bank_places_per_sqkm": 30.573248,
        "theatre_places_per_sqkm": 15.286624,
        "pharmacy_places_per_sqkm": 7.643312
      }
    },
    "distribution_metrics": {
      "unique_category_count": 42,
              "largest_category": {
          "name": "restaurant",
          "count": 173,
          "percentage": 16.08
        },
      "diversity_metrics": {
        "shannon_diversity_index": 2.245,
        "simpson_diversity_index": 0.79,
        "category_evenness": 0.601
      },
      "spatial_distribution": {
        "mean_nearest_neighbor_distance_meters": 13.2,
        "nearest_neighbor_distance_std_meters": 9.7,
        "r_statistic": 0.978,
        "pattern_interpretation": "random"
      }
    }
  }
}

🔍 Analysis Results

Location Characteristics	Value	Interpretation
🏙️ POI Density	1,371 per km²	Ultra-dense location (rural areas: <10)
🍽️ Food Scene	324 establishments	Dining powerhouse in 500m radius
🚲 Transit Access	86 bike facilities	Sustainable transport infrastructure
🏛️ Entertainment	12 theaters + 38 venues	Major entertainment district
🏪 Financial Services	24 banks + 4 ATMs	Active commercial hub

Network Intelligence	Value	Interpretation
🚶 Walkability	5.95 connections/node	Very high pedestrian connectivity
🗺️ Street Pattern	2.056 bearing entropy	Organized grid-like layout
🛣️ Network Density	101.9 km/km²	Dense street network

Spatial Intelligence	Value	Use Case
📊 Shannon Diversity	2.245	High variety → Rich ML features
📈 Simpson Diversity	0.79	Robust POI mix → Stable predictions
🎯 Clustering Pattern	R = 0.978	Random distribution → Uniform coverage

Perfect for: Price prediction models, accessibility scoring, urban planning analysis

🎯 Key Features

Rich POI Analysis (Points of Interest)

40+ categories: restaurants, hospitals, schools, transit, entertainment
Density metrics: POIs per square kilometer by category
Diversity indices:
- Shannon diversity: Measures variety and evenness (higher = more diverse)
- Simpson diversity: Probability two random POIs are different types
Spatial patterns: clustered, dispersed, or random POI distributions

Street Network Insights

Connectivity: average connections per intersection
Total length: meters of streets within radius
Segment patterns: distribution of street segment lengths
Bearing analysis: street orientation entropy and grid patterns

Progress Tracking

Mode	Code	Use Case
Standard	`show_progress=True, progress_detail='normal'`	General use with progress bars
Verbose	`show_progress=True, progress_detail='verbose'`	Detailed debugging information
Silent	`show_progress=False`	Batch processing, production

# Example: Verbose progress tracking
features = features_from_location(location, show_progress=True, progress_detail='verbose')

🔬 Scientific Applications

Geospatial Research:

# Compare neighborhood walkability
locations = [
    {'latitude': 40.7580, 'longitude': -73.9855, 'radius_meters': 800},  # Times Square
    {'latitude': 40.7829, 'longitude': -73.9654, 'radius_meters': 800}   # Central Park
]

for loc in locations:
    features = features_from_location(loc)
    walkability_score = (
        features['poi_metrics']['density_metrics']['points_of_interest_per_sqkm'] * 0.4 +
        features['network_metrics']['connectivity_metrics']['average_connections_per_node']['value'] * 100 * 0.6
    )
    print(f"Walkability score: {walkability_score:.1f}")

ML Feature Engineering:

# Generate features for price prediction model
import pandas as pd

properties = pd.read_csv('real_estate.csv')  # lat, lon, price columns
features_list = []

for _, row in properties.iterrows():
    location_features = features_from_location({
        'latitude': row['lat'],
        'longitude': row['lon'], 
        'radius_meters': 1000
    }, show_progress=False)
    
    # Extract key features for ML
    features_list.append({
        'restaurant_density': location_features['poi_metrics']['density_metrics']['restaurant_places_per_sqkm'],
        'transit_access': location_features['poi_metrics']['absolute_counts']['counts_by_category'].get('total_bus_station_places', {}).get('count', 0),
        'street_connectivity': location_features['network_metrics']['connectivity_metrics']['average_connections_per_node']['value'],
        'location_diversity': location_features['poi_metrics']['distribution_metrics']['diversity_metrics']['shannon_diversity_index']
    })

# Add to your ML pipeline
features_df = pd.DataFrame(features_list)
properties = pd.concat([properties, features_df], axis=1)

🛠 Advanced Usage

Batch Processing

# Process multiple locations efficiently
locations = [
    {'latitude': 40.7580, 'longitude': -73.9855, 'radius_meters': 500},
    {'latitude': 40.7829, 'longitude': -73.9654, 'radius_meters': 500},
    {'latitude': 40.7527, 'longitude': -73.9772, 'radius_meters': 500}
]

results = features_from_location(locations, show_progress=True)

Command Line Interface

# Single location analysis
geofeaturekit analyze 40.7580 -73.9855 --radius 500 --verbose

# Batch analysis from file
geofeaturekit batch-analyze locations.json --radius 1000 --output results/

Custom Radius Analysis

# Compare different scales
radii = [200, 500, 1000, 2000]  # meters

for radius in radii:
    features = features_from_location({
        'latitude': 40.7580,
        'longitude': -73.9855, 
        'radius_meters': radius
    })
    
    poi_count = features['poi_metrics']['absolute_counts']['total_points_of_interest']
    print(f"{radius}m radius: {poi_count} POIs")

📖 Key Terms

Term	Definition	Scale
POI	Points of Interest (restaurants, hospitals, schools, ATMs)	Count
Shannon Diversity	Measures variety and evenness of POI types	0-4+ (higher = more diverse)
Simpson Diversity	Probability two random POIs are different types	0-1 (higher = more diverse)
Bearing Entropy	Street grid organization measure	0-4+ (lower = more organized)
R-statistic	Spatial clustering pattern	0-2.1 (<1 clustered, ~1 random, >1 dispersed)
Connectivity	Average connections per street intersection	2-8+ (higher = more walkable)

📊 Output Structure

GeoFeatureKit returns a comprehensive dictionary with four main sections:

{
    'network_metrics': {
        'basic_metrics': {...},      # Node/edge counts, total length
        'density_metrics': {...},    # Per-km² measurements  
        'connectivity_metrics': {...}, # Connection patterns
        'street_pattern_metrics': {...} # Orientation, segment analysis
    },
    'poi_metrics': {
        'absolute_counts': {...},    # Raw POI counts by category
        'density_metrics': {...},    # POIs per km² by category
        'distribution_metrics': {...} # Diversity and spatial patterns
    },
    'units': {
        'area': 'square_meters',
        'length': 'meters', 
        'density': 'per_square_kilometer'
    }
}

🌍 Standards & Quality

SI Units: All measurements in meters, square kilometers
Confidence Intervals: Statistical uncertainty for network metrics
Reproducible: Deterministic results with caching
Validated: Comprehensive test suite with property-based testing

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

🚀 Automated Releases

GeoFeatureKit uses automated releases via GitHub Actions. Every time a version tag is pushed, the package is automatically:

✅ Tested on Python 3.9, 3.10, 3.11, and 3.12
✅ Built with proper validation
✅ Published to PyPI
✅ Released on GitHub with auto-generated notes

For maintainers: Use ./release.sh <version> to automate the entire release process.

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

Built with OSMnx, NetworkX, and GeoPandas. Data from OpenStreetMap contributors.

📚 Citation

If you use GeoFeatureKit in your research, please cite:

@software{geofeaturekit2025,
    title={GeoFeatureKit: Geospatial Feature Extraction and Analysis},
    author={Alexander Li},
    year={2025},
    url={https://github.com/lihangalex/geofeaturekit}
}

Ready to analyze any location? Start with pip install geofeaturekit and explore geospatial patterns like never before! 🌍

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: GIS

Release history Release notifications | RSS feed

0.6.1

Jul 8, 2025

0.6.0

Jul 7, 2025

0.5.1

Jul 6, 2025

0.5.0

Jul 6, 2025

0.4.0

Jul 6, 2025

0.2.9

Jul 6, 2025

0.2.8

Jul 6, 2025

This version

0.2.7

Jul 6, 2025

0.2.6

Jul 6, 2025

0.2.4

Jul 5, 2025

0.2.3

Jul 5, 2025

0.2.2

Jul 5, 2025

0.2.1

Jul 5, 2025

0.2.0

Jul 5, 2025

0.1.5

Jul 5, 2025

0.1.4

Jul 5, 2025

0.1.2

Jul 5, 2025

0.1.1

Jul 5, 2025

0.1.0

Jul 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geofeaturekit-0.2.7.tar.gz (56.8 kB view details)

Uploaded Jul 6, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

geofeaturekit-0.2.7-py3-none-any.whl (48.7 kB view details)

Uploaded Jul 6, 2025 Python 3

File details

Details for the file geofeaturekit-0.2.7.tar.gz.

File metadata

Download URL: geofeaturekit-0.2.7.tar.gz
Upload date: Jul 6, 2025
Size: 56.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for geofeaturekit-0.2.7.tar.gz
Algorithm	Hash digest
SHA256	`9e796cb0f13468958361eab4054913766d7e1f26d4ffc5a9f95cc4acbcc48bbb`
MD5	`bec66d1afbf88a1ebe729535c122b774`
BLAKE2b-256	`738c17a355589e693e8d7a49b29cd3de69e308d93cf4354c631f924813b0b114`

See more details on using hashes here.

File details

Details for the file geofeaturekit-0.2.7-py3-none-any.whl.

File metadata

Download URL: geofeaturekit-0.2.7-py3-none-any.whl
Upload date: Jul 6, 2025
Size: 48.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.18

File hashes

Hashes for geofeaturekit-0.2.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`94761341131c016564a916a8d94389951d310c33b46c95ad0f359ffc8768c909`
MD5	`c69d34b63de804a2de96c638283a9932`
BLAKE2b-256	`b9fe3f5babd0127ce935aa10376de89a26b5e14731825231ebcc612a86793db1`

See more details on using hashes here.

geofeaturekit 0.2.7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

GeoFeatureKit

🎯 What You Get

🚀 Use Cases

🔧 Recent Updates

✨ Why GeoFeatureKit?

🚀 Quick Start

Installation

Basic Usage

📝 Example Output

🔍 Analysis Results

🎯 Key Features

Rich POI Analysis (Points of Interest)

Street Network Insights

Progress Tracking

🔬 Scientific Applications

🛠 Advanced Usage

Batch Processing

Command Line Interface

Custom Radius Analysis

📖 Key Terms

📊 Output Structure

🌍 Standards & Quality

🤝 Contributing

🚀 Automated Releases

📄 License

🙏 Acknowledgments

📚 Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes