Skip to main content

GeoPandas AI is an extension of GeoPandas allowing you to interact with your spatial data in natural language.

Project description

GeoPandas AI

GeoPandas AI is a powerful Python library that brings natural language processing capabilities to your geospatial data analysis workflow. It allows you to interact with GeoDataFrames using natural language queries, making geospatial analysis more accessible and intuitive.

Features

  • Natural language interaction with GeoDataFrames
  • Support for multiple LLM providers through LiteLLM
  • Various output types including:
    • GeoDataFrames
    • DataFrames
    • Text responses
    • Plots
    • Interactive maps
    • Lists
    • Dictionaries
    • Numeric values (integers, floats)
    • Boolean values

Installation

pip install geopandas-ai

Quick Start

import geopandas
import geopandasai as gpdai
import folium
import json

# Configure your LLM provider (example using Google's Vertex AI)
gpdai.set_active_lite_llm_config({
    "model": "vertex_ai/gemini-2.0-flash",
    "vertex_credentials": json.dumps(json.load(open("google-credentials.json", "r")))
})

# Load your geospatial data
gdfai = gpdai.read_file("your_data.geojson")

# Ask questions about your data
result = gdfai.chat("how many points are in this dataset?")
print(result)

# Get specific types of results
count = gdfai.chat("how many points?", result_type=int)
points_only = gdfai.chat("Keep only geometry of type point", result_type=geopandas.GeoDataFrame)
map_view = gdfai.chat("Plot the points", result_type=folium.Map)

Configuration

GeoPandas AI uses LiteLLM to support multiple LLM providers. You can configure your preferred provider in two ways:

  1. Using the set_active_lite_llm_config function:
from geopandasai import set_active_lite_llm_config

set_active_lite_llm_config({
    "model": "your_model_name",
    # Add provider-specific configuration
})
  1. Using environment variables:
export LITELLM_CONFIG='{"model": "your_model_name", ...}'

Please refer to https://docs.litellm.ai/docs/providers for more details on configuring LiteLLM.

Adding Custom Libraries

GeoPandas AI allows you to extend its capabilities by adding custom libraries that can be used in the generated code. There are two ways to add libraries:

  1. Globally using set_libraries:
from geopandasai import set_libraries

# Add libraries that will be available for all chat queries
set_libraries(['numpy', 'scipy', 'shapely'])
  1. Per-query using the user_provided_libraries parameter:
# Add libraries for a specific query
result = gdfai.chat(
    "calculate the convex hull using scipy",
    user_provided_libraries=['scipy', 'numpy']
)

By default, the following libraries are always available:

  • pandas
  • matplotlib.pyplot
  • folium
  • geopandas

Note: Make sure any additional libraries you specify are installed in your environment.

Examples

Basic Queries

# Count features
count = gdfai.chat("how many features?", result_type=ResultType.INTEGER)

# Filter data
filtered = gdfai.chat("show only points with population > 1000", result_type=ResultType.GEODATAFRAME)

# Get statistics
stats = gdfai.chat("what's the average population?", result_type=ResultType.FLOAT)

Visualization

# Create a plot
plot = gdfai.chat("create a scatter plot of population vs area", result_type=ResultType.PLOT)

# Generate an interactive map
map = gdfai.chat("show all points colored by population", result_type=ResultType.MAP)

Data Transformation

# Convert to list
list_data = gdfai.chat("convert to list", result_type=ResultType.LIST)

# Convert to dictionary
dict_data = gdfai.chat("convert to json", result_type=ResultType.DICT)

Caching

GeoPandas AI includes a caching system to improve performance and reduce API calls. The library provides two types of caches:

  1. InMemoryResultCache: Stores results in memory (default)
  2. FileResultCache: Persists results to disk

Using Caching

from geopandasai import set_cache_instance, FileCache, InMemoryCache

# Use in-memory cache (default)
set_cache_instance(InMemoryCache())

# Or use file-based cache
set_cache_instance(FileCache(cache_dir="./.geopandasai_cache"))

The cache automatically stores:

  • Query results
  • Generated code
  • Any intermediate results

This helps in:

  • Reducing API calls
  • Improving response times for repeated queries
  • Saving costs when using paid LLM providers

Accessing Generated Code

After each query, you can access the generated Python code that was used to produce the result:

# Make a query
result = gdfai.chat("how many points are in this dataset?")

# Access the generated code
print(gdfai.last_output.code)

# or

code = gdfai.chat("how many points are in this dataset?").code

This is useful for:

  • Understanding how the AI interpreted your query
  • Learning from the generated code
  • Debugging unexpected results
  • Reusing the generated code in your own scripts

Requirements

  • Python 3.8+
  • GeoPandas
  • LiteLLM
  • Matplotlib
  • Folium
  • Contextily

License

MIT + Commercial Platform Restriction (see LICENSE.md)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

geopandas_ai-0.0.9.tar.gz (14.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

geopandas_ai-0.0.9-py3-none-any.whl (23.0 kB view details)

Uploaded Python 3

File details

Details for the file geopandas_ai-0.0.9.tar.gz.

File metadata

  • Download URL: geopandas_ai-0.0.9.tar.gz
  • Upload date:
  • Size: 14.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for geopandas_ai-0.0.9.tar.gz
Algorithm Hash digest
SHA256 c3220e4163b99470f99187d1b2a8dae1b21339bb07a5a4efc5da8c520444fe8d
MD5 3bc39f1194eb5b6c4fdbdfd017d512cd
BLAKE2b-256 02145fff8ad699e8388cd4784af6a5046fc4913dceee3a98db9dec68e26582eb

See more details on using hashes here.

Provenance

The following attestation bundles were made for geopandas_ai-0.0.9.tar.gz:

Publisher: python-publish.yml on GaspardMerten/geopandas-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file geopandas_ai-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: geopandas_ai-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 23.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for geopandas_ai-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 2726975955e1dce13e171eedf8431d06e4f40337bf02b7f8380069412c1ebd1e
MD5 a06f06baadd77181848a3c930e03982d
BLAKE2b-256 0fd05db52407cca1ed2d5141f15905e0299f3d5e830d10eb6ec9d1b4d7c73496

See more details on using hashes here.

Provenance

The following attestation bundles were made for geopandas_ai-0.0.9-py3-none-any.whl:

Publisher: python-publish.yml on GaspardMerten/geopandas-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page