Production-ready, offline geo-intelligence library for resolving latitude/longitude to country, ISO codes, continent, and timezone information
Project description
geo-intel-offline
Production-ready, offline geo-intelligence library for resolving latitude/longitude coordinates to country, ISO codes, continent, timezone, and confidence scores. No API keys, no network requests, 100% deterministic.
📋 Table of Contents
- Why This Library Exists
- Features
- Installation
- Quick Start
- API Reference
- Examples
- Use Cases
- Advanced Usage
- Performance & Accuracy
- Architecture
- Troubleshooting
- Contributing
- Additional Resources
🌟 Why This Library Exists
Every developer working with geolocation has faced the same frustration: you need to know what country a set of coordinates belongs to, but all the solutions either cost money, require API keys, need constant internet connectivity, or have restrictive rate limits. What if you're building an offline application? What if you're processing millions of records and API costs become prohibitive? What if you need deterministic results without external dependencies?
We built geo-intel-offline to solve these real-world problems.
This library was born from the need for a reliable, fast, and completely free solution that works everywhere—from edge devices in remote locations to high-throughput data processing pipelines. No subscriptions, no rate limits, no vendor lock-in. Just pure Python that does one thing exceptionally well: tell you where in the world a coordinate belongs.
Whether you're building a mobile app that works offline, processing billions of GPS logs, enriching datasets without external APIs, or creating applications for regions with unreliable internet—this library empowers you to add geo-intelligence to your projects without compromise.
✨ Features
- 🚀 Fast: < 1ms per lookup, < 15MB memory footprint
- 📦 Offline: Zero network dependencies, works completely offline
- 🎯 Accurate: 100% accuracy across 258 countries
- 🔒 Deterministic: Same input always produces same output
- 🗜️ Optimized: 66% size reduction with automatic compression
- 🌍 Comprehensive: Supports all countries, continents, and territories
- 🎨 Clean API: Unified function for forward and reverse geocoding
- 🔧 No Dependencies: Pure Python, no native extensions
- 💰 Free Forever: No API costs, no rate limits, no hidden fees
📦 Installation
From PyPI (Recommended)
pip install geo-intel-offline
From uv
uv pip install geo-intel-offline
From Source
git clone https://github.com/RRJena/geo-intel-offline.git
cd geo-intel-offline
pip install .
🚀 Quick Start
Basic Usage
The resolve() function automatically detects forward or reverse geocoding based on arguments:
Forward Geocoding (Coordinates → Country):
from geo_intel_offline import resolve
result = resolve(40.7128, -74.0060) # New York City
print(result.country) # "United States of America"
print(result.iso2) # "US"
print(result.iso3) # "USA"
print(result.continent) # "North America"
print(result.timezone) # "America/New_York"
print(result.confidence) # 0.98
Reverse Geocoding (Country → Coordinates):
from geo_intel_offline import resolve
# Just pass country name or ISO code as a string
result = resolve("United States")
print(result.latitude) # Country centroid latitude
print(result.longitude) # Country centroid longitude
print(result.iso2) # "US"
# Works with ISO codes
result = resolve("US") # ISO2 code
result = resolve("USA") # ISO3 code
Step-by-Step Guide
Step 1: Install the Package
pip install geo-intel-offline
Step 2: Import and Use
from geo_intel_offline import resolve
# Forward geocoding: Resolve coordinates to country
result = resolve(51.5074, -0.1278) # London, UK
# Access results as attributes
print(f"Country: {result.country}")
print(f"ISO2 Code: {result.iso2}")
print(f"ISO3 Code: {result.iso3}")
print(f"Continent: {result.continent}")
print(f"Timezone: {result.timezone}")
print(f"Confidence: {result.confidence:.2f}")
# Reverse geocoding: Resolve country to coordinates
result = resolve("United Kingdom")
print(f"UK centroid: ({result.latitude}, {result.longitude})")
print(f"ISO2: {result.iso2}")
Step 3: Handle Edge Cases
from geo_intel_offline import resolve
# Ocean locations (no country)
result = resolve(0.0, 0.0) # Gulf of Guinea (ocean)
if result.country is None:
print("No country found (likely ocean)")
print(f"Confidence: {result.confidence}") # Will be 0.0
# Border regions (may have lower confidence)
result = resolve(49.0, 8.2) # Near France-Germany border
if result.confidence < 0.7:
print(f"Low confidence: {result.confidence:.2f} (near border)")
📚 API Reference
resolve(*args, data_dir=None, countries=None, continents=None, exclude_countries=None)
Unified function for both forward and reverse geocoding. Automatically detects mode based on arguments.
Forward Geocoding (Coordinates → Country):
- Pass two numeric arguments:
resolve(lat, lon) - Example:
resolve(40.7128, -74.0060)
Reverse Geocoding (Country → Coordinates):
- Pass one string argument:
resolve("Country Name")orresolve("ISO Code") - Example:
resolve("United States")orresolve("US")
Parameters:
Positional Arguments:
- Forward geocoding:
(lat: float, lon: float)- Two numeric values - Reverse geocoding:
(country: str)- One string (country name or ISO code)
Keyword Arguments:
data_dir(str, optional): Custom data directory pathcountries(list[str], optional): List of ISO2 codes to load (modular format only, forward geocoding)continents(list[str], optional): List of continent names to load (modular format only, forward geocoding)exclude_countries(list[str], optional): List of ISO2 codes to exclude (modular format only, forward geocoding)
Returns:
-
Forward Geocoding:
GeoIntelResultobject with:country(str | None): Country nameiso2(str | None): ISO 3166-1 alpha-2 codeiso3(str | None): ISO 3166-1 alpha-3 codecontinent(str | None): Continent nametimezone(str | None): IANA timezone identifierconfidence(float): Confidence score (0.0 to 1.0)
-
Reverse Geocoding:
ReverseGeoIntelResultobject with:latitude(float | None): Country centroid latitudelongitude(float | None): Country centroid longitudecountry(str | None): Country nameiso2(str | None): ISO 3166-1 alpha-2 codeiso3(str | None): ISO 3166-1 alpha-3 codecontinent(str | None): Continent nametimezone(str | None): IANA timezone identifierconfidence(float): Always 1.0 for exact country match
Methods:
to_dict(): Convert result to dictionary
Raises:
ValueError: If parameters are invalid or missingFileNotFoundError: If data files are missing
Examples:
# Forward geocoding (coordinates → country)
result = resolve(40.7128, -74.0060) # New York
print(result.country) # "United States of America"
print(result.iso2) # "US"
# Reverse geocoding (country → coordinates)
result = resolve("United States")
print(result.latitude, result.longitude) # Country centroid coordinates
print(result.iso2) # "US"
# Reverse geocoding with ISO codes
result = resolve("US") # ISO2 code
result = resolve("USA") # ISO3 code
result = resolve("JPN") # ISO3 code
# Forward geocoding with filters
result = resolve(40.7128, -74.0060, countries=["US", "CA"])
result = resolve(40.7128, -74.0060, continents=["North America"])
resolve_by_country(country_input, data_dir=None) ⚠️ Deprecated
Deprecated: Use resolve("Country Name") instead for consistency.
This function is kept for backward compatibility but the unified resolve() function is recommended.
Parameters:
country_input(str): Country name (e.g., "United States", "Japan") or ISO code (e.g., "US", "USA", "JP", "JPN")data_dir(str, optional): Custom data directory path
Returns:
ReverseGeoIntelResult object (same as resolve("Country Name"))
Example:
# Old way (still works)
from geo_intel_offline import resolve_by_country
result = resolve_by_country("United States")
# New unified way (recommended)
from geo_intel_offline import resolve
result = resolve("United States") # Just pass country name as string
Result Objects
GeoIntelResult
Result object returned by forward geocoding resolve(lat, lon).
Properties:
result.country # Country name (str | None)
result.iso2 # ISO2 code (str | None)
result.iso3 # ISO3 code (str | None)
result.continent # Continent name (str | None)
result.timezone # Timezone (str | None)
result.confidence # Confidence score (float, 0.0-1.0)
Methods:
result.to_dict() # Convert to dictionary
ReverseGeoIntelResult
Result object returned by reverse geocoding resolve("Country Name").
Properties:
result.latitude # Centroid latitude (float | None)
result.longitude # Centroid longitude (float | None)
result.country # Country name (str | None)
result.iso2 # ISO2 code (str | None)
result.iso3 # ISO3 code (str | None)
result.continent # Continent name (str | None)
result.timezone # Timezone (str | None)
result.confidence # Confidence score (float, always 1.0)
Methods:
result.to_dict() # Convert to dictionary
📖 Examples
Example 1: Resolve Multiple Locations
from geo_intel_offline import resolve
locations = [
(40.7128, -74.0060, "New York"),
(51.5074, -0.1278, "London"),
(35.6762, 139.6503, "Tokyo"),
(-33.8688, 151.2093, "Sydney"),
(55.7558, 37.6173, "Moscow"),
]
for lat, lon, name in locations:
result = resolve(lat, lon)
print(f"{name}: {result.country} ({result.iso2}) - Confidence: {result.confidence:.2f}")
Output:
New York: United States of America (US) - Confidence: 0.98
London: United Kingdom (GB) - Confidence: 0.93
Tokyo: Japan (JP) - Confidence: 0.93
Sydney: Australia (AU) - Confidence: 0.80
Moscow: Russia (RU) - Confidence: 0.93
Example 2: Batch Processing
from geo_intel_offline import resolve
import time
coordinates = [
(40.7128, -74.0060),
(51.5074, -0.1278),
(35.6762, 139.6503),
# ... more coordinates
]
start = time.perf_counter()
results = [resolve(lat, lon) for lat, lon in coordinates]
end = time.perf_counter()
print(f"Processed {len(coordinates)} coordinates in {(end - start)*1000:.2f}ms")
print(f"Average: {(end - start)*1000/len(coordinates):.3f}ms per lookup")
Example 3: Dictionary Access
from geo_intel_offline import resolve
result = resolve(37.7749, -122.4194) # San Francisco
# Access as dictionary
result_dict = result.to_dict()
print(result_dict)
# {
# 'country': 'United States of America',
# 'iso2': 'US',
# 'iso3': 'USA',
# 'continent': 'North America',
# 'timezone': 'America/Los_Angeles',
# 'confidence': 0.95
# }
# Or access as attributes
print(result.country) # "United States of America"
print(result.iso2) # "US"
Example 4: Filter by Confidence
from geo_intel_offline import resolve
def resolve_with_threshold(lat, lon, min_confidence=0.7):
"""Resolve coordinates with confidence threshold."""
result = resolve(lat, lon)
if result.confidence < min_confidence:
return None, f"Low confidence: {result.confidence:.2f}"
return result, None
result, error = resolve_with_threshold(40.7128, -74.0060, min_confidence=0.9)
if result:
print(f"High confidence result: {result.country}")
else:
print(f"Rejected: {error}")
Example 5: Error Handling
from geo_intel_offline import resolve
def safe_resolve(lat, lon):
"""Safely resolve coordinates with error handling."""
try:
result = resolve(lat, lon)
if result.country is None:
return {"error": "No country found", "confidence": result.confidence}
return {
"country": result.country,
"iso2": result.iso2,
"iso3": result.iso3,
"continent": result.continent,
"timezone": result.timezone,
"confidence": result.confidence,
}
except ValueError as e:
return {"error": f"Invalid coordinates: {e}"}
except FileNotFoundError as e:
return {"error": f"Data files not found: {e}"}
# Usage
result = safe_resolve(40.7128, -74.0060)
print(result)
Example 6: Reverse Geocoding
from geo_intel_offline import resolve
# Get country centroid coordinates
result = resolve("United States")
print(f"US centroid: ({result.latitude}, {result.longitude})")
# Works with ISO codes
us_coords = resolve("US") # ISO2 code
jp_coords = resolve("JPN") # ISO3 code
uk_coords = resolve("United Kingdom") # Country name
# The function automatically detects reverse geocoding from a single string argument
🎯 Use Cases
1. Mobile Applications
Offline-first apps that need to identify user location even without internet connectivity. Perfect for travel apps, fitness trackers, or field data collection tools that work in remote areas.
# Works offline - no internet needed!
from geo_intel_offline import resolve
def identify_user_country(lat, lon):
result = resolve(lat, lon)
return result.country # Works even in airplane mode
2. Data Processing & Analytics
Batch processing of GPS logs, location data, or transaction records. Process millions of coordinates without API rate limits or costs.
# Process millions of records - no rate limits!
import pandas as pd
from geo_intel_offline import resolve
df = pd.read_csv('location_data.csv')
df['country'] = df.apply(
lambda row: resolve(row['lat'], row['lon']).country,
axis=1
)
3. IoT & Edge Devices
Edge computing applications where devices need geo-intelligence without cloud connectivity. Perfect for sensors, trackers, or embedded systems.
# Runs on Raspberry Pi, microcontrollers, edge devices
# No cloud dependency, minimal resources
result = resolve(sensor_lat, sensor_lon)
if result.country != 'US':
trigger_alert()
4. API Alternatives & Rate Limit Avoidance
Replace expensive APIs or bypass rate limits. Perfect for applications that need high throughput or want to reduce infrastructure costs.
# Instead of: external_api.geocode(lat, lon) # $0.005 per request
# Use: resolve(lat, lon) # FREE, unlimited, instant
5. Geographic Data Enrichment
Enrich datasets with country information for analysis, visualization, or machine learning. No need to maintain external API connections or handle failures.
# Enrich logs, events, transactions with country data
events = load_events_from_database()
for event in events:
event['country'] = resolve(event['lat'], event['lon']).iso2
save_event(event)
6. Location-Based Features
Add geo-context to your applications: content localization, compliance checks, regional restrictions, or timezone-aware scheduling.
# Content localization based on location
result = resolve(user_lat, user_lon)
if result.continent == 'Europe':
show_gdpr_banner()
elif result.country == 'US':
show_us_specific_content()
7. Development & Testing
Local development and testing without needing API keys or internet connectivity. Great for CI/CD pipelines and automated testing.
# Test with real data - no mocks needed
def test_geocoding():
result = resolve(40.7128, -74.0060)
assert result.country == 'United States of America'
assert result.iso2 == 'US'
8. Research & Academic Projects
Academic research that requires reproducible results without external API dependencies or costs that might limit research scope.
# Reproducible research - same results every time
# No API costs to worry about in grant proposals
results = [resolve(lat, lon) for lat, lon in research_coordinates]
🔧 Advanced Usage
Modular Data Loading
For applications that only need specific regions, you can use modular data loading to reduce memory footprint:
from geo_intel_offline import resolve
# Load only specific countries (requires modular data format)
result = resolve(40.7128, -74.0060, countries=["US", "CA", "MX"])
# Load by continent
result = resolve(51.5074, -0.1278, continents=["Europe"])
# Exclude specific countries
result = resolve(35.6762, 139.6503, exclude_countries=["RU", "CN"])
Note: Modular data loading requires building data in modular format. See Building Custom Data below.
Building Custom Data (Advanced)
Prerequisites
- Download Natural Earth Admin 0 Countries GeoJSON:
wget https://www.naturalearthdata.com/downloads/10m-cultural-vectors/10m-admin-0-countries/ # Or use the provided script: bash scripts/download_natural_earth.sh
Build Full Dataset
# Build complete dataset with compression
python3 -m geo_intel_offline.data_builder \
data_sources/ne_10m_admin_0_countries.geojson \
geo_intel_offline/data
# Or use automated script
python3 -m geo_intel_offline.data_builder \
data_sources/ne_10m_admin_0_countries.geojson \
geo_intel_offline/data
Note: The build process automatically compresses data files, reducing size by ~66%.
Build Modular Dataset
# Build modular format (country-wise files)
python3 -m geo_intel_offline.data_builder_modular \
data_sources/ne_10m_admin_0_countries.geojson \
output_directory
# Build specific countries only
python3 -m geo_intel_offline.data_builder_modular \
--countries US,CA,MX \
data_sources/ne_10m_admin_0_countries.geojson \
output_directory
# Build by continent
python3 -m geo_intel_offline.data_builder_modular \
--continents "North America,Europe" \
data_sources/ne_10m_admin_0_countries.geojson \
output_directory
⚡ Performance & Accuracy
Performance Benchmarks
- Lookup Speed: < 1ms per resolution
- Memory Footprint: < 15 MB (all data in memory)
- Cold Start: ~100ms (initial data load)
- Data Size: ~4 MB compressed (66% reduction)
Performance Test
from geo_intel_offline import resolve
import time
test_points = [
(40.7128, -74.0060), # NYC
(51.5074, -0.1278), # London
(35.6762, 139.6503), # Tokyo
# ... more points
]
start = time.perf_counter()
for _ in range(100):
for lat, lon in test_points:
resolve(lat, lon)
end = time.perf_counter()
avg_time = ((end - start) / (100 * len(test_points))) * 1000
print(f"Average lookup time: {avg_time:.3f}ms")
Test Results
Comprehensive testing across 258 countries with perfect accuracy:
Forward Geocoding (Coordinates → Country)
- Overall Accuracy: 100.00% (2,568 passed / 2,568 total test points)
- Countries Tested: 258
- Countries with 100% Accuracy: 258 (100.0%)
- Countries with 90%+ Accuracy: 258 (100.0%)
- Test Points: 2,568 (10 points per country, varies for small territories)
Reverse Geocoding (Country → Coordinates)
- Overall Accuracy: 100.00% (730 passed / 730 total tests)
- By Country Name: 258/258 (100.00%)
- By ISO2 Code: 236/236 (100.00%)
- By ISO3 Code: 236/236 (100.00%)
Key Highlights
✅ 100% accuracy for forward geocoding across all 258 countries
✅ 100% accuracy for reverse geocoding with all input methods
✅ 258 countries fully supported including territories and disputed regions
✅ Comprehensive coverage of all continents and major territories
✅ Perfect accuracy achieved through rigorous testing and edge case handling
See TEST_RESULTS.md for detailed country-wise results, continent-level breakdowns, and comprehensive test methodology.
Understanding Confidence Scores
Confidence scores range from 0.0 to 1.0:
- 0.9 - 1.0: High confidence (well within country boundaries)
- 0.7 - 0.9: Good confidence (inside country, may be near border)
- 0.5 - 0.7: Moderate confidence (near border or ambiguous region)
- 0.0 - 0.5: Low confidence (likely ocean or disputed territory)
from geo_intel_offline import resolve
result = resolve(40.7128, -74.0060) # NYC (center of country)
print(f"Confidence: {result.confidence:.2f}") # ~0.98
result = resolve(49.0, 8.2) # Near France-Germany border
print(f"Confidence: {result.confidence:.2f}") # ~0.65-0.75
result = resolve(0.0, 0.0) # Ocean
print(f"Confidence: {result.confidence:.2f}") # 0.0
🏗️ Architecture
The library uses a hybrid three-stage resolution pipeline:
- Geohash Indexing: Fast spatial filtering to candidate countries
- Point-in-Polygon: Accurate geometric verification using ray casting
- Confidence Scoring: Distance-to-border calculation for certainty assessment
For detailed architecture documentation, see ARCHITECTURE.md.
❓ Troubleshooting
Issue: "Data file not found"
Solution: Ensure data files are present in the package installation directory, or build custom data:
# Check if data files exist
ls geo_intel_offline/data/*.json.gz
# If missing, rebuild data
python3 -m geo_intel_offline.data_builder \
path/to/geojson \
geo_intel_offline/data
Issue: Low accuracy for specific locations
Possible causes:
- Location is in ocean (no country)
- Location is on border (ambiguous)
- Location is in disputed territory
Solution: Check confidence score and handle edge cases:
result = resolve(lat, lon)
if result.confidence < 0.5:
print("Low confidence - may be ocean or border region")
Issue: Memory usage higher than expected
Solution: Use modular data loading to load only needed countries:
# Instead of loading all countries
result = resolve(lat, lon)
# Load only needed countries
result = resolve(lat, lon, countries=["US", "CA"])
🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add some AmazingFeature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
See CONTRIBUTING.md for detailed guidelines.
📝 License
MIT License - see LICENSE file for details.
📚 Additional Resources
- ARCHITECTURE.md - Internal design and architecture details
- TEST_RESULTS.md - Comprehensive test results and benchmarks
- QUICK_START.md - Quick start guide for new users
🔗 Links
- PyPI: https://pypi.org/project/geo-intel-offline/
- GitHub: https://github.com/RRJena/geo-intel-offline
- Issues: https://github.com/RRJena/geo-intel-offline/issues
🙏 Acknowledgments
- Data source: Natural Earth
- Geohash implementation: Based on standard geohash algorithm
- Point-in-Polygon: Ray casting algorithm
👨💻 Author
Rakesh Ranjan Jena
- 🌐 Blog: https://www.rrjprince.com/
- 💼 LinkedIn: https://www.linkedin.com/in/rrjprince/
Made with ❤️ by Rakesh Ranjan Jena for the Python community
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file geo_intel_offline-1.4.1-py3-none-any.whl.
File metadata
- Download URL: geo_intel_offline-1.4.1-py3-none-any.whl
- Upload date:
- Size: 4.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3571409214ef2f242012cdf49872a07a3bd491e39a80b252d001f5a1bf922462
|
|
| MD5 |
bf83da75690e989e24b32526ea1c8d5d
|
|
| BLAKE2b-256 |
60b2312203dd8431d45118f836e5f36d67fe482e5d7f54c86b5149ddef0c8afb
|