Skip to main content

Transform pixelated geometries from raster data into smooth natural looking features

Project description

Smoothify Text

Python Version License PyPI version Conda version Tutorials

📋 View Changelog

A Python package for smoothing and refining geometries derived from raster data classifications. Smoothify transforms jagged polygons and lines resulting from raster-to-vector conversion into smooth, visually appealing features using an optimized implementation of Chaikin's corner-cutting algorithm.

Problem

Polygons and lines derived from classified raster data (e.g., ML model predictions, spectral indices, or remote sensing classifications) often have unnatural "stair-stepped" or "pixelated" edges that:

  • Are visually unappealing in maps and GIS applications
  • Can be difficult to work with in downstream vector processing
  • Don't represent the real-world features they're meant to depict

Solution

Smoothify applies an optimized implementation of Chaikin's corner-cutting algorithm along with other geometric processing to create smooth, natural-looking features while:

  • Preserving the general shape and area of polygons
  • Supporting all shapley geometry types
  • Handling shapes with interior holes
  • Efficiently processing large datasets with multiprocessing

Smoothify Hero Image

Installation

uv add smoothify

or

pip install smoothify

or

conda install conda-forge::smoothify

Quick Start

import geopandas as gpd
from smoothify import smoothify

# Load your polygonized raster data
polygon_gdf = gpd.read_file("path/to/your/polygons.gpkg")

# Apply smoothing (segment_length auto-detected from geometry)
smoothed_gdf = smoothify(
    geom=polygon_gdf,
    smooth_iterations=3,  # More iterations = smoother result
    num_cores=4  # Use parallel processing for large datasets
)

# Or specify segment_length explicitly (generally recommended)
smoothed_gdf = smoothify(
    geom=polygon_gdf,
    segment_length=10.0,  # Use the original raster resolution
    smooth_iterations=3,
    num_cores=4
)

# Save the result
smoothed_gdf.to_file("smoothed_polygons.gpkg")

Examples

Example notebooks:

Basic Polygon Smoothing

Transform pixelated polygons from raster data into smooth, natural-looking features:

Basic Polygon Smoothing

LineString Smoothing

Works perfectly for roads, streams, and other linear features:

LineString Smoothing

Controlling Smoothness with Iterations

The smooth_iterations parameter controls how smooth the result will be:

Effect of Different Iterations

Merging Adjacent Geometries

When processing multiple adjacent polygons, allowing merge_collection = True produces a combined result:

Merging Adjacent Geometries

General Usage

The smoothify() function accepts three types of input:

1. GeoDataFrame

import geopandas as gpd
from smoothify import smoothify
# By default this will dissolve adjacent polygons before smoothing
gdf = gpd.read_file("polygons.gpkg")
smoothed_gdf = smoothify(
    geom=gdf,
    segment_length=10.0,
    smooth_iterations=3,
    num_cores=4
)

# Dissolve geometries by a specific field before smoothing
# Useful for merging adjacent polygons with the same classification
gdf_with_classes = gpd.read_file("classified_polygons.gpkg")
smoothed_by_class = smoothify(
    geom=gdf_with_classes,
    segment_length=10.0,
    smooth_iterations=3,
    merge_collection=True,
    merge_field="land_type",  # Merge adjacent geometries with same land_type
    num_cores=4
)

2. Single Geometry

from shapely.geometry import Polygon
from smoothify import smoothify

polygon = Polygon([(0, 0), (10, 0), (10, 10), (0, 10)])
smoothed_polygon = smoothify(
    geom=polygon,
    smooth_iterations=3
)

3. List of Geometries or GeometryCollection

from shapely.geometry import Polygon, LineString
from smoothify import smoothify

geometries = [
    Polygon([(0, 0), (10, 0), (10, 10), (0, 10)]),
    LineString([(0, 0), (5, 5), (10, 0)])
]
smoothed = smoothify(
    geom=geometries,
    segment_length=1.0,
    smooth_iterations=3
)

Parameters

Parameter Type Default Description
geom GeoDataFrame, BaseGeometry, or list[BaseGeometry] Required The geometry/geometries to smooth
segment_length float None Resolution of the original raster data in map units. If None (default), automatically detects by finding the minimum segment length (from a data sample). Recommended to specify explicitly when known
smooth_iterations int 3 Number of Chaikin corner-cutting iterations (typically 3-5). Higher values = smoother output with more vertices
num_cores int 0 Number of CPU cores for parallel processing (0 = all available cores, 1 = serial)
merge_collection bool True Whether to merge/dissolve adjacent geometries in collections before smoothing
merge_field str None GeoDataFrame only: Column name to use for dissolving geometries. Only valid when merge_collection=True. If None, dissolves all geometries together. If specified, dissolves geometries grouped by the column values
merge_multipolygons bool True Whether to merge adjacent polygons within MultiPolygons before smoothing
preserve_area bool True Whether to restore original area after smoothing via buffering (applies to Polygons only)
area_tolerance float 0.01 Percentage of original area allowed as error (e.g., 0.01 = 0.01% error = 99.99% preservation). Only affects Polygons when preserve_area=True

How It Works

Smoothify uses an advanced multi-step smoothing pipeline:

  1. Adds intermediate vertices along line segments (segmentize)
  2. Generates multiple rotated variants (for Polygons) to avoid artifacts
  3. Simplifies each variant to remove noise
  4. Applies Chaikin corner cutting to smooth
  5. Merges all variants via union to eliminate start-point artifacts
  6. Applies final smoothing pass
  7. Optionally restores original area via buffering (for Polygons)

Performance Considerations

  • Parallel Processing: For large GeoDataFrames or collections, use num_cores = 0 to enable parallel processing
  • Smoothing Iterations: Values of 3-5 typically provide good results. Higher values create smoother output but increase processing time and vertex count
  • Memory Usage: Scales with geometry complexity. The algorithm creates multiple variants during smoothing
  • Optimal segment_length: Should match the original raster cell size (pixel size) or be slightly larger for best results

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smoothify-0.2.0.tar.gz (27.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

smoothify-0.2.0-py3-none-any.whl (16.1 kB view details)

Uploaded Python 3

File details

Details for the file smoothify-0.2.0.tar.gz.

File metadata

  • Download URL: smoothify-0.2.0.tar.gz
  • Upload date:
  • Size: 27.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.15

File hashes

Hashes for smoothify-0.2.0.tar.gz
Algorithm Hash digest
SHA256 e6d332bff234c2292e996f768a03d3f9e8796ff39b087a11fb5120a2c64b5c8e
MD5 dfca1f187bbbb0e6a9827ff45ed77afe
BLAKE2b-256 3eca558961ed136ba7d237bbd56ec50184733be825dbd324544a338e00eaaef1

See more details on using hashes here.

File details

Details for the file smoothify-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: smoothify-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 16.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.15

File hashes

Hashes for smoothify-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0892ce73f5c1e3e56d8ba10fb37d56a6cc649e3868e54562492c5d2dbf5cb5dd
MD5 f68ad1ce35fc33057d3fb914ae4aca3b
BLAKE2b-256 837ce80bb1afb458d18fac88116296750c3ce05b8d20944de5340b1dfb6c798e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page