Skip to main content

Object-based analysis functions for fractal dimensions and size distributions

Project description

Correlation dimension example from DeWitt et al. (2025) Figure from DeWitt et al. (2025)

objscale

Object-based analysis functions for fractal dimensions and size distributions in atmospheric sciences and beyond. Optimized for large datasets.

Description

objscale provides computational tools for analyzing the scaling properties of objects in 2D binary arrays. The package consolidates methods for calculating size distributions and fractal dimensions that account for finite domain effects and complex boundary conditions. Originally developed for atmospheric science applications, these methods apply broadly to any field where object scaling properties matter.

The package implements methods from two main papers:

See the interactive scaling explorer to visualize how the correlation dimension is computed.

Key Functions

finite_array_powerlaw_exponent

Calculate power-law exponents for size distributions while accounting for finite domain truncation effects. Essential for accurate scaling analysis in bounded domains.

individual_fractal_dimension

Fractal dimension of individual objects using the perimeter-area relationship, with proper handling of interior holes and resolution effects.

individual_correlation_dimension

Correlation dimension of a single object. Isolates the Nth largest structure in an array (after removing border-touching structures) and computes its correlation dimension.

ensemble_correlation_dimension

Correlation dimension for characterizing the collective scaling properties of object ensembles. Uses point-pair correlation analysis across multiple length scales.

ensemble_box_dimension

Box-counting dimension for object ensembles. New analyses should prefer ensemble_correlation_dimension. Counts boxes containing object boundaries at varying spatial scales.

Installation

pip install objscale

Documentation

📖 Full Documentation

Complete API reference, detailed examples, and usage guides are available at objscale.readthedocs.io.

Agent Skill (Highly Recommended for Agents)

An agent skill is included in this repository. For Claude Code:

mkdir -p ~/.claude/skills/objscale
cp agent-skills/objscale/SKILL.md ~/.claude/skills/objscale/

Codex:

mkdir -p ~/.codex/skills/objscale
cp agent-skills/objscale/SKILL.md ~/.codex/skills/objscale/

Quick Example

import objscale
import numpy as np

# Create binary array (e.g., cloud mask, percolation lattice)
arrays = [(np.random.random((1000, 1000)) < 0.3).astype(int) for _ in range(4)]

# Size distribution with finite domain corrections
(exponent, error), (log10_sizes, log10_counts) = objscale.finite_array_powerlaw_exponent(
    arrays, 'area', return_counts=True
)

# Ensemble fractal dimensions
corr_dim, corr_error = objscale.ensemble_correlation_dimension(arrays)
box_dim, box_error = objscale.ensemble_box_dimension(arrays)

# Individual object analysis  
ind_dim, ind_error = objscale.individual_fractal_dimension(arrays)

Features

  • Finite domain corrections: Proper handling of truncation effects at domain boundaries as recommended by DeWitt & Garrett (2024)
  • Multiple size metrics: Area, perimeter, width, height, nested perimeter
  • Arbitrary boundaries: Support for NaN-demarcated non-rectangular domains
  • Individual and Ensemble methods: Characterize both individual and collective properties of object fields
  • Performance optimized: Numba acceleration for computational efficiency. Can handle billions of individual objects on a mid-range laptop.

Requirements

  • Python ≥ 3.8
  • NumPy ≥ 1.20.0
  • SciPy ≥ 1.7.0
  • scikit-image ≥ 0.18.0
  • Numba ≥ 0.56.0

Available Functions

Fractal Dimensions

  • individual_fractal_dimension - Fractal dimension of individual objects (perimeter-area scaling)
  • individual_correlation_dimension - Correlation dimension of a single object
  • ensemble_correlation_dimension - Correlation dimension for object ensembles
  • ensemble_box_dimension - Box-counting dimension for object ensembles

Size Distributions

  • finite_array_powerlaw_exponent - Power-law exponents with finite domain corrections
  • finite_array_size_distribution - Size distributions with truncation analysis
  • array_size_distribution - Basic size distribution for single arrays

Object Analysis

  • label_structures - Label connected components (wraps scipy.ndimage.label with NaN handling and periodic boundaries)
  • get_structure_areas - Calculate areas of labelled structures (O(n), fast)
  • get_structure_perimeters - Calculate perimeters of labelled structures (O(n), fast)
  • get_structure_height_width - Calculate height and width of labelled structures
  • get_structure_props - Calculate perimeter, area, width, height from a binary array (convenience wrapper)
  • get_every_boundary_perimeter - Perimeters of every boundary including nested holes
  • total_perimeter - Total perimeter of all objects
  • total_number - Count number of structures
  • isolate_nth_largest_structure - Extract the Nth largest connected structure
  • remove_structures_touching_border_nan - Remove border-touching structures
  • remove_structure_holes - Fill holes in structures
  • label_size - Label each structure with its size value
  • clear_border_adjacent - Clear structures touching array edges

Utilities

  • coarsen_array - Coarsen array resolution by averaging
  • linear_regression - Linear regression with error estimates
  • encase_in_value - Add border of specified value around array
  • get_coords_of_boundaries - Find boundary pixel coordinates (toroidal topology)
  • get_locations_from_pixel_sizes - Convert pixel size arrays to cumulative locations
  • set_num_threads - Set number of threads for parallel computations

For detailed parameter descriptions and usage examples, see the full documentation or use help(objscale.function_name) or objscale.function_name? in IPython/Jupyter.

Support Statement

This package consolidates research code developed over several years. While functional and tested, it should be considered research software with limited ongoing support. Users are encouraged to understand the underlying methods through the referenced papers before applying to their data.

References

If you use this package, please cite:

DeWitt, T. D. and Garrett, T. J.: Finite domains cause bias in measured and modeled distributions of cloud sizes, Atmos. Chem. Phys., 24, 8457–8472, https://doi.org/10.5194/acp-24-8457-2024, 2024.

Author

Thomas D. DeWitt University of Utah Department of Atmospheric Sciences

Sonnet 4 with Claude Code Anthropic

License

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

objscale-1.3.0.tar.gz (29.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

objscale-1.3.0-py3-none-any.whl (28.7 kB view details)

Uploaded Python 3

File details

Details for the file objscale-1.3.0.tar.gz.

File metadata

  • Download URL: objscale-1.3.0.tar.gz
  • Upload date:
  • Size: 29.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for objscale-1.3.0.tar.gz
Algorithm Hash digest
SHA256 48c0bcf8783f27ceb6e4dca1238824ccec8a25992db3027e182de37a84ffc15f
MD5 42bab837cfa58dee87729213a6d4ceba
BLAKE2b-256 1505b4c782d108533dc56049b29ff81b30e70e73684243a8900b628d091dfb56

See more details on using hashes here.

File details

Details for the file objscale-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: objscale-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 28.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for objscale-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f5019b1708cbf52aafd90f85bca4a1bb97823bc88d15557947a97723a5e51aa7
MD5 49c0aba9e3ec6ea6dafd9cb6899d08ff
BLAKE2b-256 d2923cf35f795c65d681fa8b1bc111e2eaccf0b31a5eea50cf5e4c77855d3a11

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page