Skip to main content

stubgen for xarray and pandas accessors

Project description

accessor_stubs

This package addresses a problem that's bothered me for a while a now. The pandas and xarray packages both discourage subclassing their datastructures, and recommend instead to use composition or accessor methods.

I really wanted to use accessor methods to add my domain specific functionality to the xarray ecosystem, but I really really like having IDE autocomplete that works. Unfortunately the dynamic nature of accessor methods makes them incompatible with the current language server static analysis methods.

To solve this autocomplete problem, I've created this package to create xarray type-stubs on the fly that follow PEP 561. By using the register_dataarray_accessor() and register_dataset_accessor() methods from this package instead of xarray, your accessors will be registered with accessor_stubs as well as xarray. Then, when you call accessor_stubs.stubgen(), new xarray type-stubs will be generated that have your accessor methods listed as properties in DataArray and Dataset type stubs.

This package tracks the version of xarray that is installed and will regenerate type stubs if you upgrade or downgrade your xarray version.

Right now, only xarray is supported, but maybe I'll add pandas in the future...

Usage

In your python module/package, register your accessor using accessor_stubs as:

# accessor.py
import xarray as xr
from accessor_stubs import register_dataset_accessor

@register_dataset_accessor("sneaky")
class SneakyAccessor:
    def __init__(self, xarray_obj):
        self._obj = xarray_obj
        self._sneaky_msg = None

    @property
    def sneak(self):
        """Return a sneaky message"""
        if self._sneaky_msg is None:
            self._sneaky_msg = "👟"
        return self._sneaky_msg

In the top level init.py module of your package from where all your accessors are initialized, run the stubgen() method in accessor_stubs. This will run mypy stubgen on the xarray dataset.py and dataarray.py files in your xarray installation, then add your registered accessors to the stub files. It's very important that you run stubgen() after all your accessors have been registered.

# __init__.py
import accessor # the module where my accessor is created and registered
from accessor_stubs import stubgen
stubgen()

Now after you import your package, new xarray stubs will be generated with your accessors, and your IDE should be able to recognize your accessors as properties of the xarray datastructure classes:

vscode_screenshot

The generated dataset.pyi type-stub looks something like this:

from accessor import SneakyAccessor
import datetime
import numpy as np
import pandas as pd
...

class Dataset(DataWithCoords, DatasetAggregations, DatasetArithmetic, Mapping[Hashable, 'DataArray']):
    sneaky: SneakyAccessor
    def __init__(self, data_vars: DataVars | None = None, coords: Mapping[Any, Any] | None = None, attrs: Mapping[Any, Any] | None = None) -> None: ...
    def __eq__(self, other: DsCompatible) -> Self: ...
    @classmethod
    def load_store(cls, store, decoder: Incomplete | None = None) -> Self: ...
    @property
    def variables(self) -> Frozen[Hashable, Variable]: ...
    ...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

accessor_stubs-0.0.1-py3-none-any.whl (6.4 kB view details)

Uploaded Python 3

File details

Details for the file accessor_stubs-0.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for accessor_stubs-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bc42947450735695fb9fd7d67af7145ada8a16fab76af6428e2c55285bd60c52
MD5 660c46625bc7f62846e661ddedd71848
BLAKE2b-256 d8f4b89dcd3a9cc12642073683c7fb882cbd8b9e0354cc7c3622b917b681aa1d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page