Skip to main content

Python SDK for calling the ProxyML API

Project description

proxyml

Python SDK for the ProxyML API.

Status: Early access — server endpoints coming soon.
Request early access or star this repo to follow progress.

creditg_car_counterfactual

Why ProxyML?

Most explainability tools require sending your data to a third-party server. ProxyML never sees your training data. You generate synthetic data locally, score it with your own model, and only the surrogate model and summary statistics are uploaded — your data stays yours.

Installation

pip install proxyml

Setup

ProxyML requires an API key. Set it as an environment variable before importing the package:

export PROXYML_API_KEY="your-api-key"

Optionally override the base URL (defaults to https://api.proxyml.ai/api/v1):

export PROXYML_BASE_URL="https://api.proxyml.ai/api/v1"

Quick Start

import pandas as pd
import proxyml
from proxyml import get_schema

# 1. Load your dataset and generate a schema
df = pd.read_csv("data.csv")
schema = get_schema(df, immutable_cols=["age", "gender"])

# 2. Upload the schema
proxyml.put_schema(schema)

# 3. Generate synthetic training data
synth_df = proxyml.synthesize_data(num_points=500)

# 4. Score synthetic data with your black-box model
predictions = my_model.predict(synth_df.values.tolist())

# 5. Train a surrogate model
proxyml.train_surrogate(
    samples=synth_df.values.tolist(),
    predictions=predictions,
    feature_names=list(synth_df.columns),
)

# 6. Find a counterfactual explanation
sample = df.iloc[0].tolist()
cf = proxyml.find_counterfactual(sample=sample, target=1, version=None)

if cf is not None:
    original = synth_df.iloc[0].to_dict()
    cf_dict = cf.iloc[0].to_dict()
    current_pred = my_model.predict([sample])[0]
    cf_pred = my_model.predict([cf.values.tolist()[0]])[0]

    explanation = proxyml.interpret_counterfactual(
        sample=original,
        counterfactual=cf_dict,
        prediction_changed=(current_pred != cf_pred),
    )
    print(explanation)

See docs/quickstart.md for a full walkthrough and docs/api.md for complete API reference.

Core Concepts

Surrogate model — A fast, interpretable model trained to approximate your black-box model's behavior on synthetic data. Once trained, it can be queried directly via the ProxyML API.

Counterfactual explanation — Given a prediction, a counterfactual is the minimal change to input features that would produce a different prediction. It answers: "What would have to be different for the outcome to change?"

Schema — Describes the statistical properties of each feature (type, range, distribution). Used to generate realistic synthetic data and constrain counterfactual search.

API Reference

Function Description
get_schema(df, immutable_cols) Generate a schema dict from a DataFrame
put_schema(schema) Upload a schema to the API
synthesize_data(num_points, sample, as_df) Generate synthetic data points
train_surrogate(samples, predictions, feature_names, task, test_size) Train a surrogate model
predict(samples, version) Score samples with the surrogate model
find_counterfactual(sample, target, ...) Find a counterfactual for a given sample
interpret_counterfactual(sample, counterfactual, ...) Generate a human-readable explanation

Full documentation: docs/api.md

Examples

License

Apache 2.0 — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

proxyml-0.1.8.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

proxyml-0.1.8-py3-none-any.whl (17.0 kB view details)

Uploaded Python 3

File details

Details for the file proxyml-0.1.8.tar.gz.

File metadata

  • Download URL: proxyml-0.1.8.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for proxyml-0.1.8.tar.gz
Algorithm Hash digest
SHA256 e2ad85fd10791da222ac5b69483ff67cfedd4de6593e5acc76e7c6693456969e
MD5 699cf43a1976834e763d39126cab5d8f
BLAKE2b-256 e8d4915c6939cce3e4cbe48c8017a6da4bf41b3ab2038372af42a441647f6341

See more details on using hashes here.

File details

Details for the file proxyml-0.1.8-py3-none-any.whl.

File metadata

  • Download URL: proxyml-0.1.8-py3-none-any.whl
  • Upload date:
  • Size: 17.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for proxyml-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 d1ab9c32b0346b17b83ba372e40d9eb0fe3528a403f607706f51d18205b0e7b4
MD5 402d63c9402ba4a378963a6216bfae64
BLAKE2b-256 0bc55a65ea79fba30fbee2bb4be749ab4b691e6d271fc61c98243dbdb1088298

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page