Python SDK for calling the ProxyML API
Project description
proxyml
Python SDK for the ProxyML API.
Status: Early access — server endpoints coming soon.
Request early access or star this repo to follow progress.
Why ProxyML?
Most explainability tools require sending your data to a third-party server. ProxyML never sees your training data. You generate synthetic data locally, score it with your own model, and only the surrogate model and summary statistics are uploaded — your data stays yours.
Installation
pip install proxyml
Setup
ProxyML requires an API key. Set it as an environment variable before importing the package:
export PROXYML_API_KEY="your-api-key"
Optionally override the base URL (defaults to https://api.proxyml.ai/api/v1):
export PROXYML_BASE_URL="https://api.proxyml.ai/api/v1"
Quick Start
import pandas as pd
import proxyml
from proxyml import get_schema
# 1. Load your dataset and generate a schema
df = pd.read_csv("data.csv")
schema = get_schema(df, immutable_cols=["age", "gender"])
# 2. Upload the schema
proxyml.put_schema(schema)
# 3. Generate synthetic training data
synth_df = proxyml.synthesize_data(num_points=500)
# 4. Score synthetic data with your black-box model
predictions = my_model.predict(synth_df.values.tolist())
# 5. Train a surrogate model
proxyml.train_surrogate(
samples=synth_df.values.tolist(),
predictions=predictions,
feature_names=list(synth_df.columns),
)
# 6. Find a counterfactual explanation
sample = df.iloc[0].tolist()
cf = proxyml.find_counterfactual(sample=sample, target=1, version=None)
if cf is not None:
original = synth_df.iloc[0].to_dict()
cf_dict = cf.iloc[0].to_dict()
current_pred = my_model.predict([sample])[0]
cf_pred = my_model.predict([cf.values.tolist()[0]])[0]
explanation = proxyml.interpret_counterfactual(
sample=original,
counterfactual=cf_dict,
prediction_changed=(current_pred != cf_pred),
)
print(explanation)
See docs/quickstart.md for a full walkthrough and docs/api.md for complete API reference.
Core Concepts
Surrogate model — A fast, interpretable model trained to approximate your black-box model's behavior on synthetic data. Once trained, it can be queried directly via the ProxyML API.
Counterfactual explanation — Given a prediction, a counterfactual is the minimal change to input features that would produce a different prediction. It answers: "What would have to be different for the outcome to change?"
Schema — Describes the statistical properties of each feature (type, range, distribution). Used to generate realistic synthetic data and constrain counterfactual search.
API Reference
| Function | Description |
|---|---|
get_schema(df, immutable_cols) |
Generate a schema dict from a DataFrame |
put_schema(schema) |
Upload a schema to the API |
synthesize_data(num_points, sample, as_df) |
Generate synthetic data points |
train_surrogate(samples, predictions, feature_names, task, test_size) |
Train a surrogate model |
predict(samples, version) |
Score samples with the surrogate model |
find_counterfactual(sample, target, ...) |
Find a counterfactual for a given sample |
interpret_counterfactual(sample, counterfactual, ...) |
Generate a human-readable explanation |
Full documentation: docs/api.md
Examples
examples/basic_usage.py— Schema upload, data synthesis, surrogate trainingexamples/counterfactual_example.py— Counterfactual search and interpretation
License
Apache 2.0 — see LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file proxyml-0.1.0.tar.gz.
File metadata
- Download URL: proxyml-0.1.0.tar.gz
- Upload date:
- Size: 17.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5089383d86bb06373c61815b2f2792263e2d5f86922bf349b70717d1d2ac1ad7
|
|
| MD5 |
a55c600922c1ed935ccde7231be78bbb
|
|
| BLAKE2b-256 |
3e6cf3a1489b865de461bdb193b89d2d9ba49bf297c9045e42e04d0bbe6b560c
|
File details
Details for the file proxyml-0.1.0-py3-none-any.whl.
File metadata
- Download URL: proxyml-0.1.0-py3-none-any.whl
- Upload date:
- Size: 13.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34bebf82bf40f63c6f485a464e867fdbd972baa4e306a98d2f4dc09d6e8dc5c1
|
|
| MD5 |
9b83dfb96652390883ba43b62c6d3293
|
|
| BLAKE2b-256 |
bd006127cdd4bebcd43cd419f28cb30ecdc86ec72530c46422c0eb6d55f27d10
|