Skip to main content

Automated validation toolkit for tabular ML models—MRM, credit risk, insurance, and other regulated use cases.

Project description

TanML: Automated Model Validation Toolkit for Tabular Machine Learning Models

PyPI Downloads Python Version OS Documentation License: MIT Give Feedback DOI

TanML bridges the gap between data science tools (for building models) and governance requirements (for validating them). It's not just an ML library—it's a validation workflow with built-in documentation.


📖 Documentation

For detailed guides, API reference, and tutorials, visit our official documentation:

👉 https://tdlabs-ai.github.io/tanml/


Table of Contents


Why TanML?

  • End-to-end workflow: Data Profiling → Data Preprocessing → Feature Power Ranking → Model Development → Evaluation → Reports—all in one UI.
  • Audit-ready reports: Generate editable .docx documents for stakeholders and compliance reviews.
  • Built for regulated industries: Designed for MRM, credit risk, insurance, and SR 11-7 contexts.
  • No code required: Fully UI-driven—no Python knowledge needed.
  • Robust evaluation: Drift detection, stress testing, SHAP explainability, cluster coverage.
  • Works with your stack: Scikit-learn, XGBoost, LightGBM, CatBoost, Statsmodels.

Comparison to Similar Tools

Feature TanML Evidently AI Deepchecks AutoML (PyCaret/AutoGluon) YData Profiling
Primary Focus Model Risk Management Monitoring Testing / CI Model Training Data EDA Only
Scope Data + Model + Governance Data + Model Data + Model Model Building Data Only
Output Audit-Ready .docx Dashboards Reports Models HTML Report
Drift Logic PSI & KS (Regulatory) Stat. Tests Multiple N/A Warnings

Install

We strongly recommend using a virtual environment to isolate dependencies:

# Create and activate virtual environment
python -m venv .venv
source .venv/bin/activate  # Linux/Mac
# .venv\Scripts\activate   # Windows

# Install TanML
pip install tanml

# Upgrade an existing installation
python -m pip install --upgrade tanml

Run TanML Locally

If you are running TanML directly on your own machine, start the UI with:

tanml ui

If the tanml command is not available in your shell, you can also try:

python -m tanml ui

Then open the following in your browser: http://localhost:8501

Remote Access (SSH)

If you are using standard SSH from a terminal, you may need to forward port 8501 manually. If you are using VS Code Remote SSH, VS Code may detect and forward the TanML UI port automatically.

Standard SSH from a terminal

From your local machine, connect to the remote server with port forwarding enabled:

ssh -L 8501:localhost:8501 user@remote-server-ip

Then, on the remote server, run:

tanml ui

After that, open the following in your local browser:

http://localhost:8501

VS Code Remote SSH

If you are connected through VS Code Remote SSH, run tanml ui in the remote terminal. VS Code may automatically detect port 8501 and offer to forward it. If it does not, open the Ports panel in VS Code, choose Forward a Port, enter 8501, and then open the forwarded local address in your browser.


Quick Start (UI)

tanml ui

Library Usage (Python)

You can also use TanML's analysis engine directly in your Python scripts:

import pandas as pd
from tanml.analysis import analyze_drift

# Load your data
train_df = pd.read_csv("train.csv")
test_df = pd.read_csv("test.csv")

# Run drift analysis (PSI, KS Test, etc.)
results = analyze_drift(train_df, test_df)
print(results)

Testing

TanML uses pytest for automated unit testing. To run tests locally:

# 1. Navigate to the project root
cd ./tanml/

# 2. Install with dev dependencies
pip install -e ".[dev]"

# 3. Run all tests
pytest tests/

We also use ruff for linting and mypy for type checking. These are integrated into our CI pipeline.


Optional CLI Flags

Most users just run tanml ui. These help on teams/servers:

# Share on LAN
tanml ui --public

> [!WARNING]
> Using `--public` makes the UI accessible to anyone on your network. Only use this on trusted networks (like a VPN or secure office LAN) and never on public Wi-Fi.

# Different port
tanml ui --port 9000

# Headless (server/CI; no auto-open browser)
tanml ui --headless

# Larger limit (e.g., 2 GB)
tanml ui --max-mb 2048

Env var equivalents (Linux/macOS bash):

TANML_SERVER_ADDRESS=0.0.0.0 TANML_PORT=9000 TANML_MAX_MB=2048 tanml ui

Windows PowerShell:

$env:TANML_SERVER_ADDRESS="0.0.0.0"; $env:TANML_PORT="9000"; $env:TANML_MAX_MB="2048"; tanml ui

Defaults: address 127.0.0.1, port 8501, limit 2048 MB, telemetry OFF.


Reports

TanML generates audit ready reports (.docx) programmatically:

  • Model Development Report — Cross-validation metrics, diagnostics, and performance summary
  • Model Evaluation Report — Train/Test comparison, drift analysis, stress testing, SHAP explainability
  • Feature Power Ranking Report — Feature importance scores, correlation analysis

Reports are generated via tanml/ui/reports/generators.py and exported directly from the UI.


Data Privacy

  • TanML runs locally; no data is sent to external services.
  • Telemetry is disabled by default (and can be forced off via --no-telemetry).
  • Security Note: Running with the --public flag exposes the app to your local network. Ensure you are on a secure connection.
  • UI artifacts and reports are written under ./tanml_runs/<session>/ in your working directory.

Troubleshooting

  • Page didn’t open? Visit http://127.0.0.1:8501 or run tanml ui --port 9000.
  • Large CSVs are slow/heavy? Prefer Parquet; CSV → DataFrame can use several GB RAM.
  • Artifacts missing? Check ./tanml_runs/<session>/artifacts/.
  • Corporate networks: use tanml ui --public to share on LAN.

Contributing

We welcome issues and PRs! See CONTRIBUTING.md for detailed setup instructions.

  1. Clone & Setup: git clone https://github.com/tdlabs-ai/tanml.git
  2. Environment: python -m venv .venv && source .venv/bin/activate
  3. Install: pip install -e .[dev]
  4. Test: pytest

License & Citation

License: MIT. See LICENSE.
SPDX-License-Identifier: MIT

© 2025 Tanmay Sah and Dolly Sah. You may use, modify, and distribute this software with appropriate attribution.

How to cite

If TanML helps your work or publications, please cite:

Sah, T., & Sah, D. (2025). TanML: Automated Model Validation Toolkit for Tabular Machine Learning Models [Software]. Zenodo. https://doi.org/10.5281/zenodo.17317165

Or in BibTeX (version-agnostic):

@software{tanml_2026,
  author       = {Sah, Tanmay and Sah, Dolly},
  title        = {TanML: Automated Model Validation Toolkit for Tabular Machine Learning Models},
  year         = {2026},
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.17317165},
  url          = {https://doi.org/10.5281/zenodo.17317165},
  license      = {MIT}
}

A machine-readable citation file (CITATION.cff) is included for citation tools and GitHub’s “Cite this repository” button.


AI Disclosure

Portions of this codebase (including tests, documentation, and the data_loader module) were refactored and generated with the assistance of Large Language Models (LLMs). All AI-generated contributions have been reviewed and verified by the human maintainers.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tanml-0.2.0.tar.gz (129.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tanml-0.2.0-py3-none-any.whl (140.3 kB view details)

Uploaded Python 3

File details

Details for the file tanml-0.2.0.tar.gz.

File metadata

  • Download URL: tanml-0.2.0.tar.gz
  • Upload date:
  • Size: 129.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for tanml-0.2.0.tar.gz
Algorithm Hash digest
SHA256 ea065bcd065510e1eabc475d2f11fcc64b17fbe6b992654a0d8eac67656cad77
MD5 346ff1cf74c1176991fa04fdcb81ae0a
BLAKE2b-256 af1f25f600e65846ce337f5623d69c65d5d7881aa05e43ace6982d743439ef2b

See more details on using hashes here.

Provenance

The following attestation bundles were made for tanml-0.2.0.tar.gz:

Publisher: release.yml on tdlabs-ai/tanml

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file tanml-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: tanml-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 140.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for tanml-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 967675c231ab01f3d3ce3e04f6c9517e319059b68da4fcef58e3b0e2f5fcbbb4
MD5 3421a5463c2618341e47bb9ad018d91e
BLAKE2b-256 16fe9a02e4249f039fe28468fbe6482e75b28019b2898d78bb41a0f93f91c722

See more details on using hashes here.

Provenance

The following attestation bundles were made for tanml-0.2.0-py3-none-any.whl:

Publisher: release.yml on tdlabs-ai/tanml

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page