Skip to main content

TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations

Project description

TabSTAR Logo

📚 TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields


Install

To fit a pretrained TabSTAR model to your own dataset, install the package using:

pip install tabstar

Quickstart Example

You can quickly get started with TabSTAR using the following example.

from importlib.resources import files
import pandas as pd
from sklearn.model_selection import train_test_split

from tabstar.tabstar_model import TabSTARClassifier

csv_path = files("tabstar").joinpath("resources", "imdb.csv")
x = pd.read_csv(csv_path)
y = x.pop('Genre_is_Drama')
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.1)
# For regression tasks, replace `TabSTARClassifier` with `TabSTARRegressor`.
tabstar = TabSTARClassifier()
tabstar.fit(x_train, y_train)
# tabstar.save("my_model_path.pkl")
# tabstar = TabSTARClassifier.load("my_model_path.pkl")
# y_pred = tabstar.predict(x_test)
metric = tabstar.score(X=x_test, y=y_test)
print(f"AUC: {metric:.4f}")

For paper replication, TabSTAR evaluation on benchmarks, custom pretraining or research purposes, see:

🔗 TabSTAR Research Repository

Citation

If you use TabSTAR in your work, please cite:

@inproceedings{
arazi2025tabstar,
title={Tab{STAR}: A Tabular Foundation Model for Tabular Data with Text Fields},
author={Alan Arazi and Eilam Shapira and Roi Reichart},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=FrXHdcTEzE}
}

License

MIT © Alan Arazi et al.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tabstar-1.1.16.tar.gz (150.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tabstar-1.1.16-py3-none-any.whl (150.9 kB view details)

Uploaded Python 3

File details

Details for the file tabstar-1.1.16.tar.gz.

File metadata

  • Download URL: tabstar-1.1.16.tar.gz
  • Upload date:
  • Size: 150.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for tabstar-1.1.16.tar.gz
Algorithm Hash digest
SHA256 ff8bd7943bb5f379138a776371cb76245ea2aad93a522886dc5abe25e4c036cf
MD5 20fb7e6ccccb0b56c2d89d570a7a727d
BLAKE2b-256 4a588b0bf01352685fdcdda83528bc2bb07abe37f874c65699033ee8af658048

See more details on using hashes here.

File details

Details for the file tabstar-1.1.16-py3-none-any.whl.

File metadata

  • Download URL: tabstar-1.1.16-py3-none-any.whl
  • Upload date:
  • Size: 150.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for tabstar-1.1.16-py3-none-any.whl
Algorithm Hash digest
SHA256 7be15f1f3501908ec4969da0f14d2ce62ba8eceeebbcf1bee6112cf444c3730e
MD5 351ec7da5f471681150a294adb25a031
BLAKE2b-256 babdbe3b86e17f0a8f73a67d71f28dcc5ac11ae7f7d6834bb6af61e0c60b5d91

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page