Skip to main content

SDK to access the DATFID API hosted on Hugging Face Spaces

Project description

DATFID SDK

A Python SDK to access the DATFID API to forecast your data.

Features

  • Easy model fitting: Build panel data models with time-dependent and static features.
  • Flexible lag handling: Specify lags for the dependent variable and selected features.
  • Forecasting: Generate future predictions with aligned timestamps and IDs.
  • Statistical options: Filter features by significance and apply mean-variance tests.
  • White box full interpretability: Get fully interpretable model with equation, estimated parameters, and standard errors.

Installation

pip install datfid

Usage

Before using the SDK, please request an access token by emailing admin@datfid.com or by visiting our website datfid.com.

from datfid import DATFIDClient

# Initialize the client with your DATFID token
client = DATFIDClient(token="your_DATFID_token")

# Fit a model
fit_result = client.fit_model(
    df=dataframe,
    id_col="name of id column",
    time_col="name of time column",
    y="name of dependent variable",
    lag_y="starting lag : ending lag",
    lagged_features={
        "feature 1": "starting lag : ending lag",
        "feature 2": "starting lag : ending lag"
    },
    current_features=["feature 3", "feature 4"],
    filter_by_significance=True/False,
    meanvar_test=True/False
)

# Generate forecasts
forecast_df = client.forecast_model(
    df_forecast=dataframe
)

# The forecast DataFrame contains the individual IDs and timestamps
# from the original data plus a "forecast" column with predicted values.

Example 1

Sample dataset from GitHub (Food and Beverages demand forecasting):

import pandas as pd
from datfid import DATFIDClient

# Initialize the client with your DATFID token
client = DATFIDClient(token="your_DATFID_token")

# Load dataset for model fitting
url_fit = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Food_Beverages.xlsx"
df = pd.read_excel(url_fit)

# Fit the model
result = client.fit_model(df=df,
                          id_col="Product",
                          time_col="Time",
                          y="Revenue",
                          current_features='all',
                          filter_by_significance=True
                          )

# Load dataset for forecasting
url_forecast = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Food_Beverages_forecast.xlsx"
df_forecast = pd.read_excel(url_forecast)

# Forecast revenue using the fitted model
forecast = client.forecast_model(df_forecast=df_forecast)

Example 2

Slightly larger sample dataset from GitHub (Banking sector, forecasting loan probability):

import pandas as pd
from datfid import DATFIDClient

# Initialize the client with your DATFID token
client = DATFIDClient(token="your_DATFID_token")

# Load dataset for model fitting
url_fit = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Banking_extended.xlsx"
df = pd.read_excel(url_fit)

# Fit the model
result = client.fit_model(df=df,
                          id_col="Individual",
                          time_col="Time",
                          y="Loan Probability",
                          lag_y="1:3",
                          lagged_features={"Income Level": "1:3"},
                          filter_by_significance=True)

# Load dataset for forecasting
url_forecast = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Banking_extended_forecast.xlsx"
df_forecast = pd.read_excel(url_forecast)

# Forecast loan probability using the fitted model
forecast = client.forecast_model(df_forecast=df_forecast)

API Reference

DATFIDClient

client = DATFIDClient(token: str)

Initialize the client with your DATFID token.

client.fit_model(df: pd.DataFrame, id_col: str, time_col: str, y: str, lag_y: Optional[Union[int, str, list[int]]] = None, lagged_features: Optional[Dict[str, int]] = None, current_features: Optional[list] = None, filter_by_significance: bool = False, meanvar_test: bool = False) -> SimpleNamespace

Fit a model using the provided dataset.

client.forecast_model(df_forecast: pd.DataFrame) -> pd.DataFrame

Generate forecasts using the fitted model.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datfid-0.1.27.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datfid-0.1.27-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file datfid-0.1.27.tar.gz.

File metadata

  • Download URL: datfid-0.1.27.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for datfid-0.1.27.tar.gz
Algorithm Hash digest
SHA256 374a306e03cf654f4eb35f7135538ebe87dd440e331c05b24c5c9028a1ae1981
MD5 dc631ad2ead3a9affecae169581a229f
BLAKE2b-256 c5428eb49d0d25feb308e6361a1f655d535b89b9ea2776538c0950595dda624c

See more details on using hashes here.

File details

Details for the file datfid-0.1.27-py3-none-any.whl.

File metadata

  • Download URL: datfid-0.1.27-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for datfid-0.1.27-py3-none-any.whl
Algorithm Hash digest
SHA256 79395495ab4989985b63d1325c69759a61531af31c3bcba2f38a3b7295be6d8f
MD5 6dfb23b8235d5c2b0d4252563b5d0bdc
BLAKE2b-256 71f7759183839643b702552d0442e216e58cf7f580f475e6aedea3e3947524f9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page