Skip to main content

SDK to access the DATFID API hosted on Hugging Face Spaces

Project description

DATFID SDK

A Python SDK to access the DATFID API to forecast your data.

Features

  • Easy model fitting: Build panel data models with time-dependent and static features.
  • Flexible lag handling: Specify lags for the dependent variable and selected features.
  • Forecasting: Generate future predictions with aligned timestamps and IDs.
  • Statistical options: Filter features by significance and apply mean-variance tests.
  • White box full interpretability: Get fully interpretable model with equation, estimated parameters, and standard errors.

Installation

pip install datfid

Usage

Before using the SDK, please request an access token by emailing admin@datfid.com or by visiting our website datfid.com.

from datfid import DATFIDClient

# Initialize the client with your DATFID token
client = DATFIDClient(token="your_DATFID_token")

# Fit a model
fit_result = client.fit_model(
    df=dataframe,
    id_col="name of id column",
    time_col="name of time column",
    y="name of dependent variable",
    lag_y="starting lag : ending lag",
    lagged_features={
        "feature 1": "starting lag : ending lag",
        "feature 2": "starting lag : ending lag"
    },
    current_features=["feature 3", "feature 4"],
    filter_by_significance=True/False,
    meanvar_test=True/False
)

# Generate forecasts
forecast_df = client.forecast_model(
    df_forecast=dataframe
)

# The forecast DataFrame contains the individual IDs and timestamps
# from the original data plus a "forecast" column with predicted values.

Example 1

Sample dataset from GitHub (Food and Beverages demand forecasting):

import pandas as pd
from datfid import DATFIDClient

# Initialize the client with your DATFID token
client = DATFIDClient(token="your_DATFID_token")

# Load dataset for model fitting
url_fit = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Food_Beverages.xlsx"
df = pd.read_excel(url_fit)

# Fit the model
result = client.fit_model(df=df,
                          id_col="Product",
                          time_col="Time",
                          y="Revenue",
                          current_features='all',
                          filter_by_significance=True
                          )

# Load dataset for forecasting
url_forecast = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Food_Beverages_forecast.xlsx"
df_forecast = pd.read_excel(url_forecast)

# Forecast revenue using the fitted model
forecast = client.forecast_model(df_forecast=df_forecast)

Example 2

Slightly larger sample dataset from GitHub (Banking sector, forecasting loan probability):

import pandas as pd
from datfid import DATFIDClient

# Initialize the client with your DATFID token
client = DATFIDClient(token="your_DATFID_token")

# Load dataset for model fitting
url_fit = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Banking_extended.xlsx"
df = pd.read_excel(url_fit)

# Fit the model
result = client.fit_model(df=df,
                          id_col="Individual",
                          time_col="Time",
                          y="Loan Probability",
                          lag_y="1:3",
                          lagged_features={"Income Level": "1:3"},
                          filter_by_significance=True)

# Load dataset for forecasting
url_forecast = "https://raw.githubusercontent.com/datfid-valeriidashuk/sample-datasets/main/Banking_extended_forecast.xlsx"
df_forecast = pd.read_excel(url_forecast)

# Forecast loan probability using the fitted model
forecast = client.forecast_model(df_forecast=df_forecast)

API Reference

DATFIDClient

client = DATFIDClient(token: str)

Initialize the client with your DATFID token.

client.fit_model(df: pd.DataFrame, id_col: str, time_col: str, y: str, lag_y: Optional[Union[int, str, list[int]]] = None, lagged_features: Optional[Dict[str, int]] = None, current_features: Optional[list] = None, filter_by_significance: bool = False, meanvar_test: bool = False) -> SimpleNamespace

Fit a model using the provided dataset.

client.forecast_model(df_forecast: pd.DataFrame) -> pd.DataFrame

Generate forecasts using the fitted model.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datfid-0.1.24.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datfid-0.1.24-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file datfid-0.1.24.tar.gz.

File metadata

  • Download URL: datfid-0.1.24.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for datfid-0.1.24.tar.gz
Algorithm Hash digest
SHA256 60918b2b609bc606127af21419a908cf79ff5be103c8fb6783c58c62110b2383
MD5 576508662ec1640f2e24d675fffecc60
BLAKE2b-256 fc20cb1636e0bd56cbdc5c4d63b088312b375cae7c30f0a42adfc4e19cdfe7b0

See more details on using hashes here.

File details

Details for the file datfid-0.1.24-py3-none-any.whl.

File metadata

  • Download URL: datfid-0.1.24-py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for datfid-0.1.24-py3-none-any.whl
Algorithm Hash digest
SHA256 a18bcd38734f12237547306268adf6ab08b5cbb07eac80786834573c8805405c
MD5 d9fcd34ae3b1ceae3e51a1ff5797645f
BLAKE2b-256 1b5e5848e3e6a02bf500719ae2c0be95195abee810b0b9041c282522c1029ea4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page