The official Python library for The Forecasting Company API

The Forecasting Company Python SDK

The Python SDK provides a simple interface for making forecasts with the TFC API.

Documentation

The REST API documentation can be found at https://api.retrocast.com/docs.

To get an API key, visit the Authentication docs. In the API Keys section you will find an option to Sign in or, if you are already signed in, a box containing your API key.
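
As the Usage section notes, the client reads the key from the TFC_API_KEY environment variable unless the api_key argument is set explicitly. The following is a minimal sketch of failing early when neither is available. resolve_api_key is a hypothetical helper written for illustration; it is not part of the SDK.

```python
import os
from typing import Optional


def resolve_api_key(explicit_key: Optional[str] = None) -> str:
    """Return the explicit key if given, else the TFC_API_KEY environment variable.

    Hypothetical helper for illustration; not part of the SDK.
    """
    key = explicit_key or os.getenv("TFC_API_KEY")
    if not key:
        raise RuntimeError(
            "No API key found: set TFC_API_KEY or pass api_key explicitly."
        )
    return key
```

The returned value could then be passed on as TFCClient(api_key=resolve_api_key()), so that a missing key surfaces before any request is made.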

Installation

# install from PyPI
pip install theforecastingcompany

Usage

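# Import paths are an assumption based on the package name and the "defined in utils" note below; adjust if they differ.
from theforecastingcompany import TFCClient
from theforecastingcompany.utils import TFCModels

# By default the client looks for the API key in os.getenv("TFC_API_KEY"). Otherwise you can explicitly set the api_key argument.
client = TFCClient()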

# Compute forecast for a single model
timesfm_df = client.forecast(
    train_df,
    model=TFCModels.TimesFM_2p5,  # StrEnum defined in utils. You can also pass the model name as a string, e.g. "timesfm-2p5"
    horizon=12,
    freq="W",
    quantiles=[0.5, 0.1, 0.9],
)

# Global model with static variables
tfc_global_df = client.forecast(
    train_df,
    model=TFCModels.TFCGlobal,
    horizon=12,
    freq="W",
    static_variables=["unique_id", "Group", "Vendor", "Category"],
    add_holidays=True,
    add_events=True,
    country_isocode="US",
    # Fit a separate global model for each group.
    # If None, a single global model is fitted to all time series.
    partition_by=["Group"],
)

If future variables are available, pass a future_df when forecasting and set the future_variables argument. Important: the same future_variables columns must be present in both train_df and future_df. For example, if you specify future_variables=["price", "promotion"], both columns must exist in the historical data (train_df) and in the future data (future_df).
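
The column requirement above can be checked before sending a request. The following is a minimal sketch that works on plain column names, so it runs without the SDK. missing_future_columns is a hypothetical helper, not an SDK function.

```python
from typing import Dict, Iterable, List


def missing_future_columns(
    train_columns: Iterable[str],
    future_columns: Iterable[str],
    future_variables: Iterable[str],
) -> Dict[str, List[str]]:
    """Report which future variables are absent from each dataframe's columns."""
    train_set, future_set = set(train_columns), set(future_columns)
    variables = list(future_variables)
    return {
        "train_df": [v for v in variables if v not in train_set],
        "future_df": [v for v in variables if v not in future_set],
    }


report = missing_future_columns(
    train_columns=["unique_id", "ds", "target", "price", "promotion"],
    future_columns=["unique_id", "ds", "price"],  # "promotion" was forgotten
    future_variables=["price", "promotion"],
)
print(report)  # {'train_df': [], 'future_df': ['promotion']}
```

With pandas dataframes, the first two arguments would be train_df.columns and future_df.columns.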

The cross_validate function takes the same arguments as forecast, plus an fcds argument that defines the FCDs to use for cross-validation. Its output dataframe also includes the target column.

Data Structure Requirements

train_df must contain the following columns. future_df needs the same id and date columns; in the future_df example in this section it carries no target column:

  • unique_id (or custom id_col): Unique identifier for each time series
  • ds (or custom date_col): Date/timestamp column (must be datetime format)
  • target (or custom target_col): The values to forecast

You can customize column names using the id_col, date_col, and target_col parameters if your data uses different naming conventions.
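
The requirements above can be verified locally before calling the API. The following is a minimal sketch, assuming that "datetime format" means a pandas datetime64 dtype; check_required_columns is a hypothetical helper and not part of the SDK.

```python
import pandas as pd


def check_required_columns(df, id_col="unique_id", date_col="ds", target_col="target"):
    """Return a list of problems; an empty list means the required columns look usable."""
    problems = []
    for col in (id_col, date_col, target_col):
        if col not in df.columns:
            problems.append(f"missing column: {col}")
    # Assumption: the date column must have a pandas datetime64 dtype.
    if date_col in df.columns and not pd.api.types.is_datetime64_any_dtype(df[date_col]):
        problems.append(f"column {date_col} is not a datetime dtype")
    return problems


df = pd.DataFrame({
    "unique_id": ["a", "a"],
    "ds": ["2024-01-01", "2024-01-02"],  # strings, not yet datetimes
    "target": [1.0, 2.0],
})
before = check_required_columns(df)
df["ds"] = pd.to_datetime(df["ds"])  # convert the date column
after = check_required_columns(df)
print(before, after)
```

The id_col, date_col, and target_col keyword names mirror the SDK parameters described above, so the same values can be passed to both the check and the forecast call.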

Example train_df with historical data, static variables (region, store_type), and future variables (price, promotion):

import pandas as pd

train_df = pd.DataFrame({
    "unique_id": ["store_1", "store_1", "store_1", "store_2", "store_2", "store_2"],
    "ds": pd.to_datetime(["2024-01-01", "2024-01-02", "2024-01-03", "2024-01-01", "2024-01-02", "2024-01-03"]),
    "target": [100, 105, 110, 200, 195, 205],
    "region": ["North", "North", "North", "South", "South", "South"],  # static variable
    "store_type": ["Premium", "Premium", "Premium", "Standard", "Standard", "Standard"],  # static variable
    "price": [9.99, 9.99, 9.99, 12.99, 12.99, 12.99],  # future variable (must be in train_df too!)
    "promotion": [0, 1, 0, 1, 0, 1]  # future variable (must be in train_df too!)
})

Example future_df with the same future variables price and promotion:

future_df = pd.DataFrame({
    "unique_id": ["store_1", "store_1", "store_2", "store_2"],
    "ds": pd.to_datetime(["2024-01-04", "2024-01-05", "2024-01-04", "2024-01-05"]),
    "price": [9.99, 8.99, 12.99, 11.99],  # future variable
    "promotion": [1, 0, 0, 1]  # future variable
})

# Forecasting with future variables
forecast_df = client.forecast(
    train_df,
    future_df=future_df,
    model=TFCModels.TFCGlobal,
    horizon=2,
    freq="D",
    static_variables=["region", "store_type"],
    future_variables=["price", "promotion"]  # These columns must exist in both train_df and future_df
)

Using custom column names:

# If your data uses different column names
my_data = pd.DataFrame({
    "item_id": ["A", "A", "B", "B"],
    "date": pd.to_datetime(["2024-01-01", "2024-01-02", "2024-01-01", "2024-01-02"]),
    "sales": [50, 55, 30, 32]
})

forecast_df = client.forecast(
    my_data,
    model=TFCModels.TimesFM_2p5,
    horizon=7,
    freq="D",
    id_col="item_id",      # specify your ID column
    date_col="date",       # specify your date column
    target_col="sales"     # specify your target column
)

Batch Size

Some models (such as chronos-2 and moirai-2) support batching multiple time series into a single request. You can control the batch size using the batch_size parameter:

forecast_df = client.forecast(
    train_df,
    model=TFCModels.Chronos_2,
    horizon=12,
    freq="D",
    batch_size=256  # default value
)

Increasing batch_size can speed up execution when forecasting many time series, but may increase the risk of timeouts or connection errors on the server side. If you encounter such errors, try decreasing the batch size.
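
The advice above can be automated by halving the batch size after each failure. The following is a sketch under stated assumptions: forecast_with_backoff is a hypothetical helper, and the exception types the SDK raises on timeouts or connection errors are not documented here, so the built-in TimeoutError and ConnectionError stand in as placeholders.

```python
def forecast_with_backoff(
    run_forecast,
    batch_size=256,
    min_batch_size=1,
    retry_on=(TimeoutError, ConnectionError),  # placeholder exception types (assumption)
):
    """Call run_forecast(batch_size), halving batch_size after each retryable error."""
    while True:
        try:
            return run_forecast(batch_size)
        except retry_on:
            if batch_size <= min_batch_size:
                raise  # nothing smaller left to try
            batch_size = max(min_batch_size, batch_size // 2)


# Stand-in for a real request: fails whenever the batch is larger than 64.
attempted = []

def fake_forecast(batch_size):
    attempted.append(batch_size)
    if batch_size > 64:
        raise TimeoutError("simulated server timeout")
    return f"ok with batch_size={batch_size}"


result = forecast_with_backoff(fake_forecast)
print(attempted, result)  # [256, 128, 64] ok with batch_size=64
```

With the real client, run_forecast could be a lambda such as lambda bs: client.forecast(train_df, model=TFCModels.Chronos_2, horizon=12, freq="D", batch_size=bs), with retry_on set to whatever exceptions you actually observe.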

Versioning

This package generally follows SemVer conventions, though certain backwards-incompatible changes may be released as minor versions:

  1. Changes that only affect static types, without breaking runtime behavior.
  2. Changes to library internals which are technically public but not intended or documented for external use. (Please open a GitHub issue to let us know if you are relying on such internals.)
  3. Changes that we do not expect to impact the vast majority of users in practice.

We take backwards-compatibility seriously and work hard to ensure you can rely on a smooth upgrade experience.

We are keen for your feedback; please open an issue with questions, bugs, or suggestions.

Determining the installed version

If you've upgraded to the latest version but aren't seeing the new features you expected, your Python environment is likely still using an older version.

You can determine the version that is being used at runtime with:

import theforecastingcompany
print(theforecastingcompany.__version__)
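
As an alternative sketch, the standard library's importlib.metadata reads the installed distribution's version without importing the package, which also shows whether the package is present in the active environment at all. installed_version is a hypothetical helper; the distribution name is taken from the pip install command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str = "theforecastingcompany") -> Optional[str]:
    """Return the installed version string, or None if the distribution is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


print(installed_version())
```

A result of None means the interpreter running the script has no such package installed, which usually points to a different virtual environment than the one you upgraded.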

Requirements

Python 3.11 or higher.
