
Multimodal TimesFM

A multimodal extension of Google's TimesFM for time series forecasting with text inputs.

Installation

pip install multimodal-timesfm

Quick Start

1. Define Custom Dataset

Implement a custom dataset by extending MultimodalDatasetBase:

from pathlib import Path
from typing import Literal
import numpy as np
from multimodal_timesfm.multimodal_dataset import MultimodalDatasetBase

class CustomDataset(MultimodalDatasetBase):
    """Custom dataset for your multimodal time series data."""

    def __init__(
        self,
        data_dir: Path,
        split_ratio: float = 0.8,
        split: Literal["train", "test"] = "train",
        patch_len: int = 32,
        context_len: int = 128,
        horizon_len: int = 32,
    ):
        super().__init__(data_dir, split_ratio, split, patch_len, context_len, horizon_len)

    def _load_data(self) -> None:
        """Load and process your custom data format.

        Populate self.data with dictionaries containing:
        - context: np.ndarray of shape (context_len,) - historical time series values
        - future: np.ndarray of shape (horizon_len,) - target future values
        - freq: int - frequency indicator (0=daily, 1=weekly/monthly, 2=quarterly+)
        - patched_texts: list of lists - text organized by temporal patches
        - metadata: dict - additional sample information
        """
        # Your custom data loading logic
        for sample in self._read_your_data_files():
            # Organize texts by patches (one list per patch)
            num_patches = self.context_len // self.patch_len
            patched_texts = [[] for _ in range(num_patches)]

            # Assign text descriptions to appropriate patches
            for text_item in sample["texts"]:
                patch_idx = self._get_patch_index(text_item["timestamp"])
                patched_texts[patch_idx].append(text_item["text"])

            self.data.append({
                "context": sample["historical_values"],  # shape: (context_len,)
                "future": sample["target_values"],       # shape: (horizon_len,)
                "freq": sample["frequency"],             # 0, 1, or 2
                "patched_texts": patched_texts,          # list of text lists
                "metadata": sample["info"]
            })
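
`_read_your_data_files` and `_get_patch_index` above are placeholders for your own loading logic. A minimal sketch of the patch-index helper, assuming each text item's timestamp is a 0-based offset into the context window (both the helper name and this timestamp convention are illustrative, not part of the library API):

    def _get_patch_index(self, timestamp: int) -> int:
        """Map a 0-based offset within the context window to its patch index.

        Hypothetical helper: with patch_len=32, offsets 0-31 fall in patch 0,
        32-63 in patch 1, and so on.
        """
        num_patches = self.context_len // self.patch_len
        # Clamp so offsets at the context boundary land in the last patch.
        return min(timestamp // self.patch_len, num_patches - 1)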

2. Train the Model

from pathlib import Path

from multimodal_timesfm.multimodal_patched_decoder import MultimodalPatchedDecoder, MultimodalTimesFMConfig
from multimodal_timesfm.trainer import MultimodalTrainer

# Create datasets (data_dir is typed as Path in CustomDataset)
train_dataset = CustomDataset(data_dir=Path("path/to/data"), split="train")
val_dataset = CustomDataset(data_dir=Path("path/to/data"), split="test")

# Initialize model with pretrained TimesFM weights
config = MultimodalTimesFMConfig(
    text_encoder_type="english",  # or "japanese"
    context_len=128,
    horizon_len=32,
    input_patch_len=32,
)
model = MultimodalPatchedDecoder(config)

# Load pretrained TimesFM checkpoint
model.load_pretrained_timesfm("path/to/timesfm_checkpoint.ckpt")

# Train
trainer = MultimodalTrainer(
    model=model,
    train_dataset=train_dataset,
    val_dataset=val_dataset,
    batch_size=8,
    gradient_accumulation_steps=4,  # effective batch size: 8 * 4 = 32
    learning_rate=1e-4,
    weight_decay=0.01,
    log_dir=Path("logs"),
    checkpoint_dir=Path("checkpoints"),
    wandb_project="my-project",
    wandb_run_name="experiment-1"
)

# Train for 10 epochs, saving a checkpoint every 5 epochs (after epochs 5 and 10)
trainer.train(num_epochs=10, save_frequency=5)

3. Evaluate the Model

import torch
from torch.utils.data import DataLoader

from multimodal_timesfm.evaluation import evaluate_multimodal_model
from multimodal_timesfm.utils.collate import multimodal_collate_fn

# Create test dataset
test_dataset = CustomDataset(data_dir=Path("path/to/data"), split="test")
test_loader = DataLoader(
    test_dataset,
    batch_size=8,
    shuffle=False,
    collate_fn=multimodal_collate_fn
)

# Load trained model
model = MultimodalPatchedDecoder(config)
checkpoint = torch.load("checkpoints/best_model.pt")
model.load_state_dict(checkpoint["model_state_dict"])

# Evaluate
metrics = evaluate_multimodal_model(model, test_loader, device="cuda")
print(f"MSE: {metrics['mse']:.4f}")
print(f"MAE: {metrics['mae']:.4f}")

4. Visualize Predictions

import matplotlib.pyplot as plt
import torch

# Get predictions for visualization
model.eval()
sample = test_dataset[0]
with torch.no_grad():
    prediction = model.forecast(
        context=torch.tensor(sample["context"]).unsqueeze(0),
        text_inputs=[sample["patched_texts"]],
        freq=torch.tensor([sample["freq"]])
    )

# Plot
plt.figure(figsize=(12, 4))
context_len = len(sample["context"])
horizon_len = len(sample["future"])

# Plot context
plt.plot(range(context_len), sample["context"], label="Context", color="blue")

# Plot ground truth
plt.plot(range(context_len, context_len + horizon_len),
         sample["future"], label="Ground Truth", color="green")

# Plot prediction
plt.plot(range(context_len, context_len + horizon_len),
         prediction[0].cpu().numpy(), label="Prediction", color="red", linestyle="--")

plt.xlabel("Time")
plt.ylabel("Value")
plt.legend()
plt.title("Multimodal TimesFM Forecast")
plt.savefig("forecast_visualization.png")

5. Inference on New Data

import numpy as np

from multimodal_timesfm import MultimodalTimesFM, TimesFmHparams, MultimodalTimesFMConfig

# Load trained model for inference
hparams = TimesFmHparams(context_len=128, horizon_len=32)
config = MultimodalTimesFMConfig(text_encoder_type="english")
model = MultimodalTimesFM(hparams, config, "checkpoints/best_model.pt")

# Prepare new data
time_series_data = np.array([...])  # Your time series context
# One text list per patch: with context_len=128 and input_patch_len=32,
# the context spans 4 patches, so provide 4 lists (empty lists are fine)
text_descriptions = [[
    ["High volatility expected"],         # Texts for patch 1
    ["Market uncertainty increasing"],    # Texts for patch 2
    ["Economic indicators show growth"],  # Texts for patch 3
    [],                                   # Texts for patch 4 (none available)
]]

# Generate forecast
forecasts, quantiles = model.forecast(
    inputs=[time_series_data],
    text_descriptions=text_descriptions,
    freq=[0],  # 0=daily, 1=weekly/monthly, 2=quarterly+
    forecast_context_len=128
)

print(f"Forecast shape: {forecasts.shape}")
print(f"Point forecast: {forecasts[0]}")
print(f"Quantiles shape: {quantiles.shape}")

Features

  • Multimodal forecasting: Combines time series data with textual context
  • Built on TimesFM: Leverages Google's state-of-the-art time series foundation model
  • Flexible text encoding: Supports English and Japanese text inputs
  • Easy integration: Simple API for adding text context to time series forecasting

Time-MMD Dataset Example

The project includes complete scripts for training and evaluating on the Time-MMD dataset.

Setup

Initialize the Time-MMD dataset submodule:

git submodule update --init

The dataset contains multimodal time series data across 10 domains: Agriculture, Climate, Economy, Energy, Environment, Health_AFR, Health_US, Security, SocialGood, and Traffic.

Training with Cross-Validation

Train a multimodal TimesFM model using cross-validation:

PYTHONPATH=. uv run python scripts/train_time_mmd_cv.py \
    --seed 42

Configuration:

  • Model config: TimesFM architecture parameters (layers, dimensions, context length, etc.)
  • Training config: Batch size, learning rate, domains, cross-validation settings, etc.
  • See examples/time_mmd/configs/ for configuration templates

The script will:

  • Create train/validation splits for each cross-validation fold
  • Train a separate model for each fold
  • Save checkpoints and cross-validation results to JSON
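
To inspect the saved results, the JSON can be loaded directly. This is a sketch only; the field names below ("folds", "val_mse") are assumptions, so check the file written by train_time_mmd_cv.py for the actual schema:

import json

with open("logs/cv_results.json") as f:
    cv_results = json.load(f)

# Hypothetical field names; adjust to the actual schema.
for i, fold in enumerate(cv_results["folds"]):
    print(f"fold {i}: val_mse={fold['val_mse']:.4f}")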

Evaluation

Evaluate trained models on the test set:

PYTHONPATH=. uv run python scripts/evaluate_time_mmd_cv.py \
    --cv-results logs/cv_results.json \
    --seed 42

This evaluates both the multimodal model and a baseline TimesFM model (without text inputs), reporting:

  • Mean Squared Error (MSE)
  • Mean Absolute Error (MAE)
  • Per-fold and overall metrics
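
For reference, a minimal numpy definition of the two reported metrics:

import numpy as np

def mse(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Mean squared error over all forecast points."""
    return float(np.mean((y_true - y_pred) ** 2))

def mae(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    """Mean absolute error over all forecast points."""
    return float(np.mean(np.abs(y_true - y_pred)))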

Visualization

Visualize model predictions:

PYTHONPATH=. uv run python scripts/visualize_time_mmd_cv.py \
    --cv-results logs/cv_results.json

Output:

  • Time series plots showing context, ground truth, and predictions
  • Metric comparison bar charts (MSE and MAE)

Acknowledgments

We thank the Time-MMD team for providing the multimodal time series dataset used in our examples and experiments.

License

MIT
