Skip to main content

LLM-ABBA: mining time series via symbolic approximation and large language models

Project description

pip license

LLM-ABBA is an software framework designed for performing time series application using Large Language Models (LLMs) based on symbolic representation, as introduced in the paper: LLM-ABBA: Symbolic Time Series Approximation using Large Language Models.

Time series analysis often involves identifying patterns, trends, and structures within sequences of data points. Traditional methods, such as discrete wavelet transforms or symbolic aggregate approximation (SAX), have demonstrated success in converting continuous time series into symbolic representations, facilitating better analysis and compression. However, these methods are often limited in their ability to capture complex and subtle patterns.

LLM-ABBA builds upon these techniques by incorporating the power of large language models, which have been shown to excel in pattern recognition and sequence prediction tasks. By applying LLMs to symbolic time series representation, LLM-ABBA is able to automatically discover rich, meaningful representations of time series data. This approach offers several advantages:

  • Higher accuracy and compression: LLM-ABBA achieves better symbolic representations by leveraging LLMs’ ability to understand and generate sequences, resulting in higher data compression and more accurate representation of underlying patterns.

  • Adaptability: The use of LLMs enables the framework to adapt to various types of time series data, allowing for robust performance across different domains such as finance, healthcare, and environmental science.

  • Scalability: LLM-ABBA is designed to efficiently handle large-scale time series datasets, making it suitable for both small and big data applications.

  • Automatic feature discovery: By harnessing the power of LLMs, LLM-ABBA can discover novel features and patterns in time series data that traditional symbolic approaches might miss.

In summary, LLM-ABBA represents a significant advancement in symbolic time series analysis, combining the power of modern machine learning techniques with established methods to offer enhanced compression, pattern recognition, and interpretability.

Key Features

  • Symbolic Time Series Approximation: Converts time series data into symbolic representations.

  • LLM-Powered Encoding: Utilizes LLMs to enhance compression and pattern discovery.

  • Efficient and Scalable: Designed to work with large-scale time series datasets.

  • Flexible Integration: Compatible with various machine learning and statistical analysis workflows.

Installation

LLM-ABBA can be installed via pip:

pip install llm-abba

Usage

For details of usage, please refer to the documentation and folder examples.

LLM-ABBA uses quantized ABBA with fixed-point adaptive piecewise linear continuous approximation (FAPCA). One would like to independently try quantized ABBA (with FAPCA), we provide independent interface:

from llmabba import ABBA

 ts = [[1.2, 1.4, 1.3, 1.8, 2.2, 2.4, 2.1], [1, 2, 3, 4]]
 abba = ABBA()
 symbolic_representation = abba.encode(ts)
 print("Symbolic Representation:", symbolic_representation)
 reconstruction = abba.decode(symbolic_representation)

Contributing

We welcome contributions! If you’d like to improve LLM-ABBA, please follow these steps:

  1. Fork the repository.

  2. Create a new branch for your feature or bugfix.

  3. Submit a pull request.

License

LLM-ABBA is released under the MIT License.

Contact

For questions or feedback, please reach out via GitHub issues or contact the authors of the paper.

References

[1]Carson, E., Chen, X., and Kang, C., “LLM-ABBA: Understanding time series via symbolic approximation”, arXiv e-prints, arXiv:2411.18506, 2024. doi:10.48550/arXiv.2411.18506.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llmabba-0.0.3.tar.gz (491.3 kB view details)

Uploaded Source

File details

Details for the file llmabba-0.0.3.tar.gz.

File metadata

  • Download URL: llmabba-0.0.3.tar.gz
  • Upload date:
  • Size: 491.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for llmabba-0.0.3.tar.gz
Algorithm Hash digest
SHA256 d1e6f23190ab04516da064bcfb10dc32f60efea9aa604c8bc5301e9904448e2c
MD5 87444e9d176b69606f46edab418d6fae
BLAKE2b-256 625d47b9d99737fb3420a2aced26a197925e0a821390cf988647c2e255a83d7e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page