A package for detecting anomalies in time series data using LLMs

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language

Project description

Anomaly Agent

A Python package for detecting anomalies in time series data using Large Language Models.

How It Works

The AnomalyAgent uses a LangGraph-based state machine to orchestrate anomaly detection and verification workflows:

graph TD
    A[Start] --> B[Detection Node]
    B --> C{Anomalies Found?}
    C -->|No| D[End - No Anomalies]
    C -->|Yes| E{Verification Enabled?}
    E -->|No| F[End - Return Detected]
    E -->|Yes| G["`Verification Chain
    (n_verify_steps: 1-5)`"]
    G --> H[End - Return Verified]
    
    B -->|Error| I[Error Handler]
    I -->|Retry| B
    I -->|Max Retries| J[End - Error]
    
    style B fill:#e1f5fe
    style G fill:#f3e5f5
    style I fill:#ffebee

Key Components:

Detection Node: Analyzes time series data to identify potential anomalies using LLM
Verification Node(s): Re-examines detected anomalies through multiple rounds to reduce false positives
Error Handler: Manages retries and failure scenarios with exponential backoff
Graph Caching: Reuses compiled graphs across agents with same configuration for efficiency

Installation

pip install anomaly-agent

Usage

See the examples.ipynb notebook for some usage examples.

import os
from anomaly_agent.utils import make_df, make_anomaly_config
from anomaly_agent.plot import plot_df
from anomaly_agent import AnomalyAgent

# set openai api key if not in environment
# os.environ['OPENAI_API_KEY'] = "<your-openai-api-key>"

# get anomaly config to generate some dummy data
anomaly_cfg = make_anomaly_config()
print(anomaly_cfg)

# generate some dummy data
df = make_df(100, 3, anomaly_config=anomaly_cfg)
df.head()

# create anomaly agent (uses cost-optimized gpt-5-nano by default)
anomaly_agent = AnomalyAgent()

# detect anomalies
anomalies = anomaly_agent.detect_anomalies(df)

# print anomalies
print(anomalies)

{
  "var1":"AnomalyList(anomalies="[
    "Anomaly(timestamp=""2020-02-05",
    variable_value=3.279153,
    "anomaly_description=""Abrupt spike in value, significantly higher than previous observations."")",
    "Anomaly(timestamp=""2020-02-15",
    variable_value=5.001551,
    "anomaly_description=""Abrupt spike in value, significantly higher than previous observations."")",
    "Anomaly(timestamp=""2020-02-20",
    variable_value=3.526827,
    "anomaly_description=""Abrupt spike in value, significantly higher than previous observations."")",
    "Anomaly(timestamp=""2020-03-23",
    variable_value=3.735584,
    "anomaly_description=""Abrupt spike in value, significantly higher than previous observations."")",
    "Anomaly(timestamp=""2020-04-05",
    variable_value=8.207361,
    "anomaly_description=""Abrupt spike in value, significantly higher than previous observations."")",
    "Anomaly(timestamp=""2020-02-06",
    variable_value=0.0,
    "anomaly_description=""Missing value (NaN) detected."")",
    "Anomaly(timestamp=""2020-02-24",
    variable_value=0.0,
    "anomaly_description=""Missing value (NaN) detected."")",
    "Anomaly(timestamp=""2020-04-09",
    variable_value=0.0,
    "anomaly_description=""Missing value (NaN) detected."")"
  ]")",
  "var2":"AnomalyList(anomalies="[
    "Anomaly(timestamp=""2020-01-27",
    variable_value=3.438903,
    "anomaly_description=""Significantly high spike compared to previous values."")",
    "Anomaly(timestamp=""2020-02-15",
    variable_value=3.374155,
    "anomaly_description=""Significantly high spike compared to previous values."")",
    "Anomaly(timestamp=""2020-02-29",
    variable_value=3.194132,
    "anomaly_description=""Significantly high spike compared to previous values."")",
    "Anomaly(timestamp=""2020-03-03",
    variable_value=3.401919,
    "anomaly_description=""Significantly high spike compared to previous values."")"
  ]")",
  "var3":"AnomalyList(anomalies="[
    "Anomaly(timestamp=""2020-01-15",
    variable_value=4.116716,
    "anomaly_description=""Significantly higher value compared to previous days."")",
    "Anomaly(timestamp=""2020-02-15",
    variable_value=2.418594,
    "anomaly_description=""Unusually high value than expected."")",
    "Anomaly(timestamp=""2020-02-29",
    variable_value=0.279798,
    "anomaly_description=""Lower than expected value in the series."")",
    "Anomaly(timestamp=""2020-03-29",
    variable_value=8.016581,
    "anomaly_description=""Extremely high value deviating from the norm."")",
    "Anomaly(timestamp=""2020-04-07",
    variable_value=7.609766,
    "anomaly_description=""Another extreme spike in value."")"
  ]")"
}

# get anomalies in long format
df_anomalies_long = anomaly_agent.get_anomalies_df(anomalies)
df_anomalies_long.head()

	timestamp	variable_name	value	anomaly_description
0	2020-02-05	var1	3.279153	Abrupt spike in value, significantly higher th...
1	2020-02-15	var1	5.001551	Abrupt spike in value, significantly higher th...
2	2020-02-20	var1	3.526827	Abrupt spike in value, significantly higher th...
3	2020-03-23	var1	3.735584	Abrupt spike in value, significantly higher th...
4	2020-04-05	var1	8.207361	Abrupt spike in value, significantly higher th...

Advanced Features

Debug Mode

Enable debug mode to get detailed logging of the anomaly detection process:

from anomaly_agent import AnomalyAgent

# Create agent with debug mode enabled
agent = AnomalyAgent(debug=True)

# Run detection - you'll see detailed logs
anomalies = agent.detect_anomalies(df)

This will show detailed information about:

Column processing stages
Node execution timing
Anomaly detection results
Verification filtering
Graph transitions

Custom Model Selection

Choose different GPT-5 models based on your needs:

# Cost-optimized (default)
agent = AnomalyAgent(model_name="gpt-5-nano")  # ~$0.05/$0.40 per 1M tokens

# Balanced performance  
agent = AnomalyAgent(model_name="gpt-5-mini")  # ~$0.25/$2.00 per 1M tokens

# Premium reasoning
agent = AnomalyAgent(model_name="gpt-5")       # ~$1.25/$10.00 per 1M tokens

Advanced Configuration

from anomaly_agent import AnomalyAgent

agent = AnomalyAgent(
    model_name="gpt-5-nano",
    verify_anomalies=True,        # Enable/disable verification step
    n_verify_steps=2,            # Number of verification rounds (1-5)
    max_retries=3,               # Configure retry behavior
    timeout_seconds=300,         # Set operation timeout
    debug=True                   # Enable detailed logging
)

Multiple Verification Steps

Improve detection accuracy by running multiple verification rounds to reduce false positives:

# Single verification (default, fastest)
agent_single = AnomalyAgent(n_verify_steps=1)

# Double verification (good balance for production)
agent_double = AnomalyAgent(n_verify_steps=2)  

# Triple verification (maximum confidence)
agent_triple = AnomalyAgent(n_verify_steps=3)

# Runtime override
anomalies = agent_single.detect_anomalies(df, n_verify_steps=4)

Benefits:

Reduced false positives through multiple LLM evaluations
Better consistency due to stochastic nature of LLMs
Configurable trade-off between accuracy and cost
Detailed metadata tracking for each verification step

Performance considerations:

n_verify_steps=1: Fastest, standard accuracy
n_verify_steps=2: 2x verification cost, good production balance
n_verify_steps=3+: Higher cost, maximum confidence for critical applications

Streaming and Parallel Processing

Enhanced user experience and performance for multiple time series variables:

# Real-time streaming with progress updates
def progress_handler(column, event, data):
    if event == "start":
        print(f"🔍 Starting {column}")
    elif event == "column_complete":
        print(f"✅ {column}: {data['anomaly_count']} anomalies")

anomalies = agent.detect_anomalies_streaming(df, progress_callback=progress_handler)

# Parallel processing for faster execution
import asyncio

async def detect_parallel():
    anomalies = await agent.detect_anomalies_parallel(
        df, 
        max_concurrent=3,  # Process up to 3 columns simultaneously
        progress_callback=progress_handler
    )
    return anomalies

results = asyncio.run(detect_parallel())

# Async streaming for responsive applications
async def process_with_streaming():
    async for event in agent.detect_anomalies_streaming_async(df):
        if event["event"] == "result":
            print(f"Column {event['column']}: {event['data']['anomaly_count']} anomalies")

asyncio.run(process_with_streaming())

Key Features:

Real-time Progress: Stream updates as each detection step completes
Parallel Execution: Process multiple time series variables concurrently
Configurable Concurrency: Control resource usage with max_concurrent parameter
Error Resilience: Graceful handling of failures in parallel execution
Performance Monitoring: Built-in timing and progress metrics

Use Cases:

Streaming: Interactive dashboards, real-time monitoring, user feedback
Parallel: Batch processing, large datasets, performance-critical applications
Async Streaming: Web applications, reactive UIs, progressive data loading

Custom Prompts

Customize the detection and verification prompts for domain-specific analysis:

custom_detection_prompt = """
You are a financial analyst specializing in market anomaly detection.
Focus on detecting price movements that exceed normal volatility ranges...
"""

agent = AnomalyAgent(
    detection_prompt=custom_detection_prompt,
    verification_prompt=custom_verification_prompt
)

Architecture Components

For advanced users, you can access the underlying components:

from anomaly_agent import (
    AnomalyAgent, 
    GraphManager,           # Graph caching and management
    DetectionNode,          # Anomaly detection node
    VerificationNode,       # Anomaly verification node
    ErrorHandlerNode,       # Error handling and retry logic
    AgentConfig,            # Configuration models
    AgentState              # State management
)

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language

Release history Release notifications | RSS feed

0.13.2

Apr 18, 2026

0.13.1

Feb 1, 2026

0.13.0

Feb 1, 2026

0.12.0

Feb 1, 2026

0.11.5

Jan 31, 2026

0.11.3

Jan 31, 2026

0.11.2

Jan 31, 2026

0.11.1

Jan 31, 2026

0.11.0

Jan 30, 2026

0.10.0

Jan 30, 2026

This version

0.9.0

Aug 17, 2025

0.8.0

Jun 17, 2025

0.7.0

May 25, 2025

0.6.0

May 13, 2025

0.5.0

Feb 14, 2025

0.4.0

Feb 14, 2025

0.3.0

Feb 14, 2025

0.2.0

Feb 11, 2025

0.1.0

Feb 6, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anomaly_agent-0.9.0.tar.gz (40.1 kB view details)

Uploaded Aug 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

anomaly_agent-0.9.0-py3-none-any.whl (26.1 kB view details)

Uploaded Aug 17, 2025 Python 3

File details

Details for the file anomaly_agent-0.9.0.tar.gz.

File metadata

Download URL: anomaly_agent-0.9.0.tar.gz
Upload date: Aug 17, 2025
Size: 40.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for anomaly_agent-0.9.0.tar.gz
Algorithm	Hash digest
SHA256	`feba1f754c2b9294e649ff7a81f0eb9a372529bce5ab63bf701a0141e95ee84e`
MD5	`111b688099b33e9969e403b38e788da6`
BLAKE2b-256	`829789587f2cd53996755d777dc5305d9b8910fbd27b2ea1fbbf6468681fd24f`

See more details on using hashes here.

File details

Details for the file anomaly_agent-0.9.0-py3-none-any.whl.

File metadata

Download URL: anomaly_agent-0.9.0-py3-none-any.whl
Upload date: Aug 17, 2025
Size: 26.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.6

File hashes

Hashes for anomaly_agent-0.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`05e12322a52a4c8e40a8846ffc3d135b54598d7d440b32aff23b6c38481e0a51`
MD5	`ba8e3414743eccd3a00e485e7d0b4c57`
BLAKE2b-256	`f57312f195fc8f9e8dc39069df542c93c759115dff57a983e0c0c589d27cf8ff`

See more details on using hashes here.

anomaly-agent 0.9.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Anomaly Agent

How It Works

Installation

Usage

Advanced Features

Debug Mode

Custom Model Selection

Advanced Configuration

Multiple Verification Steps

Streaming and Parallel Processing

Custom Prompts

Architecture Components

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes