
Iteration of Thought LLM Agent


Iteration of Thought (IoT) Framework

1. Introduction

Generating accurate and contextually relevant responses using AI is more critical than ever. Whether you're developing chatbots, virtual assistants, or any application that relies on natural language processing, understanding how to leverage frameworks like the Iteration of Thought (IoT) can significantly enhance your results.

This implementation is based on the article: https://arxiv.org/pdf/2409.12618

💡 Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning

Understanding the Components of Iteration of Thought (IoT)

  1. Inner Dialogue Agent (IDA):

    • Think of the IDA as a personal tutor or guide that helps the LLM refine its responses. Just like a student who asks questions to clarify their understanding, the IDA generates context-sensitive prompts based on the original user query and the LLM's previous responses. This is akin to having a conversation where each response leads to further questions that deepen understanding.
    • Mathematically, we can represent this as a function C: Q × R × K' → P, where Q is the set of possible queries, R is the set of potential LLM responses, K' is the IDA's own internal knowledge, and P is the set of generated prompts. Each iteration allows the IDA to dynamically adjust its guidance based on what has been previously discussed.
  2. LLM Agent (LLMA):

    • The LLMA is like the brain of the operation, processing the prompts generated by the IDA. It uses its internal knowledge base to refine its responses further. Imagine it as a researcher who takes feedback from their mentor (the IDA) and uses that feedback to improve their work.
    • This relationship can be expressed mathematically as L: Q × P × K → R. Here, L takes in a query q, a prompt p, and a knowledge base K, producing a refined response r.
  3. Iterative Prompting Loop:

    • The iterative loop is where the magic happens. It involves back-and-forth communication between the IDA and LLMA. Each time the LLMA generates a response, it is evaluated by the IDA, which then creates a new prompt for further refinement. This process continues until either a satisfactory answer is reached or a maximum number of iterations is completed.
    • This loop can be visualized as a conversation where each participant builds on what the other has said, leading to deeper insights and improved answers. A minimal code sketch of all three pieces follows this list.
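
To make these three components concrete, here is a minimal, illustrative sketch in Python. It is not the package's implementation: the names ida, llma, iterate, is_good_enough, and call_llm are placeholders for the concepts above, and call_llm stands in for any chat-completion call.

    from typing import Callable

    # Placeholder names -- this sketch only mirrors the C and L functions and the iterative loop.
    def ida(query: str, previous_response: str, call_llm: Callable[[str], str]) -> str:
        """Inner Dialogue Agent: maps (query, previous response) to a new guiding prompt."""
        return call_llm(
            f"Given the query '{query}' and the previous answer '{previous_response}', "
            "write a prompt that would help improve the answer."
        )

    def llma(query: str, prompt: str, call_llm: Callable[[str], str]) -> str:
        """LLM Agent: maps (query, prompt) to a refined response using the model's own knowledge."""
        return call_llm(f"Question: {query}\nGuidance: {prompt}\nAnswer:")

    def iterate(query: str, call_llm: Callable[[str], str],
                is_good_enough: Callable[[str], bool], max_iterations: int = 5) -> str:
        """Iterative prompting loop: alternate IDA and LLMA until satisfied or out of budget."""
        response = llma(query, "Provide an initial answer.", call_llm)
        for _ in range(max_iterations):
            if is_good_enough(response):
                break
            response = llma(query, ida(query, response, call_llm), call_llm)
        return response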

Why Use IoT?

The IoT framework is particularly effective in situations where complex queries require nuanced understanding or where initial responses may lack depth or clarity. It allows for adaptive exploration across different reasoning paths without discarding potentially valuable insights—unlike traditional methods that may generate multiple reasoning paths but ultimately discard most of them.

When to Use It?

Reach for IoT when a single-pass answer is likely to miss the nuance a query demands. It is well suited to education, customer support, content generation, and similar applications where responses benefit from progressive refinement.

2. Getting Started

Setting Up Your Environment

To get started with the IoT framework, you'll need to set up your environment correctly:

  1. Install Python: Ensure you have Python installed on your machine (version 3.11 or higher).
  2. Set up OpenAI API: Obtain your API key from OpenAI and set it as an environment variable (an example export command is shown right after this list). You can also use another model supported by LiteLLM, for example ollama/gemma2:2b.
  3. Install Poetry: If you haven't already, install Poetry by following the instructions on the Poetry website.
  4. Clone the Repository: Clone the repository containing the IoT framework code:
    git clone https://github.com/raphaelmansuy/iteration_of_thought
    cd iteration_of_thought
    
  5. Install Dependencies: Use Poetry to install required packages:
    poetry install
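
    For step 2, a typical way to set the key on macOS or Linux is to export it in the shell you will run the agent from (the value below is a placeholder, not a real key):

    export OPENAI_API_KEY="sk-..."   # placeholder; use your own key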
    

Running the Program

To run the program, use the following commands from within the Poetry environment:

  1. Activate the Poetry Shell:

    poetry shell
    
  2. Run the Main Script:

    python src/iot_agent/main.py --method AIoT --query "Your query here" --temperature 0.5
    

    You can specify the method (AIoT or GIoT), the query, and the sampling temperature for the LLM response.

Understanding the Code Structure

The provided code consists of several key components:

  • IterationOfThought Class: This class manages the iteration process using specified models and includes methods for both AIoT and GIoT.
  • Methods:
    • _call_llm: Handles API calls to the LLM service (via LiteLLM).
    • inner_dialogue_agent: Generates new prompts based on previous responses to refine the output.
    • llm_agent: Combines the user query with the generated prompt to produce a refined response.
    • stopping_criterion: Determines when to stop iterating based on the content of the response.
    • aiot: Implements the Autonomous Iteration of Thought process.
    • giot: Implements the Guided Iteration of Thought process.

3. Code Explanation

Main Components

  1. Imports and Configuration: The code begins by importing necessary libraries and setting up configuration variables, including the OpenAI API key and model type.

    import os
    import time
    import signal
    from typing import Optional
    from loguru import logger
    import click
    from rich.console import Console
    from rich.panel import Panel
    from rich.progress import Progress, SpinnerColumn, TextColumn
    from rich.prompt import Prompt
    from rich.table import Table
    from rich.markdown import Markdown
    from litellm import completion
    import requests  # Added for handling URL requests
    
  2. API Key Handling: The API key is retrieved from the environment variables. If it is not set, an error is raised.

    API_KEY = os.getenv("OPENAI_API_KEY")
    if not API_KEY:
        raise ValueError("OpenAI API key must be set as an environment variable.")
    
  3. IterationOfThought Class: This class encapsulates the logic for both AIoT and GIoT methods. It initializes with parameters such as the model type, maximum iterations, timeout settings, and temperature.

    class IterationOfThought:
        def __init__(self, model: str = MODEL, max_iterations: int = 5, timeout: int = 30, temperature: float = 0.5):
            self.model = model
            self.max_iterations = max_iterations
            self.timeout = timeout
            self.temperature = temperature
    
  4. API Call Method: The _call_llm method handles the interaction with the LLM API through LiteLLM, retrying on transient errors (such as rate limits) before giving up.

    def _call_llm(self, prompt: str, temperature: Optional[float] = None, max_retries: int = 3) -> str:
        for attempt in range(max_retries):
            try:
                with console.status(f"[bold green]Calling {self.model} API...", spinner="dots"):
                    response = completion(
                        model=self.model,
                        temperature=temperature or self.temperature,
                        messages=[{"role": "user", "content": prompt}],
                    )
                return response["choices"][0]["message"]["content"].strip()
            except Exception as e:
                console.print(f"[red]Error on attempt {attempt + 1}: {e}")
                time.sleep(2**attempt)  # back off before retrying
        console.print("[red]Failed to get a response from the LLM API after max retries")
        return ""
    
  5. Inner Dialogue Agent: This method generates a new prompt based on the previous response, encouraging deeper reasoning.

    def inner_dialogue_agent(self, query: str, previous_response: str) -> str:
        prompt = (
            f"Given the original query: '{query}' and the previous response: '{previous_response}', "
            "generate an instructive and context-specific prompt to refine and improve the answer."
        )
        return self._call_llm(prompt)
    
  6. AIoT and GIoT Methods: The aiot method implements the autonomous iteration process (stopping when the stopping criterion or the iteration limit is reached), while the giot method runs a fixed number of iterations. The bodies are elided here; a minimal sketch of both loops follows the snippet below.

    def aiot(self, query: str) -> str:
        # Implementation of AIoT
    
    def giot(self, query: str, fixed_iterations: int) -> str:
        # Implementation of GIoT
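
    For orientation, here is a minimal sketch of what the two loops look like, written against the helper methods described earlier (llm_agent, inner_dialogue_agent, stopping_criterion). It is an illustrative approximation, not the package's exact code, and the initial "Provide an initial answer." prompt is an assumed placeholder.

    # Illustrative sketch only -- the real methods add logging, progress display, and richer prompts.
    def aiot(self, query: str) -> str:
        response = self.llm_agent(query, "Provide an initial answer.")  # assumed initial prompt
        for _ in range(self.max_iterations):
            if self.stopping_criterion(response):
                break
            prompt = self.inner_dialogue_agent(query, response)
            response = self.llm_agent(query, prompt)
        return response

    def giot(self, query: str, fixed_iterations: int) -> str:
        response = self.llm_agent(query, "Provide an initial answer.")  # assumed initial prompt
        for _ in range(fixed_iterations):
            prompt = self.inner_dialogue_agent(query, response)
            response = self.llm_agent(query, prompt)
        return response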
    
  7. User Interaction: The get_user_query function prompts the user for input, allowing for a sample query or a custom one.

    # sample_query is a module-level default question defined elsewhere in the script
    def get_user_query() -> str:
        user_input = Prompt.ask("Query", default=sample_query)
        return user_input
    
  8. Main Function: The main function orchestrates the execution of the program, handling user input and displaying results; a rough sketch of this wiring follows the snippet below.

    @click.command()
    @click.option("--method", type=click.Choice(["AIoT", "GIoT", "both"]), default="AIoT", help="Choose the method to run")
    def main(method: str) -> None:
        # Main execution logic (the script also defines --query and --temperature options, omitted here)
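
    As a rough sketch of the wiring (assumed, not the package's exact code): main collects the query, instantiates IterationOfThought, runs the selected method(s), and renders the result with rich. The --query and --temperature options mirror the CLI usage shown earlier; fixed_iterations=3 is an assumed value for illustration.

    # Illustrative wiring only; the real main adds logging, timing, and progress output.
    @click.command()
    @click.option("--method", type=click.Choice(["AIoT", "GIoT", "both"]), default="AIoT")
    @click.option("--query", default=None, help="Query to answer (prompted for if omitted)")
    @click.option("--temperature", default=0.5, type=float, help="Sampling temperature")
    def main(method: str, query: Optional[str], temperature: float) -> None:
        iot = IterationOfThought(temperature=temperature)
        user_query = query or get_user_query()
        if method in ("AIoT", "both"):
            console.print(Panel(Markdown(iot.aiot(user_query)), title="AIoT result"))
        if method in ("GIoT", "both"):
            console.print(Panel(Markdown(iot.giot(user_query, fixed_iterations=3)), title="GIoT result"))

    if __name__ == "__main__":
        main()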
    

4. Examples of IoT in Action

Simple Example: Basic Query Handling

Let's start with a simple example using AIoT:

iot = IterationOfThought()  # assumes the default model and settings from the script
sample_query = "What is the capital of France?"
final_response_aiot = iot.aiot(sample_query)
print(final_response_aiot)

In this example, we ask a straightforward question about France's capital. The AI will generate an initial response and refine it through iterations until it reaches a satisfactory answer.

Intermediate Example: Refining Responses

Now let's look at an intermediate example using GIoT:

sample_query = "Explain photosynthesis."
final_response_giot = iot.giot(sample_query, fixed_iterations=3)
print(final_response_giot)

Here, we ask for an explanation of photosynthesis and run exactly three refinement passes, each building on the previous response.

Advanced Example: Complex Query Iteration

For our advanced example, let's tackle a more complex query:

sample_query = "Describe the impact of climate change on marine biodiversity."
final_response_aiot = iot.aiot(sample_query)
print(final_response_aiot)

This query might require multiple iterations for deeper insights into various aspects related to climate change and marine life.

5. Interactive Elements

Quick Quiz: Test Your Knowledge 🧠✨

Question: Which two iteration modes does the IoT framework provide? 🤔

  • A) AIoT and GIoT 🌐🔄
  • B) Machine Learning and Deep Learning 📊🧠
  • C) Data Science and Data Engineering 📈🔧

Pause and reflect before checking your answer!

6. Pro Tips

  • Craft Effective Prompts: The quality of your prompts significantly influences response quality. Be clear and specific.
  • Iterate Wisely: Not all queries require multiple iterations; assess when it's necessary based on complexity.

7. Common Misconceptions

Many users believe that simply sending queries to AI will yield perfect results without needing refinement—this is a misconception! Iterative frameworks like IoT are essential for enhancing response accuracy and relevance.

8. Sequence Diagram: Understanding IoT Process

To visualize how the IoT framework operates, here's a Mermaid sequence diagram illustrating the interaction between different components during response generation:

sequenceDiagram
    participant User as User
    participant AI as OpenAI Model
    participant IDA as Inner Dialogue Agent
    participant LLM as LLM Agent
    
    User->>LLM: Send initial query
    LLM->>AI: Generate initial response
    AI-->>LLM: Return response
    LLM-->>User: Show initial response
    
    User->>IDA: Request refinement with previous response
    IDA->>AI: Generate new prompt based on previous response
    AI-->>IDA: Return refined prompt
    
    IDA->>LLM: Send refined prompt
    LLM->>AI: Generate refined response
    AI-->>LLM: Return refined response
    LLM-->>User: Show refined response
    
    Note over User, LLM: Repeat process until stopping criterion met

9. Conclusion

Call-to-Action: Apply What You've Learned!

To put your new knowledge into practice within 24 hours:

  1. Choose a topic you're passionate about.
  2. Formulate a query related to that topic.
  3. Implement either AIoT or GIoT using the provided code structure.
  4. Share your refined response with peers or colleagues!

By taking these steps, you'll not only reinforce what you've learned but also begin applying it in real-world scenarios—empowering you as a practitioner in no time!

