
Iteration of Thought LLM Agent


Iteration of Thought (IoT) Framework

1. Introduction

Generating accurate and contextually relevant responses using AI is more critical than ever. Whether you're developing chatbots, virtual assistants, or any application that relies on natural language processing, understanding how to leverage frameworks like the Iteration of Thought (IoT) can significantly enhance your results.

This implementation is based on the article: https://arxiv.org/pdf/2409.12618

💡 Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning

Understanding the Components of Iteration of Thought (IoT)

  1. Inner Dialogue Agent (IDA):

    • Think of the IDA as a personal tutor or guide that helps the LLM refine its responses. Just like a student who asks questions to clarify their understanding, the IDA generates context-sensitive prompts based on the original user query and the LLM's previous responses. This is akin to having a conversation where each response leads to further questions that deepen understanding.
    • Mathematically, we can represent this as a function C: Q × R × K' → P, where Q is the set of possible queries, R the set of potential LLM responses, K' the knowledge available to the IDA, and P the set of generated prompts. Each iteration lets the IDA dynamically adjust its guidance based on what has been discussed so far.
  2. LLM Agent (LLMA):

    • The LLMA is like the brain of the operation, processing the prompts generated by the IDA. It uses its internal knowledge base to refine its responses further. Imagine it as a researcher who takes feedback from their mentor (the IDA) and uses that feedback to improve their work.
    • This relationship can be expressed mathematically as L: Q × P × K → R. Here, L takes a query q, a prompt p, and a knowledge base K, and produces a refined response r.
  3. Iterative Prompting Loop:

    • The iterative loop is where the magic happens. It involves back-and-forth communication between the IDA and LLMA. Each time the LLMA generates a response, it is evaluated by the IDA, which then creates a new prompt for further refinement. This process continues until either a satisfactory answer is reached or a maximum number of iterations is completed.
    • This loop can be visualized as a conversation where each participant builds on what the other has said, leading to deeper insights and improved answers.
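
To make the interplay concrete, here is a minimal, framework-agnostic sketch of the iterative prompting loop. The helper names (llma, ida, done) are illustrative stand-ins for the functions L and C above and for a stopping check; they are not this package's API (the class shipped here exposes llm_agent, inner_dialogue_agent, and stopping_criterion, covered later).

from typing import Callable

def iterate_of_thought(
    query: str,
    llma: Callable[[str, str], str],   # plays the role of L: (query, prompt) -> response
    ida: Callable[[str, str], str],    # plays the role of C: (query, previous_response) -> new prompt
    done: Callable[[str], bool],       # stopping criterion on the latest response
    max_iterations: int = 5,
) -> str:
    response = llma(query, query)          # initial answer to the raw query
    for _ in range(max_iterations):
        if done(response):                 # satisfactory answer reached
            break
        prompt = ida(query, response)      # IDA turns the last answer into refined guidance
        response = llma(query, prompt)     # LLMA answers again under the refined prompt
    return response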

Why Use IoT?

The IoT framework is particularly effective in situations where complex queries require nuanced understanding or where initial responses may lack depth or clarity. It allows for adaptive exploration across different reasoning paths without discarding potentially valuable insights—unlike traditional methods that may generate multiple reasoning paths but ultimately discard most of them.

When to use it?

Reach for IoT when a single-pass answer is unlikely to be sufficient, for example in education, customer support, content generation, and similar applications where responses benefit from refinement.

2. Getting Started

Setting Up Your Environment

To get started with the IoT framework, you'll need to set up your environment correctly:

  1. Install Python: Ensure you have Python installed on your machine (version 3.11 or higher).
  2. Set up the OpenAI API: Obtain your API key from OpenAI and set it as the OPENAI_API_KEY environment variable (see the example after this list). You can also point the framework at another LiteLLM-supported model, for example ollama/gemma2:2b.
  3. Install Poetry: If you haven't already, install Poetry by following the instructions on the Poetry website.
  4. Clone the Repository: Clone the repository containing the IoT framework code:
    git clone https://github.com/raphaelmansuy/iteration_of_thought
    cd iteration_of_thought
    
  5. Install Dependencies: Use Poetry to install required packages:
    poetry install
    
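For step 2, the simplest setup on macOS/Linux is to export the key in the shell you will run the program from (the variable name matches the one the code reads, shown in section 3):

export OPENAI_API_KEY="your-api-key-here"

On Windows PowerShell, use $env:OPENAI_API_KEY = "your-api-key-here" instead.
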

Running the Program

To run the program, use the following commands from within the Poetry environment:

  1. Activate the Poetry Shell:

    poetry shell
    
  2. Run the Main Script:

    python src/iot_agent/main.py --method AIoT --query "Your query here" --temperature 0.5
    

    You can specify the method (AIoT or GIoT), the query, and the sampling temperature for the LLM response.
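
    For example, to run the guided variant (GIoT) with a slightly higher temperature, using the same flags:

    python src/iot_agent/main.py --method GIoT --query "Explain photosynthesis." --temperature 0.7

    The number of guided iterations is set inside the script; run the command with --help to see all available options.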

Understanding the Code Structure

The provided code consists of several key components:

  • IterationOfThought Class: This class manages the iteration process using specified models and includes methods for both AIoT and GIoT.
  • Methods:
    • _call_llm: Handles API calls to the LLM service.
    • inner_dialogue_agent: Generates new prompts based on previous responses to refine the output.
    • llm_agent: Combines the user query with the generated prompt to produce a refined response.
    • stopping_criterion: Determines when to stop iterating based on the content of the response (a possible implementation is sketched after this list).
    • aiot: Implements the Autonomous Iteration of Thought process.
    • giot: Implements the Guided Iteration of Thought process.
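
Of these, stopping_criterion is the least self-explanatory. A minimal sketch of one plausible implementation follows, assuming the LLM agent is asked to include a marker phrase such as "IterationComplete" when it considers its answer final; the repository's actual check may differ.

    def stopping_criterion(self, response: str) -> bool:
        # Stop once the model signals that it considers the answer final.
        # "IterationComplete" is an assumed marker phrase, not necessarily the one used in the repo.
        return "iterationcomplete" in response.lower()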

3. Code Explanation

Main Components

  1. Imports and Configuration: The code begins by importing necessary libraries and setting up configuration variables, including the OpenAI API key and model type.

    import os
    import time
    import signal
    from typing import Optional
    from loguru import logger
    import click
    from rich.console import Console
    from rich.panel import Panel
    from rich.progress import Progress, SpinnerColumn, TextColumn
    from rich.prompt import Prompt
    from rich.table import Table
    from rich.markdown import Markdown
    from litellm import completion
    import requests  # Added for handling URL requests
    
  2. API Key Handling: The API key is retrieved from the environment variables. If it is not set, an error is raised.

    API_KEY = os.getenv("OPENAI_API_KEY")
    if not API_KEY:
        raise ValueError("OpenAI API key must be set as an environment variable.")
    
  3. IterationOfThought Class: This class encapsulates the logic for both AIoT and GIoT methods. It initializes with parameters such as the model type, maximum iterations, timeout settings, and temperature.

    class IterationOfThought:
        def __init__(self, model: str = MODEL, max_iterations: int = 5, timeout: int = 30, temperature: float = 0.5):
            self.model = model
            self.max_iterations = max_iterations
            self.timeout = timeout
            self.temperature = temperature
    
  4. API Call Method: The _call_llm method handles the interaction with the LLM API through litellm, including basic retry and error handling (useful for transient failures such as rate limits).

    def _call_llm(self, prompt: str, temperature: Optional[float] = None, max_retries: int = 3) -> str:
        for attempt in range(max_retries):
            try:
                with console.status(f"[bold green]Calling {self.model} API...", spinner="dots"):
                    response = completion(
                        model=self.model,
                        temperature=temperature or self.temperature,
                        messages=[{"role": "user", "content": prompt}],
                    )
                return response["choices"][0]["message"]["content"].strip()
            except Exception as e:
                # Retry on transient failures (e.g. rate limits) instead of giving up immediately
                console.print(f"[red]Error on attempt {attempt + 1}: {e}")
                time.sleep(1)
        console.print("[red]Failed to get a response from the LLM API after max retries")
        return ""
    
  5. Inner Dialogue Agent: This method generates a new prompt based on the previous response, encouraging deeper reasoning.

    def inner_dialogue_agent(self, query: str, previous_response: str) -> str:
        prompt = (
            f"Given the original query: '{query}' and the previous response: '{previous_response}', "
            "generate an instructive and context-specific prompt to refine and improve the answer."
        )
        return self._call_llm(prompt)
    
  6. AIoT and GIoT Methods: The aiot method implements the autonomous iteration process, while the giot method runs a fixed number of iterations; their bodies are elided below, and a hedged sketch of both appears after this list.

    def aiot(self, query: str) -> str:
        # Implementation of AIoT
    
    def giot(self, query: str, fixed_iterations: int) -> str:
        # Implementation of GIoT
    
  7. User Interaction: The get_user_query function prompts the user for input, allowing for a sample query or a custom one.

    def get_user_query() -> str:
        user_input = Prompt.ask("Query", default=sample_query)  # sample_query is a module-level default (e.g. the sample question used in section 4)
        return user_input
    
  8. Main Function: The main function orchestrates the execution of the program, handling user input and displaying results.

    @click.command()
    @click.option("--method", type=click.Choice(["AIoT", "GIoT", "both"]), default="AIoT", help="Choose the method to run")
    def main(method: str) -> None:
        # Main execution logic
    
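The bodies of aiot and giot are elided in item 6 above. As a rough guide only, here is a hedged sketch of how they could be assembled from the methods already shown, assuming llm_agent(query, prompt) has the signature described in section 2 and that the seed instruction string is arbitrary; this is not the package's actual implementation:

    def aiot(self, query: str) -> str:
        # Autonomous IoT: iterate until the stopping criterion fires or max_iterations is reached.
        response = self.llm_agent(query, "Answer the query as thoroughly as you can.")
        for _ in range(self.max_iterations):
            if self.stopping_criterion(response):
                break
            prompt = self.inner_dialogue_agent(query, response)
            response = self.llm_agent(query, prompt)
        return response

    def giot(self, query: str, fixed_iterations: int) -> str:
        # Guided IoT: always run the full, fixed number of refinement rounds.
        response = self.llm_agent(query, "Answer the query as thoroughly as you can.")
        for _ in range(fixed_iterations):
            prompt = self.inner_dialogue_agent(query, response)
            response = self.llm_agent(query, prompt)
        return response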

4. Examples of IoT in Action

Simple Example: Basic Query Handling

Let's start with a simple example using AIoT:

# Instantiate the agent with default settings (model, max iterations, temperature)
iot = IterationOfThought()

sample_query = "What is the capital of France?"
final_response_aiot = iot.aiot(sample_query)
print(final_response_aiot)

In this example, we ask a straightforward question about France's capital. The agent generates an initial response and refines it through iterations until the stopping criterion is met or the maximum number of iterations is reached.

Intermediate Example: Refining Responses

Now let's look at an intermediate example using GIoT:

sample_query = "Explain photosynthesis."
final_response_giot = iot.giot(sample_query, fixed_iterations=3)
print(final_response_giot)

Here, we ask for an explanation of photosynthesis over exactly three iterations, with each pass refining and deepening the previous answer.

Advanced Example: Complex Query Iteration

For our advanced example, let's tackle a more complex query:

sample_query = "Describe the impact of climate change on marine biodiversity."
final_response_aiot = iot.aiot(sample_query)
print(final_response_aiot)

This query might require multiple iterations for deeper insights into various aspects related to climate change and marine life.
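
Because the click option in section 3 also accepts both, the CLI can run the two methods back to back on the same query for comparison:

python src/iot_agent/main.py --method both --query "Describe the impact of climate change on marine biodiversity."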

5. Interactive Elements

Quick Quiz: Test Your Knowledge 🧠✨

Question: Which two iteration methods does the IoT framework offer? 🤔

  • A) AIoT and GIoT 🌐🔄
  • B) Machine Learning and Deep Learning 📊🧠
  • C) Data Science and Data Engineering 📈🔧

Pause and reflect before checking your answer!

6. Pro Tips

  • Craft Effective Prompts: The quality of your prompts significantly influences response quality. Be clear and specific.
  • Iterate Wisely: Not all queries require multiple iterations; assess when it's necessary based on complexity.

7. Common Misconceptions

Many users believe that simply sending queries to AI will yield perfect results without needing refinement—this is a misconception! Iterative frameworks like IoT are essential for enhancing response accuracy and relevance.

8. Sequence Diagram: Understanding IoT Process

To visualize how the IoT framework operates, here's a Mermaid sequence diagram illustrating the interaction between different components during response generation:

sequenceDiagram
    participant User as User
    participant AI as OpenAI Model
    participant IDA as Inner Dialogue Agent
    participant LLM as LLM Agent
    
    User->>LLM: Send initial query
    LLM->>AI: Generate initial response
    AI-->>LLM: Return response
    LLM-->>User: Show initial response
    
    User->>IDA: Request refinement with previous response
    IDA->>AI: Generate new prompt based on previous response
    AI-->>IDA: Return refined prompt
    
    IDA->>LLM: Send refined prompt
    LLM->>AI: Generate refined response
    AI-->>LLM: Return refined response
    LLM-->>User: Show refined response
    
    Note over User, LLM: Repeat process until stopping criterion met

9. Conclusion

Call-to-Action: Apply What You've Learned!

To put your new knowledge into practice within 24 hours:

  1. Choose a topic you're passionate about.
  2. Formulate a query related to that topic.
  3. Implement either AIoT or GIoT using the provided code structure.
  4. Share your refined response with peers or colleagues!

By taking these steps, you'll not only reinforce what you've learned but also begin applying it in real-world scenarios—empowering you as a practitioner in no time!

