A python module

These details have not been verified by PyPI

Project description

chainollama

chainollama is a partner package for chatollama which provides the ChainEngine class. This engine is an extension of the ChatEngine class. It provides a function called chain(message: str). This function acts similarly to the chat function from ChatEngine, but it allows the model to respond both naturally and with tool calls sequentially. When the model is responding to the user's message, it is instructed to respond in a specific format if it wants to call tools that you can provide with the ChatToolKit and ChatTool classes.

Any Python function you create can be added to this system without the need for manual formatting. You just need to make the function, and the model will understand it. Ensure that you provide a description in the function like this:

def testing_func(arg1: str, arg2: bool = True) -> None:
    """
    This is a test function with 2 arguments
    :param arg1: description for arg1
    :param arg2: description for arg2
    :returns: description of returning value(s)
    """
    return None

This function will be turned into this for the model to understand:

{
    "function": "testing_func(arg1: str, arg2: bool = True) -> None",
    "description": "This is a test function with 2 arguments\n:param arg1: description for arg1\n:param arg2: description for arg2\n:returns: description of returning value(s)"
}

Note that while the ChatTool class exists to create individual tools, it is often easier to simply pass your functions directly into the ChatToolKit as a list. The toolkit will automatically convert these functions into tools without requiring you to create each ChatTool manually.

Important Details About the Chain Function

The chain function internally uses the chat and wait functions. The chat function streams the response and runs as part of a separate thread, while the wait function allows you to wait for that thread to finish in the normal chatollama module. In ChainEngine, these functions are used internally, meaning that if you want to have the entire chain process run asynchronously, you will need to manage it by starting a separate thread yourself.

This approach means that no code will run after the chain function is called until the model completes its entire response cycle, including tool usage. This can be beneficial as it simplifies interaction, ensuring that everything is complete before moving forward. However, for scenarios involving UI performance or where responsiveness is key, handling threading on top of it is essential to maintain smooth operation.

Fine-Tuning Plans

We are planning to create a fine-tuned version of the llama3.1:8b model that will be optimized to work even better with this system. This fine-tuned model will have several enhanced capabilities:

Improved Tool Use Integration: The model will be better at weaving tool use naturally into its language responses.
Enhanced JSON Understanding: The model will have a better understanding of the JSON format used for converting functions in the toolkit.
Internal System Command Awareness: It will be trained to properly consider internal user system commands that allow the model to receive function results and continue responding seamlessly.
Overall System Integration: The fine-tuned model will have a much better overall integration with the system, leading to more reliable interactions.

Currently, the normal llama3.1:8b model works fairly well, achieving around 50% success in properly using tools and responding to all components of the user's prompt. However, it sometimes forgets to use available tools or fails to respond to certain parts of a user's message. With fine-tuning, we expect this accuracy to improve significantly, aiming for 90% or better.

Features

Chain Execution: Allows the AI to perform tasks by sequentially combining natural language responses with tool calls.
Automatic Function Integration: Any Python function can be added to the toolkit without manual formatting. The model automatically interprets the function signature and description.
Tool Calling Format: Uses a consistent format for the AI to call tools, maintaining a seamless user experience while leveraging available functions.
Error Handling: Includes robust error-handling mechanisms to ensure graceful recovery in case of tool call errors or unexpected input formats.
Extensible ToolKit: Provides ChatTool and ChatToolKit classes to easily create and manage available tools for the model.

Response Handling Callbacks

The ChainEngine provides two callbacks that developers can use to handle how the AI response is integrated into their application:

Before Chain Handle Callback (before_chain_handle_callback****): This callback is called before the AI handles the response. It allows developers to manipulate or intercept the AI's output before processing. If this callback returns True, it will stop the streaming response immediately.
After Chain Handle Callback (after_chain_handle_callback****): This callback is called after the AI processes the response. It allows developers to perform actions based on the processed output, such as modifying the final response or integrating it with the application. Similar to the before callback, if it returns True, it will stop further processing.

Both callbacks take the following arguments:

mode** (int)**: Indicates the mode of the response (mode 0: starting a new response just before the model begins streaming, mode 1: runs for every token generated during streaming, mode 2: called one last time after the stream finishes).
delta** (str)**: Represents the incremental part of the response being streamed. The current token generated basically.
text** (str)**: The entire response text generated so far.

There is also a default printing mechanism (default_printing) which is enabled by default. This feature will print the AI's response in real-time as it is being streamed, allowing developers to see the process directly in the console. However, if either of the callbacks returns True, it will stop the stream immediately, providing greater control over how the output is handled.

Examples

Below are some simple usage examples to demonstrate how to use chainollama effectively.

Example 1: Adding a Function to ChatToolKit

from chainollama import ChatToolKit, ChainEngine

# Define a sample function
def greet_user(name: str) -> str:
    """
    Greets the user by name.
    :param name: The name of the user to greet
    :returns: A greeting string
    """
    return f"Hello, {name}!"

# Add the function to the ChatToolKit
toolkit = ChatToolKit([greet_user])

Example 2: Using ChainEngine to Respond with a Tool Call

# Initialize the ChainEngine with the toolkit
engine = ChainEngine(model="llama3.1:8b", toolkit=toolkit)

# Use the chain method to interact
engine.chain("Greet the user named John.")

# Output could be something like:
# "Hello, John!"

Example 3: Handling Multiple Tools

# Define more functions
def add_numbers(a: int, b: int) -> int:
    """
    Adds two numbers.
    :param a: The first number
    :param b: The second number
    :returns: The sum of the two numbers
    """
    return a + b

def multiply_numbers(a: int, b: int) -> int:
    """
    Multiplies two numbers.
    :param a: The first number
    :param b: The second number
    :returns: The product of the two numbers
    """
    return a * b

# Add the functions to the toolkit
toolkit = ChatToolKit([add_numbers, multiply_numbers])

# Initialize the ChainEngine again
engine = ChainEngine(model="llama3.1:8b", toolkit=toolkit)

# Use the chain method to interact
engine.chain("What is 5 plus 3?")
# Output: "The result of 5 plus 3 is 8."

engine.chain("What is 4 times 7?")
# Output: "The result of 4 times 7 is 28."

Project details

These details have not been verified by PyPI

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.7

Oct 30, 2024

0.6

Oct 29, 2024

0.5

Oct 29, 2024

0.4

Oct 29, 2024

0.3

Oct 29, 2024

0.2

Oct 29, 2024

0.1

Oct 28, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chainollama-0.7.tar.gz (9.9 kB view details)

Uploaded Oct 30, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chainollama-0.7-py3-none-any.whl (9.8 kB view details)

Uploaded Oct 30, 2024 Python 3

File details

Details for the file chainollama-0.7.tar.gz.

File metadata

Download URL: chainollama-0.7.tar.gz
Upload date: Oct 30, 2024
Size: 9.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for chainollama-0.7.tar.gz
Algorithm	Hash digest
SHA256	`06facb90d8199b4ab5699516317d2382ce839bda1bb81c63a86fae17c6c15ece`
MD5	`53427ff0aa6178bcb82f385bf38f401e`
BLAKE2b-256	`d0bea66b1505a698f34bde4be210df6fe9f6749613221be3d2b2004daa101fc6`

See more details on using hashes here.

File details

Details for the file chainollama-0.7-py3-none-any.whl.

File metadata

Download URL: chainollama-0.7-py3-none-any.whl
Upload date: Oct 30, 2024
Size: 9.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for chainollama-0.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`359495f0f5581cecb3b28ed725dc3bb89141ea4a01a122058ae67dc20301bed0`
MD5	`016a6329228a2a5fe79fa1bcf417ab76`
BLAKE2b-256	`3735edd339cb1388df5fcfd5339be8529384ac7b73c3328773d84d0c9b81396b`

See more details on using hashes here.

chainollama 0.7

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

chainollama

Important Details About the Chain Function

Fine-Tuning Plans

Features

Response Handling Callbacks

Examples

Example 1: Adding a Function to ChatToolKit

Example 2: Using ChainEngine to Respond with a Tool Call

Example 3: Handling Multiple Tools

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes