Use Gradio Apps as tools for LLM Agents

These details have not been verified by PyPI

Project description

Gradio Tools: Gradio 🤝 LLM Agents

There are many 1000s of Gradio apps on Hugging Face Spaces. This library puts them at the tips of your LLM's fingers 🦾

Specifically, gradio-tools is a Python library for converting Gradio apps into tools that can be leveraged by a large language model (LLM)-based agent to complete its task. For example, an LLM could use a Gradio tool to transcribe a voice recording it finds online and then summarize it for you. Or it could use a different Gradio tool to apply OCR to a document on your Google Drive and then answer questions about it.

Currently supported libraries for agents are:

gradio-tools comes with a set of pre-built tools you can leverage immediately! These include:

StableDiffusionTool - Generate an image from a given prompt using the open source stable diffusion demo hosted on HuggingFace spaces
ImageCaptionTool - Caption an image by providing a filepath based on Niels Rogge's HuggingFace Space
ImageToMusicTool - Create an audio clip that matches the style of a given image file based on Sylvain Filoni's HuggingFace Space
StableDiffusionPromptGeneratorTool - Use this tool to improve a prompt for stable diffusion and other image generators based on this HuggingFace Space
TextToVideoTool - A tool for creating short videos from text. Based on this HuggingFace Space
WhisperAudioTranscriptionTool - A tool for transcribing audio with Whisper. Based on this HuggingFace Space
ClipInterrogatorTool - A tool for reverse engineering a prompt from a source image. Based on this HuggingFace Space
DocQueryDocumentAnsweringTool - A tool for answering questions about a document from the from the image of the document. Based on this HuggingFace Space
BarkTextToSpeechTool - A tool for text-to-speech. Based on this HuggingFace Space

We welcome more contributions!

Example Usage

Simply import the desired tools from gradio_tools (or create your own!) and pass to initialize_agent from LangChain.

In this example, we use some pre-built tools to generate images, caption them, and create a video!

Read the How It Works section to learn how to create your own tools! We welcome any new tools to the library!

from gradio_tools import (StableDiffusionTool, ImageCaptioningTool, StableDiffusionPromptGeneratorTool,
                          TextToVideoTool)

from langchain.agents import initialize_agent
from langchain.llms import OpenAI
from langchain.memory import ConversationBufferMemory

llm = OpenAI(temperature=0)
memory = ConversationBufferMemory(memory_key="chat_history")

tools = [StableDiffusionTool().langchain, ImageCaptioningTool().langchain,
         StableDiffusionPromptGeneratorTool().langchain, TextToVideoTool().langchain]


agent = initialize_agent(tools, llm, memory=memory, agent="conversational-react-description", verbose=True)
output = agent.run(input=("Please create a photo of a dog riding a skateboard "
                          "but improve my prompt prior to using an image generator."
                          "Please caption the generated image and create a video for it using the improved prompt."))

https://user-images.githubusercontent.com/41651716/233391796-49c3f762-bd58-41d7-b399-629e3c1661b2.mp4

See the /examples directory for more complete code examples.

How it works

The core abstraction is the GradioTool, which lets you define a new tool for your LLM as long as you implement a standard interface:

class GradioTool(BaseTool):

    def __init__(self, name: str, description: str, src: str) -> None:

    @abstractmethod
    def create_job(self, query: str) -> Job:
        pass

    @abstractmethod
    def postprocess(self, output: Tuple[Any] | Any) -> str:
        pass

The requirements are:

The name for your tool
The description for your tool. This is crucial! Agents decide which tool to use based on their description. Be precise and be sure to inclue example of what the input and the output of the tool should look like.
The url or space id, e.g. freddyaboulton/calculator, of the Gradio application. Based on this value, gradio_tools will create a gradio client instance to query the upstream application via API. Be sure to click the link and learn more about the gradio client library if you are not familiar with it.
create_job - Given a string, this method should parse that string and return a job from the client. Most times, this is as simple as passing the string to the submit function of the client. More info on creating jobs here
postprocess - Given the result of the job, convert it to a string the LLM can display to the user.
Optional - Some libraries, e.g. MiniChain, may need some info about the underlying gradio input and output types used by the tool. By default, this will return gr.Textbox() but if you'd like to provide more accurate info, implement the _block_input(self, gr) and _block_output(self, gr) methods of the tool. The gr variable is the gradio module (the result of import gradio as gr). It will be automatically imported by the GradiTool parent class and passed to the _block_input and _block_output methods.

And that's it!

Appendix

What are agents?

A LangChain agent is a Large Language Model (LLM) that takes user input and reports an output based on using one of many tools at its disposal.

What is Gradio?

Gradio is the defacto standard tool for building Machine Learning Web Applications and sharing them with the world - all with just python! 🐍

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.0.9

May 12, 2023

0.0.8

May 9, 2023

0.0.7

Apr 28, 2023

0.0.6

Apr 26, 2023

0.0.5

Apr 20, 2023

0.0.4

Apr 18, 2023

0.0.3

Apr 17, 2023

0.0.2

Apr 14, 2023

0.0.1

Apr 13, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gradio_tools-0.0.9.tar.gz (9.5 MB view details)

Uploaded May 12, 2023 Source

Built Distribution

gradio_tools-0.0.9-py3-none-any.whl (15.2 kB view details)

Uploaded May 12, 2023 Python 3

File details

Details for the file gradio_tools-0.0.9.tar.gz.

File metadata

Download URL: gradio_tools-0.0.9.tar.gz
Upload date: May 12, 2023
Size: 9.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for gradio_tools-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`c8553ed40b15c74d672d6b3475293580c97172d652f1b142184a01197ce0c6ce`
MD5	`9360167815d250f48747cce79420ebfa`
BLAKE2b-256	`ac6d368bf137bbfed5b2bca089324e50a247a87d6081a82ba03fdddad8b2857b`

See more details on using hashes here.

File details

Details for the file gradio_tools-0.0.9-py3-none-any.whl.

File metadata

Download URL: gradio_tools-0.0.9-py3-none-any.whl
Upload date: May 12, 2023
Size: 15.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for gradio_tools-0.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2095889326c6c19799381e05d4ade58d646944f716e5121ae0790ae3b9408f4e`
MD5	`152078a91232829cee4189302f0c9c0d`
BLAKE2b-256	`f67cc5b3bd23c48a4a0b926fe9402c2f819422da31c977b882bbb0116cfdc9a0`

See more details on using hashes here.

gradio-tools 0.0.9

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Gradio Tools: Gradio 🤝 LLM Agents

Example Usage

How it works

Appendix

What are agents?

What is Gradio?

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes