Skip to main content

A lightweight Python package for creating agents on Langchain for Agentic platforms

Project description

A lightweight framework to create "microagents" based on the LangChain BaseModel interface.

The idea is to make it easy to a multi-agent platform.

Usage

Basic Example

The following deploys an agent based on a self-hosted hugging face model as an endpoint.

import uvicorn
from fastapi import FastAPI
from fastapi.responses import StreamingResponse
from transformers import pipeline, AutoModelForCausalLM, AutoTokenizer
from langchain_huggingface.llms import HuggingFacePipeline
from agents_on_langchain.base_agent import BaseAgent

model = AutoModelForCausalLM.from_pretrained('mistralai/Mistral-7B-Instruct-v0.3',
                                             torch_dtype=torch.bfloat16,
                                             device_map="auto")
tokenizer = AutoTokenizer.from_pretrained('mistralai/Mistral-7B-Instruct-v0.3')


class RequestClassifierAgent(BaseAgent):
    version = '01'
    base_llm = HuggingFacePipeline(
        pipeline=pipeline(
            "text-generation",
            model=model,
            tokenizer=tokenizer,
            temperature=0.1,
            max_new_tokens=64,
            do_sample=True
        )
    )

    q_and_a = [
        ("Tell me about the languages of Eberron.", "lookup"),
        ("War veteran", "character"),
        ("Create a House Cannith item.", "original"),
        ("Create the details of a town on the border between Zilargo and Breland.", "original"),
        ("Groggy comic relief", "character"),
        ("Tell me about fashion in the five nations.", "lookup"),
        ("Find for me a magic lipstick.", "lookup"),
    ]

    def listen(self, context: str) -> bool:
        """This agent does not listen to any contextual information"""
        return False

    def __init__(self):
        pass

    def _retrieve(self, q: str) -> List[Tuple[str, dict]]:
        return []

    def _prompt(self, q: str):
        return dedent(f"""[INST]
        User is making a request. Classify the request into one of the three categories. Response with only one word.
        If the user is asking to create some original content, respond with the word "original".
        If this is a request to create a character or NPC, or it looks like it is decribing a D&D character, respond with the word "character".
        If the user is making a request to find out information, respond with the word "lookup".

        Q:
        {q}

        A:
        [/INST]""")

    def respond(self, q: str) -> Iterable[str]:
        category = self.base_llm.bind(skip_prompt=True).invoke(self._prompt(q))

        yield category.strip()

    def run() -> None:
        pass


class Message(BaseModel):
    content: str


request_classifier = RequestClassifierAgent()
app = FastAPI()


@app.post("/classify")
def classify(message: Message):
    generator = supervisor.respond(message.content)
    return StreamingResponse(request_classifier, media_type="text/event-stream")

Complex Exmaple

See here: https://github.com/sinan-ozel/eberron-llm/tree/main/multi-agent-servers/eberron-agent-server/app

Philosophy

I went with three guiding principles in writing this model.

  1. Everything is an agent: tools, buses, orchestrators are all agents.
  2. Final code should show the flow: You should be able scale while being able to see how agents connect to each other. This means that each agent relationship should be at most one line of code.
  3. Agents are minimal building blocks: one prompt per agent, one vector store per agent, one model per agent. In other words, they are microagents.

Deployment

The mode of deployment is as follows: Make sure that there is a model and an API, based on the BaseModelin LangChain. Put all agents in the same piece of code. Run this code on a loop - the loop can also be an "orchestrator" agent.

Note that you can also host the model on the same pod / instance / computer as the agents. This is how I (the author) tested it.

Development

Requirements

Just Docker. If you want to develop, you can use the .devcontainer on VS Code, you don't need to install anything.

This works with VS Code, however, if you want to use another IDE, you can also use the Dockerfile.dev to create your development environment.

Testing

Run:

docker-compose run --rm --build test

Contributing

  1. Branch out
  2. Add new code.
  3. Add tests.
  4. Push.
  5. Make a pull request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agents_on_langchain-0.1.1.tar.gz (15.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agents_on_langchain-0.1.1-py3-none-any.whl (10.8 kB view details)

Uploaded Python 3

File details

Details for the file agents_on_langchain-0.1.1.tar.gz.

File metadata

  • Download URL: agents_on_langchain-0.1.1.tar.gz
  • Upload date:
  • Size: 15.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for agents_on_langchain-0.1.1.tar.gz
Algorithm Hash digest
SHA256 09bd6e637631a56ed64096001361041ab98284359a2c970671ee45a9c1e6029b
MD5 07b3affcb667b5712976a24303e9d54b
BLAKE2b-256 98b92db7f9ea172befc3c563c8d46d31e9441098f70d6cb7dbcff564f627286c

See more details on using hashes here.

File details

Details for the file agents_on_langchain-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for agents_on_langchain-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4ed96680f2f9568136234512841c3b547022d0921f73f3418c737234bcc6200a
MD5 e1b48bb71cb33afc0cbc8ffa59077707
BLAKE2b-256 4c3b2d78074ab52fa2feb912cd442597e192d3587f7f39439efd67a73d17d4fd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page