Skip to main content

A lightweight Python package for creating agents on Langchain for Agentic platforms

Project description

A lightweight framework to create "microagents" based on the LangChain BaseModel interface.

The idea is to make it easy to a multi-agent platform.

Usage

Basic Example

The following deploys an agent based on a self-hosted hugging face model as an endpoint.

import uvicorn
from fastapi import FastAPI
from fastapi.responses import StreamingResponse
from transformers import pipeline, AutoModelForCausalLM, AutoTokenizer
from langchain_huggingface.llms import HuggingFacePipeline
from agents_on_langchain.base_agent import BaseAgent

model = AutoModelForCausalLM.from_pretrained('mistralai/Mistral-7B-Instruct-v0.3',
                                             torch_dtype=torch.bfloat16,
                                             device_map="auto")
tokenizer = AutoTokenizer.from_pretrained('mistralai/Mistral-7B-Instruct-v0.3')


class RequestClassifierAgent(BaseAgent):
    version = '01'
    base_llm = HuggingFacePipeline(
        pipeline=pipeline(
            "text-generation",
            model=model,
            tokenizer=tokenizer,
            temperature=0.1,
            max_new_tokens=64,
            do_sample=True
        )
    )

    q_and_a = [
        ("Tell me about the languages of Eberron.", "lookup"),
        ("War veteran", "character"),
        ("Create a House Cannith item.", "original"),
        ("Create the details of a town on the border between Zilargo and Breland.", "original"),
        ("Groggy comic relief", "character"),
        ("Tell me about fashion in the five nations.", "lookup"),
        ("Find for me a magic lipstick.", "lookup"),
    ]

    def listen(self, context: str) -> bool:
        """This agent does not listen to any contextual information"""
        return False

    def __init__(self):
        pass

    def _retrieve(self, q: str) -> List[Tuple[str, dict]]:
        return []

    def _prompt(self, q: str):
        return dedent(f"""[INST]
        User is making a request. Classify the request into one of the three categories. Response with only one word.
        If the user is asking to create some original content, respond with the word "original".
        If this is a request to create a character or NPC, or it looks like it is decribing a D&D character, respond with the word "character".
        If the user is making a request to find out information, respond with the word "lookup".

        Q:
        {q}

        A:
        [/INST]""")

    def respond(self, q: str) -> Iterable[str]:
        category = self.base_llm.bind(skip_prompt=True).invoke(self._prompt(q))

        yield category.strip()

    def run() -> None:
        pass


class Message(BaseModel):
    content: str


request_classifier = RequestClassifierAgent()
app = FastAPI()


@app.post("/classify")
def classify(message: Message):
    generator = supervisor.respond(message.content)
    return StreamingResponse(request_classifier, media_type="text/event-stream")

Complex Exmaple

See here: https://github.com/sinan-ozel/eberron-llm/tree/main/multi-agent-servers/eberron-agent-server/app

Philosophy

I went with three guiding principles in writing this model.

  1. Everything is an agent: tools, buses, orchestrators are all agents.
  2. Final code should show the flow: You should be able scale while being able to see how agents connect to each other. This means that each agent relationship should be at most one line of code.
  3. Agents are minimal building blocks: one prompt per agent, one vector store per agent, one model per agent. In other words, they are microagents.

Deployment

The mode of deployment is as follows: Make sure that there is a model and an API, based on the BaseModelin LangChain. Put all agents in the same piece of code. Run this code on a loop - the loop can also be an "orchestrator" agent.

Note that you can also host the model on the same pod / instance / computer as the agents. This is how I (the author) tested it.

Development

Requirements

Just Docker. If you want to develop, you can use the .devcontainer on VS Code, you don't need to install anything.

This works with VS Code, however, if you want to use another IDE, you can also use the Dockerfile.dev to create your development environment.

Testing

Run:

docker-compose run --rm --build test

Contributing

  1. Branch out
  2. Add new code.
  3. Add tests.
  4. Push.
  5. Make a pull request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agents_on_langchain-0.1.0.dev202502150302.tar.gz (15.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file agents_on_langchain-0.1.0.dev202502150302.tar.gz.

File metadata

File hashes

Hashes for agents_on_langchain-0.1.0.dev202502150302.tar.gz
Algorithm Hash digest
SHA256 44579ba1cb58a10c92d7237a5e830b80723fc4b4194c3a0ff89d528310e75795
MD5 f34c4c01474ccaf79cf16c51059146e3
BLAKE2b-256 ad63f60774526fc761465dfd15f8ca4576d859730f3075d0ae0c5a7d458ab5d1

See more details on using hashes here.

File details

Details for the file agents_on_langchain-0.1.0.dev202502150302-py3-none-any.whl.

File metadata

File hashes

Hashes for agents_on_langchain-0.1.0.dev202502150302-py3-none-any.whl
Algorithm Hash digest
SHA256 78907b7de6b037a053c5c577206e5e19e9071a99ee27568642533060bc110fed
MD5 4272773013abf60e140e71c45d9324cd
BLAKE2b-256 5f2a576113b4784c7647b1204a217ce044c9a6e7dcceba5278cd2357dfb6add2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page