A Python package for creating AI Assistants and AI Agents with Vectara

These details have not been verified by PyPI

Project links

Project description

vectara-agentic

✨ Overview

vectara-agentic is a Python library for developing powerful AI assistants and agents using Vectara and Agentic-RAG. It leverages the LlamaIndex Agent framework and provides helper functions to quickly create tools that connect to Vectara corpora.

Agentic RAG diagram

Features

Enables easy creation of custom AI assistants and agents.
Create a Vectara RAG tool or search tool with a single line of code.
Supports ReAct, OpenAIAgent, LATS and LLMCompiler agent types.
Includes pre-built tools for various domains (e.g., finance, legal).
Integrates with various LLM inference services like OpenAI, Anthropic, Gemini, GROQ, Together.AI, Cohere, Bedrock and Fireworks
Built-in support for observability with Arize Phoenix

📚 Example AI Assistants

Check out our example AI assistants:

Prerequisites

Vectara account
A Vectara corpus with an API key
Python 3.10 or higher
OpenAI API key (or API keys for Anthropic, TOGETHER.AI, Fireworks AI, Bedrock, Cohere, GEMINI or GROQ, if you choose to use them)

Installation

pip install vectara-agentic

🚀 Quick Start

1. Initialize the Vectara tool factory

import os
from vectara_agentic.tools import VectaraToolFactory

vec_factory = VectaraToolFactory(
    vectara_api_key=os.environ['VECTARA_API_KEY'],
    vectara_customer_id=os.environ['VECTARA_CUSTOMER_ID'],
    vectara_corpus_key=os.environ['VECTARA_CORPUS_KEY']
)

2. Create a Vectara RAG Tool

A RAG tool calls the full Vectara RAG pipeline to provide summarized responses to queries grounded in data.

from pydantic import BaseModel, Field

years = list(range(2020, 2024))
tickers = {
    "AAPL": "Apple Computer",
    "GOOG": "Google",
    "AMZN": "Amazon",
    "SNOW": "Snowflake",
}

class QueryFinancialReportsArgs(BaseModel):
    query: str = Field(..., description="The user query.")
    year: int | str = Field(..., description=f"The year this query relates to. An integer between {min(years)} and {max(years)} or a string specifying a condition on the year (example: '>2020').")
    ticker: str = Field(..., description=f"The company ticker. Must be a valid ticket symbol from the list {tickers.keys()}.")

query_financial_reports_tool = vec_factory.create_rag_tool(
    tool_name="query_financial_reports",
    tool_description="Query financial reports for a company and year",
    tool_args_schema=QueryFinancialReportsArgs,
    lambda_val=0.005,
    summary_num_results=7, 
    # Additional arguments
)

See the docs for additional arguments to customize your Vectara RAG tool.

3. Create other tools (optional)

In addition to RAG tools, you can generate a lot of other types of tools the agent can use. These could be mathematical tools, tools that call other APIs to get more information, or any other type of tool.

See Agent Tools for more information.

4. Create your agent

from vectara_agentic import Agent

agent = Agent(
    tools=[query_financial_reports_tool],
    topic="10-K financial reports",
    custom_instructions="""
        - You are a helpful financial assistant in conversation with a user. Use your financial expertise when crafting a query to the tool, to ensure you get the most accurate information.
        - You can answer questions, provide insights, or summarize any information from financial reports.
        - A user may refer to a company's ticker instead of its full name - consider those the same when a user is asking about a company.
        - When calculating a financial metric, make sure you have all the information from tools to complete the calculation.
        - In many cases you may need to query tools on each sub-metric separately before computing the final metric.
        - When using a tool to obtain financial data, consider the fact that information for a certain year may be reported in the following year's report.
        - Report financial data in a consistent manner. For example if you report revenue in thousands, always report revenue in thousands.
    """
)

See the docs for additional arguments, including agent_progress_callback and query_logging_callback.

5. Run your agent

res = agent.chat("What was the revenue for Apple in 2021?")
print(res.response)

Note that:

vectara-agentic also supports achat() and two streaming variants stream_chat() and astream_chat().
The response types from chat() and achat() are of type AgentResponse. If you just need the actual string response it's available as the response variable, or just use str(). For advanced use-cases you can look at other AgentResponse variables such as sources.

🧰 Vectara tools

vectara-agentic provides two helper functions to connect with Vectara RAG

create_rag_tool() to create an agent tool that connects with a Vectara corpus for querying.
create_search_tool() to create a tool to search a Vectara corpus and return a list of matching documents.

See the documentation for the full list of arguments for create_rag_tool() and create_search_tool(), to understand how to configure Vectara query performed by those tools.

Creating a Vectara RAG tool

A Vectara RAG tool is often the main workhorse for any Agentic RAG application, and enables the agent to query one or more Vectara RAG corpora.

The tool generated always includes the query argument, followed by 1 or more optional arguments used for metadata filtering, defined by tool_args_schema.

For example, in the quickstart example the schema is:

class QueryFinancialReportsArgs(BaseModel):
    query: str = Field(..., description="The user query.")
    year: int | str = Field(..., description=f"The year this query relates to. An integer between {min(years)} and {max(years)} or a string specifying a condition on the year (example: '>2020').")
    ticker: str = Field(..., description=f"The company ticker. Must be a valid ticket symbol from the list {tickers.keys()}.")

The query is required and is always the query string. The other arguments are optional and will be interpreted as Vectara metadata filters.

For example, in the example above, the agent may call the query_financial_reports_tool tool with query='what is the revenue?', year=2022 and ticker='AAPL'. Subsequently the RAG tool will issue a Vectara RAG query with the same query, but with metadata filtering (doc.year=2022 and doc.ticker='AAPL').

There are also additional cool features supported here:

An argument can be a condition, for example year='>2022' translates to the correct metadata filtering condition doc.year>2022
if fixed_filter is defined in the RAG tool, it provides a constant metadata filtering that is always applied. For example, if fixed_filter=doc.filing_type='10K' then a query with query='what is the reveue', year=2022 and ticker='AAPL' would translate into query='what is the revenue' with metadata filtering condition of "doc.year=2022 AND doc.ticker='AAPL' and doc.filing_type='10K'"

Note that tool_args_type is an optional dictionary that indicates the level at which metadata filtering is applied for each argument (doc or part)

Creating a Vectara search tool

The Vectara search tool allows the agent to list documents that match a query. This can be helpful to the agent to answer queries like "how many documents discuss the iPhone?" or other similar queries that require a response in terms of a list of matching documents.

🛠️ Agent Tools at a Glance

vectara-agentic provides a few tools out of the box (see ToolsCatalog for details):

Standard tools:

summarize_text: a tool to summarize a long text into a shorter summary (uses LLM)
rephrase_text: a tool to rephrase a given text, given a set of rephrase instructions (uses LLM) These tools use an LLM and so would use the Tools LLM specified in your AgentConfig. To instantiate them:

from vectara_agentic.tools_catalog import ToolsCatalog
summarize_text = ToolsCatalog(agent_config).summarize_text

This ensures the summarize_text tool is configured with the proper LLM provider and model as specified in the Agent configuration.

Legal tools: a set of tools for the legal vertical, such as:

summarize_legal_text: summarize legal text with a certain point of view
critique_as_judge: critique a legal text as a judge, providing their perspective

Financial tools: based on tools from Yahoo! Finance:

tools to understand the financials of a public company like: balance_sheet, income_statement, cash_flow
stock_news: provides news about a company
stock_analyst_recommendations: provides stock analyst recommendations for a company.

Database tools: providing tools to inspect and query a database

list_tables: list all tables in the database
describe_tables: describe the schema of tables in the database
load_data: returns data based on a SQL query
load_sample_data: returns the first 25 rows of a table
load_unique_values: returns the top unique values for a given column

In addition, we include various other tools from LlamaIndex ToolSpecs:

Tavily search and EXA.AI
arxiv
neo4j & Kuzu for Graph DB integration
Google tools (including gmail, calendar, and search)
Slack

Note that some of these tools may require API keys as environment variables

You can create your own tool directly from a Python function using the create_tool() method of the ToolsFactory class:

def mult_func(x, y):
    return x * y

mult_tool = ToolsFactory().create_tool(mult_func)

Note: When you define your own Python functions as tools, implement them at the top module level, and not as nested functions. Nested functions are not supported if you use serialization (dumps/loads or from_dict/to_dict).

🛠️ Configuration

Configuring Vectara-agentic

The main way to control the behavior of vectara-agentic is by passing an AgentConfig object to your Agent when creating it. For example:

agent_config = AgentConfig(
    agent_type = AgentType.REACT,
    main_llm_provider = ModelProvider.ANTHROPIC,
    main_llm_model_name = 'claude-3-5-sonnet-20241022',
    tool_llm_provider = ModelProvider.TOGETHER,
    tool_llm_model_name = 'meta-llama/Llama-3.3-70B-Instruct-Turbo'
)

agent = Agent(
    tools=[query_financial_reports_tool],
    topic="10-K financial reports",
    custom_instructions="You are a helpful financial assistant in conversation with a user.",
    agent_config=agent_config
)

The AgentConfig object may include the following items:

agent_type: the agent type. Valid values are REACT, LLMCOMPILER, LATS or OPENAI (default: OPENAI).
main_llm_provider and tool_llm_provider: the LLM provider for main agent and for the tools. Valid values are OPENAI, ANTHROPIC, TOGETHER, GROQ, COHERE, BEDROCK, GEMINI or FIREWORKS (default: OPENAI).
main_llm_model_name and tool_llm_model_name: agent model name for agent and tools (default depends on provider).
observer: the observer type; should be ARIZE_PHOENIX or if undefined no observation framework will be used.
endpoint_api_key: a secret key if using the API endpoint option (defaults to dev-api-key)

If any of these are not provided, AgentConfig first tries to read the values from the OS environment.

Configuring Vectara RAG or search tools

When creating a VectaraToolFactory, you can pass in a vectara_api_key, vectara_customer_id, and vectara_corpus_id to the factory.

If not passed in, it will be taken from the environment variables (VECTARA_API_KEY and VECTARA_CORPUS_KEY). Note that VECTARA_CORPUS_KEY can be a single KEY or a comma-separated list of KEYs (if you want to query multiple corpora).

These values will be used as credentials when creating Vectara tools - in create_rag_tool() and create_search_tool().

Setting up a privately hosted LLM

If you want to setup vectara-agentic to use your own self-hosted LLM endpoint, follow the example below

        config = AgentConfig(
            agent_type=AgentType.REACT,
            main_llm_provider=ModelProvider.PRIVATE,
            main_llm_model_name="meta-llama/Meta-Llama-3.1-8B-Instruct",
            private_llm_api_base="http://vllm-server.company.com/v1",
            private_llm_api_key="TEST_API_KEY",
        )
        agent = Agent(agent_config=config, tools=tools, topic=topic,
                      custom_instructions=custom_instructions)

In this case we specify the Main LLM provider to be privately hosted with Llama-3.1-8B as the model.

The ModelProvider.PRIVATE specifies a privately hosted LLM.
The private_llm_api_base specifies the api endpoint to use, and the private_llm_api_key specifies the private API key requires to use this service.

ℹ️ Additional Information

About Custom Instructions for your Agent

The custom instructions you provide to the agent guide its behavior. Here are some guidelines when creating your instructions:

Write precise and clear instructions, without overcomplicating.
Consider edge cases and unusual or atypical scenarios.
Be cautious to not over-specify behavior based on your primary use-case, as it may limit the agent's ability to behave properly in others.

Diagnostics

The Agent class defines a few helpful methods to help you understand the internals of your application.

The report() method prints out the agent object’s type, the tools, and the LLMs used for the main agent and tool calling.
The token_counts() method tells you how many tokens you have used in the current session for both the main agent and tool calling LLMs. This can be helpful if you want to track spend by token.

Serialization

The Agent class supports serialization. Use the dumps() to serialize and loads() to read back from a serialized stream.

Note: due to cloudpickle limitations, if a tool contains Python weakref objects, serialization won't work and an exception will be raised.

Observability

vectara-agentic supports observability via the existing integration of LlamaIndex and Arize Phoenix. First, set VECTARA_AGENTIC_OBSERVER_TYPE to ARIZE_PHOENIX in AgentConfig (or env variable).

Then you can use Arize Phoenix in three ways:

Locally.
1. If you have a local phoenix server that you've run using e.g. python -m phoenix.server.main serve, vectara-agentic will send all traces to it.
2. If not, vectara-agentic will run a local instance during the agent's lifecycle, and will close it when finished.
3. In both cases, traces will be sent to the local instance, and you can see the dashboard at http://localhost:6006
Hosted Instance. In this case the traces are sent to the Phoenix instances hosted on Arize.
1. Go to https://app.phoenix.arize.com, setup an account if you don't have one.
2. create an API key and put it in the PHOENIX_API_KEY environment variable - this indicates you want to use the hosted version.
3. To view the traces go to https://app.phoenix.arize.com.

Now when you run your agent, all call traces are sent to Phoenix and recorded. In addition, vectara-agentic also records FCS (factual consistency score, aka HHEM) values into Arize for every Vectara RAG call. You can see those results in the Feedback column of the arize UI.

🌐 API Endpoint

vectara-agentic can be easily hosted locally or on a remote machine behind an API endpoint, by following theses steps:

Step 1: Setup your API key

Ensure that you have your API key set up as an environment variable:

export VECTARA_AGENTIC_API_KEY=<YOUR-ENDPOINT-API-KEY>

if you don't specify an Endpoint API key it uses the default "dev-api-key".

Step 2: Start the API Server

Initialize the agent and start the FastAPI server by following this example:

from vectara_agentic.agent import Agent
from vectara_agentic.agent_endpoint import start_app
agent = Agent(...)            # Initialize your agent with appropriate parameters
start_app(agent)

You can customize the host and port by passing them as arguments to start_app():

Default: host="0.0.0.0" and port=8000. For example:

start_app(agent, host="0.0.0.0", port=8000)

Step 3: Access the API Endpoint

Once the server is running, you can interact with it using curl or any HTTP client. For example:

curl -G "http://<remote-server-ip>:8000/chat" \
--data-urlencode "message=What is Vectara?" \
-H "X-API-Key: <YOUR-ENDPOINT-API-KEY>"

🤝 Contributing

We welcome contributions! Please see our contributing guide for more information.

📝 License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

📞 Contact

Website: vectara.com
Twitter: @vectara
GitHub: @vectara
LinkedIn: @vectara
Discord: Join our community

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.4.10

Nov 10, 2025

0.4.9

Oct 7, 2025

0.4.8

Sep 26, 2025

0.4.7

Sep 17, 2025

0.4.6

Aug 30, 2025

0.4.5

Aug 27, 2025

0.4.4

Aug 27, 2025

0.4.3

Aug 18, 2025

0.4.2

Aug 12, 2025

0.4.1

Aug 7, 2025

0.4.0

Aug 1, 2025

0.3.3

Jul 23, 2025

0.3.2

Jul 21, 2025

0.3.1

Jul 18, 2025

0.3.0

Jul 16, 2025

0.2.24

Jul 9, 2025

0.2.23

Jul 7, 2025

0.2.22

Jun 30, 2025

0.2.21

Jun 8, 2025

0.2.20

Jun 3, 2025

0.2.19

May 29, 2025

0.2.18

May 26, 2025

0.2.17

May 17, 2025

0.2.16

May 7, 2025

0.2.15

Apr 29, 2025

0.2.14

Apr 25, 2025

0.2.13

Apr 20, 2025

0.2.12

Apr 16, 2025

0.2.11

Apr 11, 2025

0.2.10

Apr 10, 2025

0.2.9

Apr 4, 2025

0.2.8

Mar 28, 2025

0.2.7

Mar 26, 2025

0.2.6

Mar 22, 2025

0.2.5

Mar 19, 2025

0.2.4

Mar 5, 2025

0.2.3

Mar 5, 2025

0.2.2

Mar 2, 2025

This version

0.2.1

Feb 21, 2025

0.2.0

Feb 15, 2025

0.1.28

Feb 13, 2025

0.1.27

Feb 9, 2025

0.1.26

Feb 7, 2025

0.1.25

Feb 5, 2025

0.1.24

Jan 10, 2025

0.1.23

Dec 28, 2024

0.1.22

Dec 21, 2024

0.1.21

Dec 19, 2024

0.1.20

Dec 2, 2024

0.1.19

Nov 8, 2024

0.1.18

Nov 1, 2024

0.1.17

Oct 25, 2024

0.1.16

Oct 21, 2024

0.1.15

Oct 9, 2024

0.1.14 yanked

Oct 9, 2024

Reason this release was yanked:

Issue with guardrail tools

0.1.13

Sep 28, 2024

0.1.12 yanked

Sep 27, 2024

Reason this release was yanked:

Issue with requirements may cause installation to fail

0.1.11 yanked

Sep 26, 2024

Reason this release was yanked:

Missing a requirements

0.1.10

Sep 24, 2024

0.1.9

Sep 20, 2024

0.1.8

Sep 17, 2024

0.1.7

Sep 6, 2024

0.1.6

Aug 21, 2024

0.1.5

Aug 19, 2024

0.1.4

Aug 14, 2024

0.1.3

Aug 14, 2024

0.1.2

Aug 14, 2024

0.1.1

Aug 13, 2024

0.1.0

Aug 13, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vectara_agentic-0.2.1.tar.gz (50.3 kB view details)

Uploaded Feb 21, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vectara_agentic-0.2.1-py3-none-any.whl (47.9 kB view details)

Uploaded Feb 21, 2025 Python 3

File details

Details for the file vectara_agentic-0.2.1.tar.gz.

File metadata

Download URL: vectara_agentic-0.2.1.tar.gz
Upload date: Feb 21, 2025
Size: 50.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for vectara_agentic-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`b8578a2d25ea1c295938aad7e86a898e636c5897be5565aa76d61683f9bb2131`
MD5	`5b7c324404446285e5727b66ec16cbec`
BLAKE2b-256	`220e206f7201b6a44e8ff228fb775c21c3752118ebb792fdcb11cd71c75fd98f`

See more details on using hashes here.

File details

Details for the file vectara_agentic-0.2.1-py3-none-any.whl.

File metadata

Download URL: vectara_agentic-0.2.1-py3-none-any.whl
Upload date: Feb 21, 2025
Size: 47.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for vectara_agentic-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`27ab29927ffcc00f14b46930df5dcb033077f14240743f9be72c36520a444a49`
MD5	`26f667fc0844ed38056c89f4139d55bf`
BLAKE2b-256	`78068fc05d809af9ceb3fbe0ea588aa4338b2020eeba06aebfceb18cc7688ccc`

See more details on using hashes here.

vectara-agentic 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

vectara-agentic

✨ Overview

Features

📚 Example AI Assistants

Prerequisites

Installation

🚀 Quick Start

1. Initialize the Vectara tool factory

2. Create a Vectara RAG Tool

3. Create other tools (optional)

4. Create your agent

5. Run your agent

🧰 Vectara tools

Creating a Vectara RAG tool

Creating a Vectara search tool

🛠️ Agent Tools at a Glance

🛠️ Configuration

Configuring Vectara-agentic

Configuring Vectara RAG or search tools

Setting up a privately hosted LLM

ℹ️ Additional Information

About Custom Instructions for your Agent

Diagnostics

Serialization

Observability

🌐 API Endpoint

Step 1: Setup your API key

Step 2: Start the API Server

Step 3: Access the API Endpoint

🤝 Contributing

📝 License

📞 Contact

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes