llama-index llms perplexity integration


Project description

LlamaIndex Llms Integration: Perplexity

The Perplexity integration for LlamaIndex lets you tap into real-time generative search powered by the Perplexity API. It supports synchronous and asynchronous chat completions, as well as streaming responses for both.

Installation

To install the required packages, run the following in a notebook cell (in a regular shell, drop the leading % or !):

%pip install llama-index-llms-perplexity
!pip install llama-index

Setup

Import Libraries and Configure API Key

Please refer to the official Perplexity API documentation to get started; it outlines the steps for generating your API key.

Import the necessary libraries and set your Perplexity API key:

from llama_index.llms.perplexity import Perplexity

pplx_api_key = "your-perplexity-api-key"  # Replace with your actual API key
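Hardcoding the key is fine for a quick test, but it is easy to commit by accident. As a minimal sketch, the key could instead be read from an environment variable. The helper name load_pplx_api_key and the variable name PPLX_API_KEY are assumptions made for this example, not names defined by the integration.

```python
import os


def load_pplx_api_key(env_var: str = "PPLX_API_KEY") -> str:
    """Return the API key stored in `env_var` (an assumed name), or raise if unset."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your Perplexity API key."
        )
    return key
```

With the variable exported in your shell, pplx_api_key = load_pplx_api_key() can replace the hardcoded assignment above.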

Initialize the Perplexity LLM

Create an instance of the Perplexity LLM with your API key and desired model settings:

llm = Perplexity(api_key=pplx_api_key, model="sonar-pro", temperature=0.2)

Chat Example

Sending a Chat Message

Use the chat method to send a chat message:

from llama_index.core.llms import ChatMessage

messages_dict = [
    {"role": "system", "content": "Be precise and concise."},
    {
        "role": "user",
        "content": "What is the weather like in San Francisco today?",
    },
]

messages = [ChatMessage(**msg) for msg in messages_dict]

# Obtain a response from the model
response = llm.chat(messages)
print(response)

Async Chat

For asynchronous conversation processing, use the achat method to send messages and await the response. Top-level await works in a notebook; in a script, make the call from inside an async function:

response = await llm.achat(messages)
print(response)
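One reason to prefer achat is that several independent conversations can be sent concurrently. The sketch below assumes only what the examples in this document show: that llm.achat(messages) is awaitable and that the response exposes message.content. The helper name ask_all is hypothetical.

```python
import asyncio


async def ask_all(llm, conversations):
    """Send each message list to llm.achat concurrently; return the reply texts in order."""
    responses = await asyncio.gather(
        *(llm.achat(messages) for messages in conversations)
    )
    # asyncio.gather returns results in the same order as its inputs.
    return [response.message.content for response in responses]
```

In a script, this could be driven with asyncio.run(ask_all(llm, [messages, other_messages])), where other_messages is a second list of ChatMessage objects.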

Stream Chat

For cases where you want to receive a response token by token in real time, use the stream_chat method:

resp = llm.stream_chat(messages)
for r in resp:
    print(r.delta, end="")
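If you also need the complete text once streaming ends, the deltas can be accumulated while they are printed. This is a small sketch that relies only on each streamed item having a delta attribute, as in the loop above; the helper name collect_stream is hypothetical, and skipping empty deltas is a defensive assumption.

```python
def collect_stream(chunks) -> str:
    """Print each streamed delta as it arrives and return the concatenated text."""
    parts = []
    for chunk in chunks:
        delta = getattr(chunk, "delta", None)
        if delta:  # skip chunks that carry no new text
            parts.append(delta)
            print(delta, end="", flush=True)
    print()  # finish the line once the stream is exhausted
    return "".join(parts)
```

Usage would look like full_text = collect_stream(llm.stream_chat(messages)).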

Async Stream Chat

Similarly, for asynchronous streaming, the astream_chat method provides a way to process response deltas asynchronously:

resp = await llm.astream_chat(messages)
async for delta in resp:
    print(delta.delta, end="")
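The same accumulation can be done asynchronously. The sketch below follows the calling pattern shown above (await llm.astream_chat(messages), then async for over the result) and returns the joined text; collect_astream is a hypothetical helper name.

```python
import asyncio


async def collect_astream(llm, messages) -> str:
    """Consume an async chat stream and return the concatenated deltas."""
    stream = await llm.astream_chat(messages)
    parts = []
    async for chunk in stream:
        if chunk.delta:  # ignore chunks without new text
            parts.append(chunk.delta)
    return "".join(parts)
```

From a script, call it with asyncio.run(collect_astream(llm, messages)); in a notebook, await it directly.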

Tool calling

A Perplexity model can be wrapped in a LlamaIndex tool so that it can be called as part of your data processing or conversational workflows. The tool uses real-time generative search powered by Perplexity, and it is configured with the updated default model ("sonar-pro") and with the enable_search_classifier parameter enabled.

Below is an example of how to define the tool:

from llama_index.core.tools import FunctionTool
from llama_index.llms.perplexity import Perplexity
from llama_index.core.llms import ChatMessage


def query_perplexity(query: str) -> str:
    """
    Queries the Perplexity API via the LlamaIndex integration.

    This function instantiates a Perplexity LLM with updated default settings
    (using model "sonar-pro" and enabling search classifier so that the API can
    intelligently decide if a search is needed), wraps the query into a ChatMessage,
    and returns the generated response content.
    """
    pplx_api_key = (
        "your-perplexity-api-key"  # Replace with your actual API key
    )

    llm = Perplexity(
        api_key=pplx_api_key,
        model="sonar-pro",
        temperature=0.7,
        enable_search_classifier=True,  # This will determine if the search component is necessary in this particular context
    )

    messages = [ChatMessage(role="user", content=query)]
    response = llm.chat(messages)
    return response.message.content


# Create the tool from the query_perplexity function
query_perplexity_tool = FunctionTool.from_defaults(fn=query_perplexity)

LLM Implementation example

https://docs.llamaindex.ai/en/stable/examples/llm/perplexity/

Project details


Release history

0.5.1 (Mar 13, 2026)
0.5.0 (Mar 12, 2026)
0.4.2 (Sep 25, 2025)
0.4.1 (Sep 8, 2025)
0.4.0 (Jul 30, 2025)
0.3.7 (Jun 26, 2025)
0.3.6 (Jun 16, 2025)
0.3.5 (Jun 9, 2025), this version
0.3.4 (May 30, 2025)
0.3.3 (Apr 15, 2025)
0.3.2 (Dec 7, 2024)
0.3.1 (Nov 25, 2024)
0.3.0 (Nov 18, 2024)
0.2.1 (Oct 8, 2024)
0.2.0 (Aug 22, 2024)
0.1.5 (Aug 7, 2024)
0.1.4 (Jun 26, 2024)
0.1.3 (Mar 4, 2024)
0.1.2 (Feb 21, 2024)
0.1.1 (Feb 12, 2024)
0.1.0 (Feb 10, 2024)
0.0.1 (Feb 3, 2024)

Download files


Source Distribution

llama_index_llms_perplexity-0.3.5.tar.gz (6.9 kB view details)

Uploaded Jun 9, 2025 Source

Built Distribution


llama_index_llms_perplexity-0.3.5-py3-none-any.whl (6.6 kB view details)

Uploaded Jun 9, 2025 Python 3

File details

Details for the file llama_index_llms_perplexity-0.3.5.tar.gz.

File metadata

Download URL: llama_index_llms_perplexity-0.3.5.tar.gz
Upload date: Jun 9, 2025
Size: 6.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.12

File hashes

Hashes for llama_index_llms_perplexity-0.3.5.tar.gz
Algorithm	Hash digest
SHA256	`ac29cf197fd891e59ad474824e752e4afdd3af007a5c5f379fbc9f829b8a5876`
MD5	`c8dbcced03d2a672fa30aa9ebcdec11d`
BLAKE2b-256	`7dd384146b9bfca35971094c9547868cc737c06cfa7ee5585ccf653f77cad215`


File details

Details for the file llama_index_llms_perplexity-0.3.5-py3-none-any.whl.

File metadata

Download URL: llama_index_llms_perplexity-0.3.5-py3-none-any.whl
Upload date: Jun 9, 2025
Size: 6.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.12

File hashes

Hashes for llama_index_llms_perplexity-0.3.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`747b0373ef5fdf8a31cfe691c8c5020a27da0c06019c448153c4ba37a29e1df9`
MD5	`b8e3bbde08eccebe79511f2b037cfa36`
BLAKE2b-256	`b8d6fccc3a8336902ca76e386e131238836eddb52dd048bd8e72872c53eb805d`

