pyplexityai

A clean, lightweight, and modern Python client for the Perplexity AI API with OpenAI compatibility. Built with PyPy and JIT-GIL enhancements for optimal performance.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gweidart

These details have not been verified by PyPI

Project description

PyPlexityAI

Static Badge

A clean, lightweight, and modern Python client for PerplexityAI's API.

Table of Contents (Click to expand)

PyPlexityAI

Performance

Compatibility

Installation

Basic Usage

Features

Search Parameters

Model Matrix

Current Generation (Recommended)

Model Selection Guide

Error Handling

Streaming Support

Basic Streaming

OpenAI-Compatible Streaming

Async Streaming

Stream Processing

OpenAI Compatibility

Batch Processing

Model Selection

Search Modes and Focus Areas

Development

License

Sync/Async API interfaces
OpenAI-compatible streaming
Model selection guide with performance benchmarks
Robust error handling with detailed diagnostics

Performance

PyPy-optimized core components
Connection pooling & automatic retries
Efficient streaming support
Zero-copy parsing for high throughput

Compatibility

Full OpenAI API schema support
Automatic type conversion
Context manager interfaces
Cross-platform compatibility

Installation

pip install pyplexityai  # Basic installation
pip install pyplexityai[async]  # With additional async support

Basic Usage

from pyplexityai import PerplexityClient
from pyplexityai.errors import AuthenticationError

# Initialize with your API key
client = PerplexityClient("your-api-key")

try:
    # Simple synchronous search
    result = client.search_sync("What is quantum computing?")
    print(result["text"])
finally:
    client.close()

# Or use as a context manager (recommended)
with PerplexityClient("your-api-key") as client:
    result = client.search_sync("What is quantum computing?")
    print(result["text"])

Features

Simple API key authentication
Synchronous and streaming search modes
Multiple model options
Clean, typed codebase
Robust error handling with detailed diagnostics
OpenAI-compatible interface
Async support
No unnecessary dependencies
Built with PyPy and JIT-GIL enhancements for optimal performance
Optimized for high-throughput applications

[!IMPORTANT] This library is not affiliated with Perplexity AI. It is a community-driven project.

Search Parameters

Parameter	Type	Default	Description
`mode`	str	"concise"	"copilot" for detailed responses
`search_focus`	str	"internet"	Specialized sources (see below)
`timeout`	float	30.0	Seconds before timeout
`model`	str	"sonar-pro"	Model to use (see Model Matrix)

Search Focus Options: internet, scholar, writing, wolfram, youtube, reddit

Model Matrix

Model	Context	Speed	Output Limit	Best For	Type
sonar-reasoning-pro	127k	⚡⚡⚡	8k	Complex reasoning	Chat Completion
sonar-pro	200k	⚡⚡	8k	General purpose	Chat Completion
sonar-reasoning	127k	⚡⚡⚡	4k	Quick CoT responses	Chat Completion
sonar	127k	⚡⚡	4k	Everyday conversations	Chat Completion

Current Generation (Recommended)

sonar-reasoning-pro: Best for complex chain-of-thought reasoning (8k output limit)
sonar-pro: General purpose model with 200k context window
sonar-reasoning: Fast chain-of-thought responses
sonar: Balanced everyday use model

Legacy Models (Deprecating February 22, 2025)

llama-3.1-sonar-small-128k-online
llama-3.1-sonar-large-128k-online
llama-3.1-sonar-huge-128k-online

Model Selection Guide

Default model: sonar-pro (recommended for most use cases)
Complex reasoning: sonar-reasoning-pro
Quick responses: sonar-reasoning
General chat: sonar
Maximum context: sonar-pro (200k tokens)

Error Handling

The client uses robust error handling with detailed diagnostics:

AuthenticationError: API key validation issues
InvalidParameterError: Invalid parameter values
SearchError: Search request failures
PerplexityTimeoutError: Request timeouts
WebSocketError: WebSocket connection issues

from pyplexityai import PerplexityClient
from pyplexityai.errors import (
    AuthenticationError,
    SearchError,
    InvalidParameterError,
    PerplexityTimeoutError,
)

Each error includes:

Descriptive message
Detailed causes
Helpful hints
Additional notes
Error codes for programmatic handling

try:
    with PerplexityClient("your-api-key") as client:
        result = client.search_sync("What is quantum computing?")
except AuthenticationError as e:
    print(f"Authentication failed: {e.message}")
    print(f"Hint: {e.hint_stmt}")
except InvalidParameterError as e:
    print(f"Invalid parameter: {e.message}")
    print(f"Valid values: {e.causes[-1]}")
except SearchError as e:
    print(f"Search failed: {e.message}")
    print(f"Causes: {e.causes}")
except PerplexityTimeoutError as e:
    print(f"Request timed out: {e.message}")

Streaming Support

Basic Streaming

from pyplexityai import PerplexityClient

with PerplexityClient("your-api-key") as client:
    # Stream responses chunk by chunk
    for chunk in client.search("What is quantum computing?"):
        if "text" in chunk:
            print(chunk["text"], end="", flush=True)

OpenAI-Compatible Streaming

from pyplexityai import OpenAICompatibleClient, ChatMessage

client = OpenAICompatibleClient("your-api-key")

messages = [
    ChatMessage(role="user", content="Explain quantum computing", name=None)
]

# Stream tokens as they arrive
for response in client.create_chat_completion(
    messages=messages,
    model="sonar-pro",
    stream=True,
):
    if "delta" in response["choices"][0]:
        if content := response["choices"][0]["delta"].get("content"):
            print(content, end="", flush=True)

Async Streaming

from pyplexityai import AsyncPerplexityClient

async with AsyncPerplexityClient("your-api-key") as client:
    # Stream responses asynchronously
    async for chunk in client.async_search("What is quantum computing?"):
        if "text" in chunk:
            print(chunk["text"], end="", flush=True)

OpenAI-Compatible Streaming

from pyplexityai import OpenAICompatibleClient
from pyplexityai.client_types import ChatMessage

async with OpenAICompatibleClient("your-api-key") as client:
    messages = [
        ChatMessage(role="user", content="Explain quantum computing", name=None)
    ]

    # Stream tokens as they arrive
    async for response in client.acreate_chat_completion(
        messages=messages,
        model="sonar-pro",
        stream=True,
    ):
        if content := response.choices[0].delta.content:
            print(content, end="", flush=True)

Batch Processing

from pyplexityai import AsyncPerplexityClient
from pyplexityai.errors import PerplexityTimeoutError

async def batch_search(queries: list[str], api_key: str) -> dict[str, str]:
    results = {}
    async with AsyncPerplexityClient(api_key) as client:
        for query in queries:
            try:
                result = await client.async_search_sync(
                    query,
                    timeout=45.0  # Longer timeout for batch
                )
                results[query] = result["text"]
            except PerplexityTimeoutError:
                results[query] = "Error: timeout"
                continue
    return results

# Example batch processing
queries = [
    "What is Python?",
    "What is JavaScript?",
    "What is Rust?"
]
results = await batch_search(queries, "your-api-key")
for query, result in results.items():
    print(f"\nQuery: {query}")
    print(f"Result: {result}")

Model Selection

from pyplexityai import PerplexityClient

with PerplexityClient("your-api-key") as client:
    # Use reasoning-optimized model
    result = client.search_sync(
        "Explain the implications of quantum entanglement",
        model="sonar-reasoning-pro"
    )
    print(result["text"])

    # Use general purpose model with streaming
    for chunk in client.search(
        "Write a Python function to implement quicksort",
        model="sonar-pro"
    ):
        if "text" in chunk:
            print(chunk["text"], end="", flush=True)

Search Modes and Focus Areas

from pyplexityai import AsyncPerplexityClient

async with AsyncPerplexityClient("your-api-key") as client:
    # Copilot mode (more detailed responses)
    async for chunk in client.async_search(
        query="Explain how BERT works",
        mode="copilot",
        model="sonar-pro"
    ):
        if "text" in chunk:
            print(chunk["text"], end="", flush=True)

    # Scholar focus (academic sources)
    result = await client.async_search_sync(
        query="Latest developments in fusion energy",
        mode="concise",
        search_focus="scholar"
    )
    print(result["text"])

Development

# Create virtual environment
uv venv --python pypy

# Activate virtual environment
source .venv/bin/activate

# Install development dependencies
uv pip install -e ".[dev]"

# Run tests
uv run test.py  # Run all tests
uv run test.py errors  # Run specific test(s)

# Run type checks
pyright  # Strict type checking

# Run linter
ruff check . --fix --unsafe-fixes

# Run formatter
ruff format .

License

This project is licensed under the MIT License. See the LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gweidart

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Feb 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyplexityai-0.1.0.tar.gz (427.1 kB view details)

Uploaded Feb 10, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pyplexityai-0.1.0-py3-none-any.whl (21.2 kB view details)

Uploaded Feb 10, 2025 Python 3

File details

Details for the file pyplexityai-0.1.0.tar.gz.

File metadata

Download URL: pyplexityai-0.1.0.tar.gz
Upload date: Feb 10, 2025
Size: 427.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for pyplexityai-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`28a6535439568ec41d44345f8a1d3e350400faaaa29dbbc294765bb2b318046f`
MD5	`196e8bf48d848f29eae88f1a10cd37de`
BLAKE2b-256	`b02ce76f4113f16d3b99654abbee49498fcfb69550478cf82f21635b247387ca`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyplexityai-0.1.0.tar.gz:

Publisher: release.yml on gweidart/pyplexityai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pyplexityai-0.1.0.tar.gz
- Subject digest: 28a6535439568ec41d44345f8a1d3e350400faaaa29dbbc294765bb2b318046f
- Sigstore transparency entry: 170079760
- Sigstore integration time: Feb 10, 2025
Source repository:
- Permalink: gweidart/pyplexityai@b1cd13e7173e84a0eb6e4fd07926b5fae82bddb2
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/gweidart
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@b1cd13e7173e84a0eb6e4fd07926b5fae82bddb2
- Trigger Event: push

File details

Details for the file pyplexityai-0.1.0-py3-none-any.whl.

File metadata

Download URL: pyplexityai-0.1.0-py3-none-any.whl
Upload date: Feb 10, 2025
Size: 21.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for pyplexityai-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`68597bca429769f4156210619c633cbcb10fefc8078c478c734a2459c5469088`
MD5	`d0202133936cd4124828f6c57ec7556d`
BLAKE2b-256	`21e3f389e8a22bed6b986977aa7b40574bc8dfe81b4406f39143e4abe3419b96`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyplexityai-0.1.0-py3-none-any.whl:

Publisher: release.yml on gweidart/pyplexityai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pyplexityai-0.1.0-py3-none-any.whl
- Subject digest: 68597bca429769f4156210619c633cbcb10fefc8078c478c734a2459c5469088
- Sigstore transparency entry: 170079762
- Sigstore integration time: Feb 10, 2025
Source repository:
- Permalink: gweidart/pyplexityai@b1cd13e7173e84a0eb6e4fd07926b5fae82bddb2
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/gweidart
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@b1cd13e7173e84a0eb6e4fd07926b5fae82bddb2
- Trigger Event: push

pyplexityai 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

PyPlexityAI

Performance

Compatibility

Installation

Basic Usage

Features

Search Parameters

Model Matrix

Current Generation (Recommended)

Legacy Models (Deprecating February 22, 2025)

Model Selection Guide

Error Handling

Streaming Support

Basic Streaming

OpenAI-Compatible Streaming

Async Streaming

OpenAI-Compatible Streaming

Batch Processing

Model Selection

Search Modes and Focus Areas

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance