Api client for Grazie services

These details have not been verified by PyPI

Project description

Grazie Api Gateway Client

Note, this package is deprecated, please refer to Grazie Api Gateway Client V2 first and check if the new client library supports functionality you need.

This package provides api client for JetBrains AI Platform llm functionality. Supported methods are chat, completion and embeddings.

Support for Grazie NLP services is planned in the future.

You can try models in the browser by going to https://try.ai.intellij.net/ or using the command-line interface.

poetry run -C libs/grazie_api_gateway_client python3 -m grazie.api.client -p openai-gpt-4 chat -v 8 'Who was the most famous pop star in the 90s?'

Usage

First you have to create an instance of client, please check class documentation to know more about parameters:

client = GrazieApiGatewayClient(
    grazie_agent=GrazieAgent(name="grazie-api-gateway-client-readme", version="dev"),
    url=GrazieApiGatewayUrls.STAGING,
    auth_type=AuthType.USER,
    grazie_jwt_token=***
)

Below are examples of usage by method:

Profiles

List all available LLM profiles:

print(client.v8.profiles())

Completion

Without suffix:

client.v8.complete(
    prompt=CompletionPrompt(
        prefix="Once upon a time there was a unicorn. ",
    ),
    profile=Profile.GRAZIE_CHAT_LLAMA_V2_7b,
)

With suffix:

client.v8.complete(
    prompt=CompletionPrompt(
        prefix="Once upon a time there was a unicorn. ",
        suffix=" And they lived happily ever after!"
    ),
    profile=Profile.GRAZIE_CHAT_LLAMA_V2_7b,
)

Chat

client.v8.chat(
    chat=ChatPrompt()
        .add_system("You are a helpful assistant.")
        .add_user("Who won the world series in 2020?"),
    profile=Profile.OPENAI_CHAT_GPT
)

Additionally you can pass id of your prompt or feature via prompt_id parameter. This identifier can later be used to check spending and calculate price of the feature per user or per call.

If you develop prompt which should answer in a structured format (i.e. JSON) it's better to pass temperature = 0. This makes generation deterministic (almost) and will provide parsable responses more reliably.

client.v8.chat(
    chat=ChatPrompt()
        .add_system("You are a helpful assistant.")
        .add_user("Who won the world series in 2020?"),
    profile=Profile.OPENAI_CHAT_GPT,
    parameters={
        LLMParameters.Temperature: Parameters.FloatValue(0.0)
    }
)

Note: this parameter is currently only supported for OpenAI models.

Streaming

Outputs from chat models can be slow, to show progress to a user you can call chat_stream. The output would be a stream of text chunks.

response = ""
for chunk in client.v8.chat_stream(
    chat=ChatPrompt()
        .add_user("Who won the world series in 2020?")
        .add_assistant("The Los Angeles Dodgers won the World Series in 2020.")
        .add_user("Where was it played? Write a small poem about it!"),
    profile=Profile.OPENAI_CHAT_GPT
):
    response += chunk.content

Tool use

Here's an example of the tool usage workflow. For more information, please see the documentation

geo_tool = (
    ToolDefinition(
        name="current_temperature",
        description="Get the current temperature for the given location",
    )
    .add_parameter(
        name="latitude",
        description="The latitude of the location",
        _type=ToolDefinition.ToolParameterTypes.STRING,
        required=True,
    )
    .add_parameter(
        name="longitude",
        description="The longitude of the location",
        _type=ToolDefinition.ToolParameterTypes.STRING,
        required=True,
    )
)

chat_response = client.v8.chat(
    prompt_id="tool_call",
    profile=Profile.OPENAI_CHAT_GPT,
    chat=ChatPrompt()
    .add_system("You are an assistant that uses tools to answer user questions accurately.")
    .add_user("What is the current temperature in Amsterdam?"),
    parameters={
        LLMParameters.Tools: Parameters.JsonValue.from_tools(geo_tool),
        LLMParameters.ToolChoiceRequired: Parameters.BooleanValue(True),
    },
)

content = chat_response.content
tool = chat_response.responses[0].tool_calls[0]

url_params = "&".join(f"{key}={value}" for key, value in json.loads(content).items())
url_params = "&".join([url_params, "current=temperature"])
# The final URL should look like
#   https://api.open-meteo.com/v1/forecast?latitude=52.3676&longitude=4.9041&current=temperature

meteo_response = requests.get(f"https://api.open-meteo.com/v1/forecast?{url_params}").text

final_response = client.v8.chat(
    prompt_id="tool_call",
    profile=Profile.OPENAI_CHAT_GPT,
    chat=ChatPrompt()
    .add_user("What is the current temperature in Amsterdam?")
    .add_tool(
        id=tool.id,
        tool_name=tool.name,
        content=tool.content,
        result=meteo_response,
    ),
    parameters={
        LLMParameters.Tools: Parameters.JsonValue.from_tools(geo_tool),
    },
)

print(final_response.content)

Embeddings

You can also use api to build float vector embeddings for sentences and texts.

client.embed(
    request=EmbeddingRequest(texts=["Sky is blue."], model="sentence-transformers/LaBSE", format_cbor=True)
)

Note: use cbor format for production applications. Pass format_cbor=False only to simplify development initially as the answer will be provided as json.

Additionally, you can use openai embeddings:

client.llm_embed(
    request=LLMEmbeddingRequest(
        texts=["Sky is blue."],
        profile=Profile.OPENAI_EMBEDDING_LARGE,
        dimensions=768
    )
)

Question Answering

You can run question answering against corpus of documents, like documentation or Youtrack issues.

response = ""
for chunk in grazie_api.answer_stream(
    query="How to write a coroutine?", 
    data_source="kotlin_1.9.23"
):
    if chunk.chunk.summaryChunk:
        response += chunk.chunk.summaryChunk

You can find the list of available data sources on https://try.ai.intellij.net/qa

Plain Retrieval

You can also run question answering against a corpus of documents, retrieving only raw documents:

client.retrieve(
    query="How to change a font size in Fleet?",
    data_source="jetbrains-fleet-1.36",
    profile=Profile.OPENAI_GPT_4_TURBO,
    size=10,
)

Or providing a list of prioritized data sources:

client.retrieve_v2(
    query="How to change a font size in Fleet?",
    config_name="fleet-ide",
    data_source_lists=[
        [
            PrioritizedSource(name="jetbrains-fleet-1.45", priority=0), 
            PrioritizedSource(name="jetbrains-fleet-1.46", priority=1), 
        ]
    ],
    profile=Profile.OPENAI_GPT_4_TURBO,
    size=10,
)

Grazie Api Gateway Client V2

The api client V2 for JetBrains AI Platform.

Implemented features

Tasks

Basic usage

Client is available in two flavours APIGatewayClient and AsyncAPIGatewayClient.

ApiGatewayClient

import os

from grazie.api.client_v2 import APIGatewayClient, GatewayEndpoint

api_key = os.getenv("GRAZIE_JWT_TOKEN")
client = APIGatewayClient(
    api_key=api_key,
    endpoint=GatewayEndpoint.STAGING,
)

# Fetch all available tasks in TaskAPI
print(client.tasks.roster())

AsyncApiGatewayClient

import asyncio
import os

from grazie.api.client_v2 import AsyncAPIGatewayClient, GatewayEndpoint


async def main():
    api_key = os.getenv("GRAZIE_JWT_TOKEN")
    client = AsyncAPIGatewayClient(
        api_key=api_key,
        endpoint=GatewayEndpoint.STAGING,
    )

    # Fetch all available tasks in TaskAPI
    print(await client.tasks.roster())

asyncio.run(main())

TaskAPI

Please refer to the client.tasks.roster() for the list of available task IDs. The roster output is in the format of <task-id>:<task-tag>

See the Swagger page to find parameters for the specific task.

Execute a task

from grazie.api.client_v2 import APIGatewayClient

client = APIGatewayClient()
client.tasks.execute(
    id="code-generate:default",
    parameters=dict(
        instructions="Write me a simple python script",
        prefix="",
        suffix="",
        language="python",
    )
)

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.3

Jul 28, 2025

0.3.2

Jul 2, 2025

0.3.1

Jun 10, 2025

0.3.0

Jun 3, 2025

0.2.1

May 6, 2025

0.1.17

Apr 15, 2025

0.1.15

Mar 12, 2025

0.1.14

Mar 6, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grazie_api_gateway_client-0.3.3.tar.gz (26.6 kB view details)

Uploaded Jul 28, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

grazie_api_gateway_client-0.3.3-py3-none-any.whl (37.5 kB view details)

Uploaded Jul 28, 2025 Python 3

File details

Details for the file grazie_api_gateway_client-0.3.3.tar.gz.

File metadata

Download URL: grazie_api_gateway_client-0.3.3.tar.gz
Upload date: Jul 28, 2025
Size: 26.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.3 CPython/3.10.18 Linux/5.15.0-1084-aws

File hashes

Hashes for grazie_api_gateway_client-0.3.3.tar.gz
Algorithm	Hash digest
SHA256	`29a3f0ce7d185b79a479bb83631f2790d924cefb3293748aa0fc3e032fab0d74`
MD5	`7d8f89172562f1bde4a79e5568baca2c`
BLAKE2b-256	`6c76f80883e178e78c71aafc553c6c8bb35e339fec478820149826737f5ae896`

See more details on using hashes here.

File details

Details for the file grazie_api_gateway_client-0.3.3-py3-none-any.whl.

File metadata

Download URL: grazie_api_gateway_client-0.3.3-py3-none-any.whl
Upload date: Jul 28, 2025
Size: 37.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.3 CPython/3.10.18 Linux/5.15.0-1084-aws

File hashes

Hashes for grazie_api_gateway_client-0.3.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`67cec74509ed7dd32c43588315f1c8b80ac29c203592f0d28a385b0ff1845bc8`
MD5	`938381ee3135f7369f27c536187f75d1`
BLAKE2b-256	`95f2f619b27860ddfaca964bb18e8bbc3aaae1d56df638d200da3cdc1d5759c7`

See more details on using hashes here.

grazie_api_gateway_client 0.3.3

Navigation

Verified details

Owner

Maintainers

Unverified details

Meta

Classifiers

Project description

Grazie Api Gateway Client

Usage

Profiles

Completion

Chat

Streaming

Tool use

Embeddings

Question Answering

Plain Retrieval

Grazie Api Gateway Client V2

Implemented features

Basic usage

ApiGatewayClient

AsyncApiGatewayClient

TaskAPI

Execute a task

Project details

Verified details

Owner

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes