OpenTelemetry instrumentation for Together AI - chat, completions, and embeddings

These details have not been verified by PyPI

Project description

TraceAI Together Instrumentation

OpenTelemetry instrumentation for Together AI - chat completions, completions, and embeddings APIs.

Installation

pip install traceai-together

Features

Automatic tracing of Together AI API calls
Support for chat completions, completions, and embeddings endpoints
Streaming response support for both sync and async clients
Token usage tracking
Tool/function calling support
Full OpenTelemetry semantic conventions compliance

Usage

Basic Setup

import together
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import ConsoleSpanExporter, SimpleSpanProcessor

from traceai_together import TogetherInstrumentor

# Set up tracing
provider = TracerProvider()
provider.add_span_processor(SimpleSpanProcessor(ConsoleSpanExporter()))
trace.set_tracer_provider(provider)

# Instrument Together AI
TogetherInstrumentor().instrument(tracer_provider=provider)

# Use Together AI
client = together.Together(api_key="your-api-key")
response = client.chat.completions.create(
    model="meta-llama/Llama-3-8b-chat-hf",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

Chat Completions

import together

client = together.Together()

# Simple chat
response = client.chat.completions.create(
    model="meta-llama/Llama-3-8b-chat-hf",
    messages=[{"role": "user", "content": "What is machine learning?"}],
    max_tokens=512,
)
print(response.choices[0].message.content)

# With system message
response = client.chat.completions.create(
    model="meta-llama/Llama-3-8b-chat-hf",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Explain quantum computing."},
    ],
    temperature=0.7,
)

Streaming Chat

import together

client = together.Together()

# Streaming response
stream = client.chat.completions.create(
    model="meta-llama/Llama-3-8b-chat-hf",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True,
)

for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)

Completions (Legacy)

import together

client = together.Together()

# Text completion
response = client.completions.create(
    model="meta-llama/Llama-3-8b-hf",
    prompt="The quick brown fox",
    max_tokens=50,
)
print(response.choices[0].text)

Embeddings

import together

client = together.Together()

# Generate embeddings
response = client.embeddings.create(
    model="togethercomputer/m2-bert-80M-8k-retrieval",
    input=["Hello world", "Machine learning is great"],
)
print(f"Generated {len(response.data)} embeddings")
print(f"Dimensions: {len(response.data[0].embedding)}")

Async Client

import asyncio
import together

async def main():
    client = together.AsyncTogether()

    # Async chat completion
    response = await client.chat.completions.create(
        model="meta-llama/Llama-3-8b-chat-hf",
        messages=[{"role": "user", "content": "Hello!"}],
    )
    print(response.choices[0].message.content)

    # Async streaming
    stream = await client.chat.completions.create(
        model="meta-llama/Llama-3-8b-chat-hf",
        messages=[{"role": "user", "content": "Tell me a joke"}],
        stream=True,
    )

    async for chunk in stream:
        if chunk.choices[0].delta.content:
            print(chunk.choices[0].delta.content, end="", flush=True)

asyncio.run(main())

Tool/Function Calling

import together
import json

client = together.Together()

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the weather for a location",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city name",
                    }
                },
                "required": ["location"],
            },
        },
    }
]

response = client.chat.completions.create(
    model="meta-llama/Llama-3-8b-chat-hf",
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    tools=tools,
    tool_choice="auto",
)

if response.choices[0].message.tool_calls:
    for tool_call in response.choices[0].message.tool_calls:
        print(f"Tool: {tool_call.function.name}")
        print(f"Arguments: {tool_call.function.arguments}")

Configuration Options

TraceConfig

from fi_instrumentation import TraceConfig
from traceai_together import TogetherInstrumentor

config = TraceConfig(
    hide_inputs=False,
    hide_outputs=False,
)

TogetherInstrumentor().instrument(
    tracer_provider=provider,
    config=config
)

Captured Attributes

Chat Completions Attributes

Attribute	Description
`fi.span.kind`	"LLM"
`llm.system`	"together"
`llm.provider`	"together"
`llm.model`	Model name (e.g., meta-llama/Llama-3-8b-chat-hf)
`llm.token_count.prompt`	Input token count
`llm.token_count.completion`	Output token count
`llm.token_count.total`	Total token count
`llm.input_messages`	Input messages array
`llm.output_messages`	Output messages array
`llm.invocation_parameters`	Model parameters (temperature, max_tokens, etc.)

Completions Attributes

Attribute	Description
`fi.span.kind`	"LLM"
`llm.system`	"together"
`llm.model`	Model name
`llm.token_count.prompt`	Input token count
`llm.token_count.completion`	Output token count
`input.value`	Input prompt
`output.value`	Generated text

Embeddings Attributes

Attribute	Description
`fi.span.kind`	"EMBEDDING"
`llm.system`	"together"
`embedding.model`	Embedding model name
`together.texts_count`	Number of texts embedded
`together.embeddings_count`	Number of embeddings returned
`together.embedding_dimensions`	Vector dimensions

Available Models

Together AI provides access to many open-source models. Some popular ones include:

Category	Models
Chat	`meta-llama/Llama-3-8b-chat-hf`, `meta-llama/Llama-3-70b-chat-hf`, `mistralai/Mixtral-8x7B-Instruct-v0.1`
Completions	`meta-llama/Llama-3-8b-hf`, `meta-llama/Llama-3-70b-hf`
Embeddings	`togethercomputer/m2-bert-80M-8k-retrieval`, `BAAI/bge-large-en-v1.5`

Real-World Use Cases

RAG Pipeline

import together

client = together.Together()

# Step 1: Generate embeddings for documents
docs = ["Document 1 content", "Document 2 content", "Document 3 content"]
doc_embeddings = client.embeddings.create(
    model="togethercomputer/m2-bert-80M-8k-retrieval",
    input=docs,
)

# Step 2: Generate embedding for query
query = "What is the main topic?"
query_embedding = client.embeddings.create(
    model="togethercomputer/m2-bert-80M-8k-retrieval",
    input=[query],
)

# Step 3: Find relevant docs (using cosine similarity - not shown)
relevant_docs = docs[:2]  # Simplified

# Step 4: Generate response with context
response = client.chat.completions.create(
    model="meta-llama/Llama-3-8b-chat-hf",
    messages=[
        {"role": "system", "content": f"Context: {' '.join(relevant_docs)}"},
        {"role": "user", "content": query},
    ],
)
print(response.choices[0].message.content)

Multi-turn Conversation

import together

client = together.Together()

messages = []

def chat(user_message):
    messages.append({"role": "user", "content": user_message})

    response = client.chat.completions.create(
        model="meta-llama/Llama-3-8b-chat-hf",
        messages=messages,
    )

    assistant_message = response.choices[0].message.content
    messages.append({"role": "assistant", "content": assistant_message})

    return assistant_message

# Have a conversation
print(chat("My name is Alice"))
print(chat("What's my name?"))

License

Apache-2.0

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Mar 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

traceai_together-0.1.0.tar.gz (12.0 kB view details)

Uploaded Mar 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

traceai_together-0.1.0-py3-none-any.whl (13.5 kB view details)

Uploaded Mar 10, 2026 Python 3

File details

Details for the file traceai_together-0.1.0.tar.gz.

File metadata

Download URL: traceai_together-0.1.0.tar.gz
Upload date: Mar 10, 2026
Size: 12.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.21 {"installer":{"name":"uv","version":"0.9.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for traceai_together-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`a406cc4d9a391052e03f5a82d41a966977580650fd35ae715a99487a9ca0d304`
MD5	`7f0f823dbd08891ec9d2b03c92116d9d`
BLAKE2b-256	`eae02fec55131e8b55294625a33d1bfbb69e23d7a92d0287f4800378fd05150c`

See more details on using hashes here.

File details

Details for the file traceai_together-0.1.0-py3-none-any.whl.

File metadata

Download URL: traceai_together-0.1.0-py3-none-any.whl
Upload date: Mar 10, 2026
Size: 13.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.21 {"installer":{"name":"uv","version":"0.9.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for traceai_together-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ec8fa6302c5c0610008dce672a80a077bc1bd1f50aa9e86ab13966a26da3d20d`
MD5	`dbd3ffe38552766b877f2f5267b7a971`
BLAKE2b-256	`3e3c2488dfc7cdbdbf9503e30a048fb9771621f3a5c09a0514c051615f30218e`

See more details on using hashes here.

traceAI-together 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

TraceAI Together Instrumentation

Installation

Features

Usage

Basic Setup

Chat Completions

Streaming Chat

Completions (Legacy)

Embeddings

Async Client

Tool/Function Calling

Configuration Options

TraceConfig

Captured Attributes

Chat Completions Attributes

Completions Attributes

Embeddings Attributes

Available Models

Real-World Use Cases

RAG Pipeline

Multi-turn Conversation

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes