sutro

Sutro Python SDK

These details have not been verified by PyPI

Project links

Project description

Sutro Logo PyPI - Version PyPI - Downloads

Sutro makes it easy to analyze and generate unstructured data using LLMs, from quick experiments to billion token jobs.

Whether you're generating synthetic data, running model evals, structuring unstructured data, classifying data, or generating embeddings - batch inference is faster, cheaper, and easier with Sutro.

Visit sutro.sh to learn more and request access to the cloud beta.

🚀 Quickstart

Install:

[uv] pip install sutro

Authenticate:

sutro login

Run your first job:

import sutro as so
import polars as pl
from pydantic import BaseModel

# Load your data
df = pl.DataFrame({
    "review": [
        "The battery life is terrible.",
        "Great camera and build quality!",
        "Too expensive for what it offers."
    ]
})

# Add a system prompt (optional)
system_prompt = "Classify the sentiment of the review as positive, neutral, or negative."

# Define an output schema (optional)
class Sentiment(BaseModel):
    sentiment: str

# Run a prototyping (p0) job
df = so.infer(
    df,
    column="review",
    model="qwen-3-32b",
    output_schema=Sentiment
)

print(df)

Will produce a result like:

Prototyping Job Result

Scaling up:

# load a larger dataset
df = pl.read_parquet('hf://datasets/sutro/synthetic-product-reviews-20k/results.parquet')

# Run a production (p1) job
job_id = so.infer(
    df,
    column="review_text",
    model="qwen-3-32b",
    output_schema=Sentiment,
    job_priority=1 # <-- one line of code for near-limitless scale
)

You can track live progress of your job, view results, and share with your team from the Sutro web app:

Production Job Result

What is Sutro?

Sutro is a serverless, high-throughput batch inference service for LLM workloads. With just a few lines of Python, you can quickly run batch inference jobs using open-source foundation models—at scale, with strong cost/time guarantees, and without worrying about infrastructure.

Think of Sutro as online analytical processing (OLAP) for AI: you submit queries over unstructured data (documents, emails, product reviews, etc.), and Sutro handles the heavy lifting of job execution - from intelligent batching to cloud orchestration to inference framework and hardware optimizations. You just bring your data, and Sutro handles the rest.

📚 Documentation & Examples

Documentation
Example Guides:
- Synthetic Data Zero to Hero
- Synthetic Data for Privacy Preservation
- Large Scale Embedding Generation with Qwen3 0.6B
- More coming soon...

✨ Features

⚡ Run experiments faster Small scale jobs complete in minutes, large scale jobs run within 1 hour - more than 20x faster than competing cloud services.
📈 Seamless scaling
Use the same interface to run jobs with a few tokens, or billions at a time.
💰 Decreased Costs and Transparent Pricing
Up to 10x cheaper than alternative inference services. Use dry run mode to estimate costs before running large jobs.
🐍 Pythonic DataFrame and file integrations
Submit and receive results directly as Pandas/Polars DataFrames, or upload CSV/Parquet files.
🏗️ Zero infrastructure setup
No need to manage GPUs, tune inference frameworks, or orchestrate parallelization. Just data in, results out.
📊 Real-time observability dashboard Use the Sutro web app to monitor your jobs in real-time and see results as they are generated, tag jobs for easier tracking, and share results with your team.
🔒 Built with security in mind
Custom data retention options, and bring-your-own s3-compatible storage options available.

🧑‍💻 Typical Use Cases

Synthetic data generation: Create millions of product reviews, conversations, or paraphrases for pre-training or distillation.
Model evals: Easily run LLM benchmarks on a scheduled basis to detect model regressions or performance degradation.
Unstructured data analytics: Run analytical workloads over unstructured data (e.g. customer reviews, product descriptions, emails, etc.).
Semantic tagging: Add boolean/numeric/closed-set tags to messy data (e.g. LinkedIn bios, company descriptions).
Structured Extraction: Pull structured fields out of unstructured documents at scale.
Classification: Apply consistent labels across large datasets (spam, sentiment, topic, compliance risk).
Embedding generation: Generate and store embeddings for downstream search/analytics.

🔌 Integrations

DataFrames: Pandas, Polars
Files: CSV, Parquet
Storage: S3-Compatible Object Stores (e.g. R2, S3, GCS, etc.)

📦 Hosting Options

Cloud: Run Sutro on our secure, multi-tenant cloud.
Isolated Deployments: Bring your own storage, models, or cloud resources.
Local and Self-Hosted: Coming soon!

See our pricing page for more details.

🤝 Contributing

We welcome contributions! Please reach out to us at team@sutro.sh to get involved.

📄 License

Apache 2.0

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.58

May 11, 2026

0.1.57

Mar 17, 2026

0.1.56

Mar 6, 2026

0.1.55

Feb 18, 2026

0.1.54

Feb 10, 2026

0.1.53

Jan 28, 2026

0.1.52

Jan 15, 2026

0.1.51

Jan 8, 2026

0.1.50

Jan 8, 2026

0.1.43

Nov 14, 2025

0.1.42

Nov 11, 2025

0.1.41

Nov 11, 2025

0.1.40

Nov 6, 2025

0.1.39

Nov 5, 2025

0.1.38

Nov 1, 2025

0.1.37

Oct 14, 2025

0.1.36

Oct 13, 2025

0.1.35

Oct 4, 2025

0.1.34

Oct 2, 2025

This version

0.1.33

Sep 25, 2025

0.1.32

Sep 24, 2025

0.1.31

Sep 19, 2025

0.1.30

Sep 5, 2025

0.1.29

Sep 4, 2025

0.1.28

Aug 13, 2025

0.1.27

Aug 7, 2025

0.1.26

Aug 6, 2025

0.1.25

Jul 22, 2025

0.1.24

Jul 22, 2025

0.1.23

Jul 22, 2025

0.1.22

Jul 17, 2025

0.1.21

Jul 16, 2025

0.1.20

Jul 10, 2025

0.1.19

Jul 8, 2025

0.1.18

Jun 25, 2025

0.1.17

Jun 13, 2025

0.1.16

Jun 2, 2025

0.1.15

May 22, 2025

0.1.14

May 21, 2025

0.1.13

May 9, 2025

0.1.12

May 9, 2025

0.1.11

May 9, 2025

0.0.0

Apr 20, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sutro-0.1.33.tar.gz (22.1 kB view details)

Uploaded Sep 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sutro-0.1.33-py3-none-any.whl (23.7 kB view details)

Uploaded Sep 25, 2025 Python 3

File details

Details for the file sutro-0.1.33.tar.gz.

File metadata

Download URL: sutro-0.1.33.tar.gz
Upload date: Sep 25, 2025
Size: 22.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.15

File hashes

Hashes for sutro-0.1.33.tar.gz
Algorithm	Hash digest
SHA256	`ba8f24513a02eae68972bf58c7a445a3e0d3ab8088162cd5759b40078c0cae38`
MD5	`ffdd770e99e747aae9f9adac38217b6e`
BLAKE2b-256	`f941de9add5bf1366809ff52e771862196ba6db14a20a602acfa1df58c6dd770`

See more details on using hashes here.

File details

Details for the file sutro-0.1.33-py3-none-any.whl.

File metadata

Download URL: sutro-0.1.33-py3-none-any.whl
Upload date: Sep 25, 2025
Size: 23.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.15

File hashes

Hashes for sutro-0.1.33-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9843f0cea010eef1ab2bcb23073f144efe64454a64b577cc19acb8870b87e6c0`
MD5	`f4339ee8e7dc98ce21cea59385ff76d3`
BLAKE2b-256	`94b05ad31813304c5c46cef8990fc77d5455121b56287e674e35d5082633f3a4`

See more details on using hashes here.

sutro 0.1.33

Navigation

Verified details

Owner

Maintainers

Unverified details

Project links

Meta

Project description

🚀 Quickstart

Run your first job:

Scaling up:

What is Sutro?

📚 Documentation & Examples

✨ Features

🧑‍💻 Typical Use Cases

🔌 Integrations

📦 Hosting Options

🤝 Contributing

📄 License

Project details

Verified details

Owner

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes