A transpiler from stateful imperative workflows to declarative DSPy programs

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

⚡ dspyer

Reliable, optimizable LLM steps with zero DSPy boilerplate: typed outputs, automatic self-correction, and one-call prompt tuning.

dspyer Architecture Flow

Why dspyer?

If you are building production agents with LangChain, LangGraph, or custom LLM API loops, you face three primary challenges:

Prompt Decay: When you upgrade models (e.g., from GPT-4o to Claude 3.5 Sonnet), your carefully engineered prompt strings fail. They need manual, tedious re-tuning.
Brittle Validations: You write verbose try/except loops and custom logic to catch malformed JSON and missing fields from the LLM.
No Systematic Tuning: There is no simple way to optimize prompts programmatically or automatically select the best few-shot exemplars for your specific tasks.

Stanford DSPy solves this by treating prompts as parameters that can be compiled and optimized against a dataset. However, adopting DSPy directly requires learning a complex new syntax (Signatures, Predictors, Modules) and rewriting your entire codebase.

dspyer acts as an ergonomic bridge: it transpiles standard Python functions, Pydantic schemas, and agent graphs into optimized dspy.Module instances under the hood, allowing you to drop them straight back into your existing orchestrator. You write standard, PEP 484 type-hinted Python functions; dspyer compiles them into optimizable dspy.Module objects you can hand to any DSPy teleprompter.

Key Benefits

No vendor lock-in: Compiles to a standard dspy.Module; use any DSPy optimizer and dspy.save/load.
Self-correction loops: Failed Pydantic validation auto-generates feedback and re-queries the model until it conforms.
Telemetry and validation reports: OpenTelemetry spans plus per-node failure summaries.
Dataset flywheel: Successful self-corrections are logged as input/output pairs you can replay as a trainset.
DirectLM runtime: Bypasses LiteLLM with persistent pooled HTTP connections.

Each is shown with runnable code under Core Capabilities.

Install

Install standard releases directly from PyPI:

pip install dspyer
# or using uv:
uv add dspyer

Alternatively, install the latest pre-release directly from GitHub:

pip install git+https://github.com/theramkm/dspyer.git
# or using uv:
uv add git+https://github.com/theramkm/dspyer.git

Quickstart: Self-Correction in 30 Seconds (No API Key)

This runs completely offline using a mock model backend. The node contract requires an answer with at least one citation. The mock "forgets" the citation on the first try, fails validation, receives the correction feedback, and successfully repairs itself.

import dspy
from pydantic import BaseModel, Field, field_validator
from dspy_transpiler.graph import Graph, StatefulNode
from dspy_transpiler.compiler import AgentTranspiler, MockCompletionResult

# 1. Describe the schema contract you want the LLM to honor
class Query(BaseModel):
    query: str

class RAGResponse(BaseModel):
    answer: str = Field(description="Answer referencing the sources")
    citations: list[str] = Field(description="Sources cited, e.g. ['doc_1']")

    @field_validator("citations")
    @classmethod
    def must_cite(cls, v):
        if not v:  # Ensure we cite at least one source
            raise ValueError("Answer must cite at least one source.")
        return v

# 2. Define an optimizable, self-correcting node
node = StatefulNode(
    "Synthesizer", Query, RAGResponse,
    instructions="Answer the query and cite sources.",
    max_retries=3,
)
graph = Graph()
graph.add_node(node)
graph.set_entry_point("Synthesizer")
program = AgentTranspiler.compile(graph)

# 3. Offline mock: configuration and run
# (Hiding MockLM details for readability; click below to expand)

Click to view MockLM configuration (for offline testing)

class MockLM(dspy.LM):
    def __init__(self): super().__init__(model="mock")
    def forward(self, prompt=None, messages=None, **kw):
        saw_feedback = "feedback" in str(prompt or messages)
        good = '{"answer": "Apache-2.0 [doc_1].", "citations": ["doc_1"]}'
        bad  = '{"answer": "Apache-2.0.", "citations": []}'
        return MockCompletionResult(good if saw_feedback else bad, "mock")

dspy.configure(lm=MockLM())

r = program(query="What license is dspyer under?")

print("Answer:   ", r.answer)                                   # Apache-2.0 [doc_1].
print("Citations:", r.citations)                                # ['doc_1']
print("Self-correction loops:", r["_metadata"]["refinement_steps_taken"])  # 1

Live Run: Run python examples/quickstart.py to run this against a live provider (OpenAI, Gemini, Ollama, Anthropic).
Offline Example: Try python examples/run_rag_verifier.py to test detailed verification logic.

Core Capabilities

1. Zero-Boilerplate Decorator

Wrap any plain typed Python function. The parameters map to inputs, the docstring acts as instructions, and the return annotation defines the schema:

from dspy_transpiler import self_correcting
from pydantic import BaseModel

class SolverOutput(BaseModel):
    answer: str
    steps: list[str]

@self_correcting(max_retries=3)
def solve(question: str) -> SolverOutput:
    """Answer the question and outline the logic steps."""
    # Body is intentionally empty; dspyer generates the call from the signature
    pass

# Returns a SolverOutput instance
result = solve(question="What is the capital of France?")

You can also decorate standard dspy.Module classes to automatically wrap nested predictors:

@self_correcting(schema=SolverOutput, max_retries=3)
class Solver(dspy.Module):
    def __init__(self):
        super().__init__()
        self.solve = dspy.Predict("question -> answer, steps")

    def forward(self, question):
        return self.solve(question=question)

2. Prompt Optimization (Tune, Save, Load)

Compile your transpiled program, optimize against a dataset using any DSPy teleprompter, and save the serialized config to JSON:

from dspy.teleprompt import BootstrapFewShot

def metric(example, pred, trace=None) -> bool:
    return example.sentiment.lower() == pred.sentiment.lower()

optimizer = BootstrapFewShot(metric=metric, max_bootstrapped_demos=2)
optimized = optimizer.compile(program, trainset=trainset)

# Save prompts
optimized.save_prompts("agent_config.json")

# Load in production
production_program.load_prompts("agent_config.json")

On a bundled sentiment benchmark (examples/benchmark.py, run with a simulated backend), optimization lifts accuracy 60% → 90%, tuning only the reasoning node.

3. Orchestrator Integration (LangGraph)

You do not need to replace your orchestrator. You can compile individual dspyer nodes and invoke them inside existing LangGraph nodes:

compiled_agent = AgentTranspiler.compile(graph)

def run_agent_node(state):
    pred = compiled_agent(query=state["user_query"])
    return {"agent_response": pred.answer, "citations": pred.citations}

Alternatively, scaffold an entire LangGraph StateGraph topology into a dspyer.Graph automatically. Non-LLM nodes are preserved as native Python passthroughs:

from dspy_transpiler import from_langgraph

node_mappings = {
    "Clean": StatefulNode("Clean", CleanInput, CleanOutput, instructions="Normalize the query"),
    "Solve": StatefulNode("Solve", SolveInput, SolveOutput, instructions="Answer the query"),
}
graph = from_langgraph(builder, node_mappings=node_mappings)
program = AgentTranspiler.compile(graph)

4. Telemetry & Validation Reporting

Enable validation logging to capture production failure metadata:

program = AgentTranspiler.compile(graph, validation_log_path="logs/validation.jsonl")

Generate a summary report detailing per-node error rates and failing Pydantic fields:

from dspy_transpiler.utils import generate_validation_report

print(generate_validation_report("logs/validation.jsonl"))

Example report:

==================================================
           dspyer Batch Validation Report
==================================================

Node: Synthesizer
--------------------------------------------------
  Total Runs: 10
  Successful Runs: 8 (80.0%)
  Failed Runs: 2 (20.0%)
  Retry Rate: 40.0% (4/10 runs required retries)
  Average Retries: 0.80 per run
  Top Failing Pydantic Fields:
    - citations: 4 errors (66.7% of total errors)
    - answer: 2 errors (33.3% of total errors)

==================================================

5. Self-Correction Dataset Flywheel

Configure dataset_log_path on either the @self_correcting decorator or during transpilation compilation to capture successful self-correction runs (saving the initial input and the final corrected output):

program = AgentTranspiler.compile(graph, dataset_log_path="logs/flywheel.jsonl")

Then, load the logged executions using load_logged_dataset to dynamically generate a clean training dataset of dspy.Example objects:

from dspy_transpiler.utils import load_logged_dataset

# We must specify which keys act as model inputs
trainset = load_logged_dataset(
    dataset_log_path="logs/flywheel.jsonl",
    input_keys=["query"]
)

6. Escape Hatch Node Decorator (`@dspyer_node`)

Avoid brittle AST static analysis on complex node callables by using the @dspyer_node decorator. It explicitly defines a node contract, instructions, and schemas directly on functions:

from dspy_transpiler import dspyer_node

class ExtractorInput(BaseModel):
    query: str

class ExtractorOutput(BaseModel):
    entities: list[str]

@dspyer_node(
    input_model=ExtractorInput,
    output_model=ExtractorOutput,
    instructions="Extract named entities from the user query."
)
def extract_entities_node(state):
    # This node is explicitly registered with its typing contract
    # Bypasses AST static analysis during LangGraph conversion
    pass

7. Async & Streaming Pipelines

For concurrent web environments (like FastAPI), compile programs to execute asynchronously via aforward or stream intermediate events via astream:

program = AgentTranspiler.compile(graph, output_model=ExtractorOutput)

# 1. Async forward call
result = await program.aforward(query="Alice and Bob went to Paris")
print(result.entities)

# 2. Async event streaming
async for event in program.astream(query="Alice and Bob went to Paris"):
    print(f"Event: {event['event']} | Node: {event.get('node')}")

8. Pluggable Storage Adapters

Register custom thread-safe storage engines for production dataset logging and validation reporting using the BaseStorageAdapter interface. By default, it falls back to a thread-pooled, non-blocking FileStorageAdapter:

from dspy_transpiler.utils import BaseStorageAdapter, set_storage_adapter

class CustomDatabaseAdapter(BaseStorageAdapter):
    def append_line(self, target: str, line: str) -> None:
        # Custom synchronous DB write
        db.insert(target, line)

    async def append_line_async(self, target: str, line: str) -> None:
        # Custom non-blocking async DB write
        await db.async_insert(target, line)

# Register custom adapter globally
set_storage_adapter(CustomDatabaseAdapter())

Additional References

Feature	Summary
`use_cot=True`	Injects chain-of-thought rationales dynamically without polluting output schemas.
`ImmutableState.merge()`	Standard merge policies (`last_write_wins`, `combine_lists`, `raise`) to reconcile parallel branches.
`StatefulNode` parameters	Per-node `max_retries` and custom `refine_instructions` configurations.
`@dspyer_node`	Bypasses graph AST parsing with explicit input/output schema metadata declarations.
`aforward` / `astream`	Non-blocking async execution and fine-grained graph step streaming.
Copy-on-Write (COW)	High-speed dictionary state patching that preserves untouched branches.
Pluggable Storage	Thread-safe database and custom file adapters for production telemetry log sinks.

Project Status

Stable release (0.3.2), actively developed. Green CI across Python 3.10 to 3.14, fully type-checked (mypy) and linted (ruff), with a 69-case test suite. Issues and PRs are welcome.

License

Apache License 2.0.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

theramkm

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.6

Jun 28, 2026

0.3.5

Jun 24, 2026

0.3.4

Jun 24, 2026

0.3.3

Jun 23, 2026

This version

0.3.2

Jun 23, 2026

0.3.1

Jun 23, 2026

0.3.0

Jun 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dspyer-0.3.2.tar.gz (1.0 MB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

dspyer-0.3.2-py3-none-any.whl (40.9 kB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file dspyer-0.3.2.tar.gz.

File metadata

Download URL: dspyer-0.3.2.tar.gz
Upload date: Jun 23, 2026
Size: 1.0 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dspyer-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`1037a2c5f73e544bfdc92e9fce16b49dbc400c63afd3546d281b138c51141b7c`
MD5	`081e37cb987db5807206d4730175f9a8`
BLAKE2b-256	`ee243de55ab353dee908fac799f8c50f4731fb78b65ab4f93ca6cde5df0ce1bd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for dspyer-0.3.2.tar.gz:

Publisher: release.yml on theramkm/dspyer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dspyer-0.3.2.tar.gz
- Subject digest: 1037a2c5f73e544bfdc92e9fce16b49dbc400c63afd3546d281b138c51141b7c
- Sigstore transparency entry: 1931396867
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: theramkm/dspyer@84cd50b07825fe1a621b375c8d7f879786d871cf
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/theramkm
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@84cd50b07825fe1a621b375c8d7f879786d871cf
- Trigger Event: push

File details

Details for the file dspyer-0.3.2-py3-none-any.whl.

File metadata

Download URL: dspyer-0.3.2-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 40.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dspyer-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a650845048d875ad4bfd0ea0bab44ba6386038728177b1b365efe5be609447b6`
MD5	`8190416e11c46262c0b268f6ef4780e1`
BLAKE2b-256	`c8d59235a8c4e49fe3f270c9105589799fe11cb7190a00809404a765ec63d488`

See more details on using hashes here.

Provenance

The following attestation bundles were made for dspyer-0.3.2-py3-none-any.whl:

Publisher: release.yml on theramkm/dspyer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dspyer-0.3.2-py3-none-any.whl
- Subject digest: a650845048d875ad4bfd0ea0bab44ba6386038728177b1b365efe5be609447b6
- Sigstore transparency entry: 1931397144
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: theramkm/dspyer@84cd50b07825fe1a621b375c8d7f879786d871cf
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/theramkm
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@84cd50b07825fe1a621b375c8d7f879786d871cf
- Trigger Event: push

dspyer 0.3.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

⚡ dspyer

Why dspyer?

Key Benefits

Install

Quickstart: Self-Correction in 30 Seconds (No API Key)

Core Capabilities

1. Zero-Boilerplate Decorator

2. Prompt Optimization (Tune, Save, Load)

3. Orchestrator Integration (LangGraph)

4. Telemetry & Validation Reporting

5. Self-Correction Dataset Flywheel

6. Escape Hatch Node Decorator (@dspyer_node)

7. Async & Streaming Pipelines

8. Pluggable Storage Adapters

Additional References

Project Status

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

6. Escape Hatch Node Decorator (`@dspyer_node`)