Security and reproducibility-first workflow orchestrator for research environments

These details have not been verified by PyPI

Project links

Project description

Hillstar Orchestrator v1.0.0

Hillstar Logo

API Documentation | User Manual | Setup Guide

A security and reproducibility-first workflow orchestration tool

Hillstar is an open-source workflow orchestrator built for scientific research labs and any environment where reproducibility and auditability are non-negotiable. Most worflow management tools in this space are designed for data/software engineering teams. Hillstar keeps the underlying rigor, but is built for researchers, analysts, and teams in regulated environments who need to prove what happened, when, and why—without requiring a background in DevOps.

Hillstar is designed with auditability, security, and governance as first-class concerns, so it fits naturally in tightly regulated environments.

The core design principle: explicit over implicit. No unrestricted API access. No magic. You define workflows as composable DAGs where each node performs one action and data flows explicitly between stages. Every decision is auditable: which model was called, what parameters were used, how much it cost, whether review was required before the output moved downstream.

This matters in environments where highly sensitive data is in use—teams working with human genomic data, clinical trials, or proprietary research—where governance isn't a nice-to-have, it's the foundation. Hillstar bakes in compliance checking, credential security, and comprehensive logging from the start.

Whether you are coordinating between multiple large language model (LLM) providers, integrating with custom or external agents via MCP servers, or running everything locally and offline, the same auditability guarantees apply.

Current Features (v1.0.0)

DAG-based workflows - Define complex research pipelines as directed acyclic graphs
Workflow visualization - Mermaid diagrams for GitHub, Obsidian, and Markdown
Multi-provider support - Integration with cloud and local models
Flexible model selection - Presets for cost, quality, and air-gapped setups
Full auditability - Comprehensive trace logs with model selection reasoning
Checkpoint/replay - Save state at key points, resume from checkpoints
Strict governance - Explicit permissions, no unrestricted API access
Air-gapped capability - Works offline with local models
Responsible AI focus - Explicit governance, compliance tracking, and audit trails
Credential Security - In-flight redaction of credentials, API keys, tokens, and PII
MCP Server Support - Optional MCP servers for integration with Claude Code and other tools

Quick Start

Installation & Setup

# Clone repository
git clone git@github.com:evoclock/hillstar-orchestrator.git
cd hillstar-orchestrator

# Install Hillstar
pip install -e .

# Verify installation
hillstar --version

# List available workflows
hillstar discover .

Run a Workflow

Create a workflow file or use the test example:

# Validate a workflow
hillstar validate examples/simple-workflow.json

# Execute
hillstar execute examples/simple-workflow.json

See docs/User_Manual.md for step-by-step examples of building workflows.

Output:

▶ Executing: examples/simple-workflow.json
📁 Output: ./.hillstar

 Workflow executed successfully

 Workflow ID: sample_workflow
 Trace file: .hillstar/trace_20260221_031505.jsonl

Governance & Development Mode

Hillstar enforces workflow-driven commits to ensure reproducibility. Three options:

Option 1: Execute a workflow, then commit (recommended)

hillstar execute workflow.json
git commit -m "[Fix] Bug fix with recent workflow"
# Success: Workflow execution marker allows commit

Option 2: Enable development mode for code-only work

# Toggle persistent development mode
hillstar mode dev
git commit -m "[Docs] Update README"
git commit -m "[Refactor] Clean up imports"

# Re-enable enforcement
hillstar mode normal

Option 3: One-time overrides

# Via environment variable
HILLSTAR_DEV_MODE=1 git commit -m "[Docs] Fix typo"

Architecture

Graph Execution Engine

Workflows are directed acyclic graphs (DAGs). Nodes execute in topological order.

Example DAG visualization:

graph TD
 load_schema["load_schema<br/>(file_read)"]
 style load_schema fill:#9C27B0,stroke:#6A1B9A,color:#fff
 load_pdf_corpus["load_pdf_corpus<br/>(file_read)"]
 style load_pdf_corpus fill:#9C27B0,stroke:#6A1B9A,color:#fff
 validate["validate<br/>(model_call, simple)"]
 style validate fill:#2196F3,stroke:#1565C0,color:#fff
 analyze["analyze<br/>(model_call, moderate)"]
 style analyze fill:#2196F3,stroke:#1565C0,color:#fff
 export["export<br/>(file_write)"]
 style export fill:#4CAF50,stroke:#2E7D32,color:#fff

 load_schema --> analyze
 load_pdf_corpus --> validate
 validate --> analyze
 analyze --> export

Color Legend:

🔵 Blue = model_call (AI/ML operations)
🟣 Purple = file_read (data input)
🟢 Green = file_write (data output)
🟠 Orange = script_run (custom operations)

Workflow Structure

Complete workflows require root-level configuration with DAG nodes:

{
 "id": "my_pipeline",
 "version": "1.0",
 "graph": {
 "nodes": {
 "analyze": {
 "tool": "model_call",
 "provider": "anthropic",
 "model": "claude-opus-4-6",
 "task": "Analyze data",
 "parameters": {
 "max_tokens": 4096
 }
 },
 "validate": {
 "tool": "script_run",
 "script": "./validate.py"
 }
 },
 "edges": [
 { "from": "analyze", "to": "validate" }
 ]
 },
 "provider_config": {
 "anthropic": {
 "tos_accepted": true,
 "audit_enabled": true,
 "restricted_use_acknowledged": true
 }
 }
}

Model Integration

Supported providers:

Cloud APIs: Anthropic (Claude), OpenAI (GPT), Mistral, Google (Gemini)
All use API keys/credentials (never embedded in workflows)
Local Models: Ollama, llama.cpp, Devstral, or any HTTP-compatible server
Custom Providers: Bring your own via wrapper scripts
Subscription mode: OpenAI only. Unlike Anthropic, OpenAI has decided to support access and usage of your subscription via third party harnesses/tools. A caveat worth mentioning is that if you are developing software, you should default to Cloud APIs for reliability.

Further information on subscription mode support can be found through the following links:

docs/MCP_TOPOLOGY.md
docs/OPENAI_HILLSTAR_SETUP.md

Model specification in workflows:

{
 "tool": "model_call",
 "provider": "anthropic",
 "model": "claude-opus-4-6",
 "parameters": {
 "system": "You are an expert in ...",
 "max_tokens": 4096
 }
}

Parameter Support Varies by Model:

Check model constraints before setting sampling parameters:

Anthropic Claude: Cannot use temperature and top_p simultaneously
OpenAI o-series & GPT-5: Do not support temperature (use reasoning_effort instead)
Google Gemini 3: Keep temperature at default (1.0) to avoid performance issues

See docs/PROVIDER_MODEL_REFERENCE.md for complete constraints by model and provider.

Model Selection & Presets

Hillstar provides flexible model selection with four preset strategies:

Four Built-in Presets:

minimize_cost - Cheapest models per complexity level
balanced - Mix of cost and quality
maximize_quality - Highest quality models regardless of cost
local_only - Air-gapped: Local models only (no cloud APIs)

Workflow Schema

See spec/workflow-schema.json for complete schema.

Nodes:

model_call - Call an LLM
file_read - Read a file
file_write - Write output
script_run - Execute a script
checkpoint - Save workflow state

Values Statement

Our Design Philosophy: As Meredith Whittaker warns in "AI agents are coming for your privacy", unconstrained AI agents pose significant privacy and autonomy risks. Hillstar is designed with explicit permissions, auditability, and governance boundaries to prevent systems operating with unrestricted access to user data and external systems.

NOT Supported (no exceptions):

xAI (Groq/Grok)
Palantir

Development

Project Structure

hillstar-orchestrator/
├── README.md # This file
├── LICENSE # Apache 2.0
├── requirements.txt # Python dependencies
├── pyproject.toml # Package configuration
├── .gitignore
│
├── cli.py # Command-line interface
│
├── config/ # Configuration management
│ ├── config.py
│ ├── config_manager.py
│ ├── model_selector.py
│ ├── provider_registry.py
│ └── provider_registry.default.json
│
├── execution/ # Workflow execution engine
│ ├── runner.py # Main orchestration
│ ├── node_executor.py # Node execution and provider chains
│ ├── model_selector.py # Model selection and fallback logic
│ ├── cost_manager.py # Cost tracking and budget enforcement
│ ├── config_validator.py # Configuration validation
│ ├── graph.py # DAG execution with topological ordering
│ ├── checkpoint.py # Checkpoint persistence
│ ├── trace.py # Execution tracing
│ └── observability.py # Comprehensive logging
│
├── governance/ # Compliance & policy
│ ├── compliance.py
│ ├── policy.py
│ ├── enforcer.py
│ ├── hooks.py
│ └── project_init.py
│
├── models/ # LLM provider integrations
│ ├── mcp_model.py
│ ├── anthropic_model.py
│ ├── anthropic_mcp_model.py
│ ├── anthropic_ollama_api_model.py
│ ├── openai_mcp_model.py
│ ├── mistral_api_model.py
│ ├── mistral_mcp_model.py
│ ├── ollama_mcp_model.py
│ └── devstral_local_model.py
│
├── workflows/ # Workflow discovery & validation
│ ├── validator.py
│ ├── discovery.py
│ ├── auto_discover.py
│ └── model_presets.py
│
├── utils/ # Utility functions
│ ├── credential_redactor.py
│ ├── exceptions.py
│ └── report.py
│
├── spec/ # Workflow JSON schema
│ └── workflow-schema.json
│
├── tests/ # Unit tests
│ ├── test_credential_redactor.py
│ ├── test_integration.py
│ ├── test_mcp_error_handling.py
│ └── test_workflow_execution.py
│
├── examples/ # Example workflows
│ ├── simple-workflow.json
│ └── multi-provider-workflow.json
│
├── docs/ # User documentation
│ ├── INSTALLATION.md
│ ├── QUICK_START.md
│ ├── USER_MANUAL.md
│ ├── PROVIDER_MODEL_REFERENCE.md
│ └── PROVIDER_SETUP.md
│
└── mcp-server/ # MCP server implementations
 ├── anthropic_mcp_server.py
 ├── openai_mcp_server.py
 ├── mistral_mcp_server.py
 └── ... (other MCP servers)

Local Development

# Clone and install
git clone git@github.com:evoclock/hillstar-orchestrator.git
cd hillstar-orchestrator

# Install in editable mode
pip install -e .

# Verify installation
hillstar --version

# Explore workflows
hillstar discover .

# Run test suite
pytest tests/ -v

Troubleshooting

API Key Issues

Error: "ANTHROPIC_API_KEY not found"

# Set your API key (replace with actual key)
export ANTHROPIC_API_KEY="sk-ant-..."

# Or use the interactive setup wizard
hillstar wizard

Error: "No valid credentials found for provider X"

Verify the env var is set: echo $PROVIDER_API_KEY
Check provider name spelling in workflow (e.g., anthropic not claude)
Run hillstar wizard to validate and save credentials

Model Issues

Error: "Unsupported parameter: 'temperature' not supported..."

Model does not support temperature (o3, o3-mini, GPT-5 series)
See docs/PROVIDER_MODEL_REFERENCE.md for constraints
Remove temperature from parameters, or use a different model

Error: "Model not found" or "Model not accessible"

Verify model name matches provider's documentation
Check provider registry: hillstar presets shows available models
Ensure API key has access to the model (some are restricted)

Workflow Validation

Error: "Invalid workflow schema"

# Validate workflow with verbose output
hillstar validate workflow.json --verbose

# Check against schema
cat spec/workflow-schema.json

Getting Help

# Show available commands
hillstar --help

# Get help for specific command
hillstar execute --help

# Check system compliance
hillstar enforce status

# View logs
ls .hillstar/

Why Hillstar?

Hillstar Banner

The name is inspired by the Chimborazo Hillstar, a hummingbird endemic to Ecuador's High Andes. Recent research (Cañas-Valle & Bouzat, 2025) has revealed remarkably unique behaviour among these hummingbirds: while the majority of hummingbirds are fiercely territorial, Chimborazo Hillstars engage in colonial nesting, practice cooperative roosting, and live together peacefully. They share information about resources, coordinate on survival strategies, and adapt fluidly to their environmental constraints.

License

Apache 2.0

Citation

If you use Hillstar Orchestrator in research, please cite:

@software{gamboa2026hillstar,
 title={Hillstar Orchestrator v1.0.0: A security-first Workflow
 orchestration for multi-agent AI pipelines},
 author={Gamboa, Julen},
 year={2026},
 url={https://github.com/evoclock/hillstar-orchestrator}
}

Status

🟢 v1.0.0 Production Release (Feb 28, 2026) - Core orchestration engine complete, tested, and production-ready with comprehensive security and governance.

v1.0.0 Capabilities:

Multi-provider access - Anthropic, OpenAI, Mistral, Google cloud APIs; local models (Ollama, llama.cpp, Devstral); MCP servers
Smart model selection - Four cost/quality presets or explicitly choose any model
Safe parameters - Model constraints auto-documented with helpful errors before execution
Workflow governance - Three commit modes: require workflow execution (default), dev mode, or one-time override
Secure credentials - Automatic credential redaction in logs/errors; never embedded in workflows
Workflow visualization - Mermaid DAG diagrams; topological execution order visible
Checkpoint/replay - Resume from saved state; audit trail of execution
Air-gapped ready - Run entirely offline with local models
Workflow discovery & validation - Auto-find and validate workflows before execution

v1.0.0 Test & Quality Metrics:

1,078 tests - 100% pass rate
91% code coverage - Comprehensive module coverage (41 modules at 100%, 20 at 95-99%)
Credential security - 24 pattern types detected and redacted
MCP integration - 7 MCP servers validated and tested
Performance - Topological DAG execution with intelligent fallback chains

Future Releases (v2.0+):

Advanced safety guards for complex testing infrastructure
SDK integration for token counting and real-time pricing
Extended provider support (Vertex AI, additional local models)
Plugin system and extensibility
Web UI for workflow visualization and management
Distributed execution across multiple machines

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.0

Mar 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hillstar_orchestrator-1.0.0.tar.gz (187.4 kB view details)

Uploaded Mar 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hillstar_orchestrator-1.0.0-py3-none-any.whl (222.1 kB view details)

Uploaded Mar 2, 2026 Python 3

File details

Details for the file hillstar_orchestrator-1.0.0.tar.gz.

File metadata

Download URL: hillstar_orchestrator-1.0.0.tar.gz
Upload date: Mar 2, 2026
Size: 187.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for hillstar_orchestrator-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`9e5ce114be53ea772f2df0268172483e9905e9a76b03bd70320670c865137f21`
MD5	`7e81db5f291f7150d9516ac3c6b22fd5`
BLAKE2b-256	`6f189531a7df2bb8a52edb5b3851e85bceb3cc7370d12080ced9e1558e86b0c6`

See more details on using hashes here.

File details

Details for the file hillstar_orchestrator-1.0.0-py3-none-any.whl.

File metadata

Download URL: hillstar_orchestrator-1.0.0-py3-none-any.whl
Upload date: Mar 2, 2026
Size: 222.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.8

File hashes

Hashes for hillstar_orchestrator-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`47569eeae39ff2a40a59c26b33e221f94952289367743681492ea405a8029dc1`
MD5	`f76b4b479aa4dcc98c29a1bc9ce969f8`
BLAKE2b-256	`c06875deba059a40be716e1c25fc1f01aa98c5bce046e24a1031f2f91af2f064`

See more details on using hashes here.

hillstar-orchestrator 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Hillstar Orchestrator v1.0.0

A security and reproducibility-first workflow orchestration tool

Current Features (v1.0.0)

Quick Start

Installation & Setup

Run a Workflow

Governance & Development Mode

Option 1: Execute a workflow, then commit (recommended)

Option 2: Enable development mode for code-only work

Option 3: One-time overrides

Architecture

Graph Execution Engine

Workflow Structure

Model Integration

Model Selection & Presets

Workflow Schema

Values Statement

Development

Project Structure

Local Development

Troubleshooting

API Key Issues

Error: "ANTHROPIC_API_KEY not found"

Error: "No valid credentials found for provider X"

Model Issues

Error: "Unsupported parameter: 'temperature' not supported..."

Error: "Model not found" or "Model not accessible"

Workflow Validation

Error: "Invalid workflow schema"

Getting Help

Why Hillstar?

License

Citation

Status

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes