moonlight-ai

Lightweight AI Agents SDK for building intelligent automation systems

These details have not been verified by PyPI

Project links

Project description

MoonLight

Minimal async AI agent framework with zero bloat

Moonlight is a lightweight SDK for building AI agents with full control. It provides async stateful agents, multimodal input/output, structured responses via Pydantic/dataclass, and works with any OpenAI-compatible provider. No dependencies on OpenAI libraries, no hidden abstractions, no framework bloat.

Installation

uv pip install moonlight-ai

Quick Start

import asyncio
from moonlight import Provider, Agent, Content

# Configure provider
provider = Provider(
    source="openrouter",  # or "openai", "deepseek", custom URL
    api="your-api-key"
)

# Create agent
analysis_agent = Agent(
    provider=provider,
    model="qwen/qwen3-4b:free",
    system_role="You are a data analyst"
)

# Analyze some data
data = """
Q1 Sales: $125k, Q2 Sales: $157k, Q3 Sales: $198k, Q4 Sales: $223k
Top product: Widget A (45% revenue), Customer satisfaction: 4.2/5
"""

prompt = Content(f"Analyze this business data and provide key insights:\n{data}")

# Run async
response = asyncio.run(analysis_agent.run(prompt))
print(response.content)
# Output: The business shows strong growth momentum with 78% increase from Q1 to Q4...

Core Features

Structured Output

Return type-safe responses using Pydantic models or dataclasses:

from pydantic import BaseModel
from typing import List
from enum import Enum

class Sentiment(str, Enum):
    POSITIVE = "positive"
    NEGATIVE = "negative"
    NEUTRAL = "neutral"

class Entity(BaseModel):
    name: str
    type: str  # person, organization, location, etc.
    mentions: int

class Analysis(BaseModel):
    sentiment: Sentiment
    confidence: float
    key_topics: List[str]
    entities: List[Entity]
    summary: str

sentiment_agent = Agent(
    provider=provider,
    model="google/gemini-3-flash-preview",
    output_schema=Analysis  # Automatic JSON mode + validation
)

text = """
Apple Inc. announced record quarterly earnings today, with CEO Tim Cook 
praising the team's innovation. The iPhone 15 sales exceeded expectations 
in Asian markets, particularly China and India.
"""

result: Analysis = asyncio.run(sentiment_agent.run(Content(f"Analyze this text:\n{text}")))
print(result.sentiment)
# Output: Sentiment.POSITIVE

print(result.confidence)
# Output: 0.92

print(result.entities[0].name)
# Output: Apple Inc.

print(result.summary)
# Output: Apple reports strong earnings driven by iPhone 15 success in Asia

The SDK automatically:

Enables JSON mode on the provider
Injects schema into system prompt
Validates and parses response into your model
Handles nested structures and optional fields

Multimodal Input

Send images alongside text (URLs, local files, or base64):

response = asyncio.run(agent.run(
    Content(
        text="What's in these images?",
        images=[
            "https://example.com/image.jpg",  # URL
            "/path/to/local/image.png",        # Local file
            "data:image/jpeg;base64,..."       # Base64
        ]
    )
))
print(response.content)
# Output: The first image shows a sunset over mountains...

Images are automatically:

Downloaded from URLs (async)
Read from disk with proper MIME types
Converted to base64 data URIs
Validated and filtered

Conversation History

Agents maintain stateful conversation history:

agent = Agent(provider=provider, model="gpt-5.2")

# First turn
asyncio.run(agent.run(Content("My name is Alice")))

# Second turn (agent remembers context)
response = asyncio.run(agent.run(Content("What's my name?")))
print(response.content)
# Output: Your name is Alice

# Clear history
agent.clear()

# Update system role mid-conversation
agent.update_system_role("You are now a pirate")

Provider Support

Works with any OpenAI-compatible API:

# Built-in providers
Provider(source="openai", api="sk-...")
Provider(source="deepseek", api="sk-...")
Provider(source="openrouter", api="sk-...")
Provider(source="together", api="...")
Provider(source="groq", api="gsk-...")
Provider(source="google", api="...")

# Custom endpoints
Provider(source="http://localhost:11434/v1", api="ollama")
Provider(source="https://api.custom.com/v1", api="key")

Supported providers: OpenAI, DeepSeek, Together, Groq, Google AI, HuggingFace, OpenRouter, or any custom OpenAI-compatible endpoint.

Token Tracking

Agents track token usage automatically:

agent = Agent(provider=provider, model="gpt-4o")
asyncio.run(agent.run(Content("Hello")))

print(agent.get_total_tokens())
# Output: 156 (total tokens used)

Error Handling

Detailed error messages from providers:

response = asyncio.run(agent.run(Content("...")))

if response.error:
    print(f"Error: {response.error}")
    # Output: Error: Rate limited - too many requests
else:
    print(response.content)
    # Output: [normal response content]

Handles:

Invalid credentials (401)
Rate limits (429)
Content moderation (403)
Parameter validation errors (400)
Provider-specific raw error parsing

Architecture

moonlight/
└── src/
    ├── agent/
    │   ├── base.py             # Content dataclass
    │   ├── history.py          # AgentHistory (conversation + image processing)
    │   └── main.py             # Agent class
    ├── provider/
    │   ├── main.py             # Provider class
    │   └── completion.py       # GetCompletion (async API calls)
    └── helpers/
        └── model_converter.py  # Schema/model conversion utilities

Design Philosophy

Moonlight is intentionally minimal:

No framework lock-in: Standard Python async, bring your own orchestration
No hidden magic: Direct API calls, explicit control flow
No bloat: Zero dependencies on OpenAI SDK or heavy frameworks
Full control: Access raw responses, customize at any level
Provider agnostic: Works with any OpenAI-compatible API

What Moonlight Doesn't Do

To stay lightweight, Moonlight does not include:

Multi-agent orchestration (for now) (build your own with asyncio)
RAG systems or vector databases
Web scraping or search
Tool calling (planned)
Streaming responses
Built-in retry logic (planned)
Observability or logging (planned)

These are left to you or future extensions to keep the core minimal.

Advanced Configuration

agent = Agent(
    provider=provider,
    model="gpt-4o",
    system_role="You are an expert analyst",
    output_schema=MyModel,  # Optional structured output
    temperature=0.7,
    top_p=0.9,
    max_completion_tokens=2048,
    frequency_penalty=0.5,
    presence_penalty=0.5
)

# Access history
messages = agent.get_history()

# Token usage
tokens = agent.get_total_tokens()

Building From Source

# Clone repo
git clone https://github.com/ecstra/moonlight.git
cd moonlight

# Build distribution
pip install build twine
python -m build

# Install locally
pip install dist/moonlight_ai-0.2.0-py3-none-any.whl

# Test
python -c "from moonlight import Agent; print('OK')"

Roadmap

Retry logic for API calls and schema validation
Sequential and parallel agent execution engines
Tool calling support
Logging and Observability
MCP (Model Context Protocol) integration
RAG and WebSearch (if needed)

License

MIT License - use freely in personal and commercial projects.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.0

Jun 1, 2026

0.2.7

Jan 31, 2026

0.2.6

Jan 16, 2026

0.2.5

Jan 12, 2026

0.2.4

Jan 11, 2026

0.2.3

Jan 11, 2026

This version

0.2.2

Jan 10, 2026

0.2.1

Jan 10, 2026

0.1.10

Dec 24, 2025

0.1.9

May 31, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

moonlight_ai-0.2.2.tar.gz (24.1 kB view details)

Uploaded Jan 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

moonlight_ai-0.2.2-py3-none-any.whl (22.7 kB view details)

Uploaded Jan 10, 2026 Python 3

File details

Details for the file moonlight_ai-0.2.2.tar.gz.

File metadata

Download URL: moonlight_ai-0.2.2.tar.gz
Upload date: Jan 10, 2026
Size: 24.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for moonlight_ai-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`d7c30a09c27a88165e812a5ba928fad6ffcc0c36222194c328470b3acdec8328`
MD5	`15d9990e526ab00ec2a201dfa94b4b62`
BLAKE2b-256	`3fe55aa394a39db6c0d15af3586e54e73790a6c0cbe9224a63eaa48a0f9b5a9b`

See more details on using hashes here.

File details

Details for the file moonlight_ai-0.2.2-py3-none-any.whl.

File metadata

Download URL: moonlight_ai-0.2.2-py3-none-any.whl
Upload date: Jan 10, 2026
Size: 22.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for moonlight_ai-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`102613a519444973ac0453e6ca81890363fe926790899e44ad71816c52bc93e0`
MD5	`4616e14aa037603d7239989abc872582`
BLAKE2b-256	`8bb49fcd5a1370c5e5a5383610f41537c5fc80ee2a52ad90ebe8605d6db6caa4`

See more details on using hashes here.

moonlight-ai 0.2.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Installation

Quick Start

Core Features

Structured Output

Multimodal Input

Conversation History

Provider Support

Token Tracking

Error Handling

Architecture

Design Philosophy

What Moonlight Doesn't Do

Advanced Configuration

Building From Source

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes