Chuk Artifacts provides a production-ready, modular artifact storage system that works seamlessly across multiple storage backends (memory, filesystem, AWS S3, IBM Cloud Object Storage) with Redis or memory-based metadata caching and strict session-based security.

CHUK Artifacts

Session-scoped, grid-based artifact storage for AI apps and MCP servers

Python 3.11+ · License: MIT · Async

CHUK Artifacts provides a unified, async API for storing and retrieving files ("artifacts") across local development and production cloud environments—while enforcing session boundaries and issuing presigned upload/download URLs so clients interact with storage directly and securely.


Architecture at a Glance

Your app talks to ArtifactStore, which enforces session rules and issues presigned URLs. Clients upload and download directly against storage, so credentials are never exposed and large file streams are never proxied through your server.

                         (Your App / MCP Server)
                                     │
                                     │  ArtifactStore API (async)
                                     ▼
┌───────────────────────────────────────────────────────────────┐
│                        ArtifactStore                           │
│                                                               │
│  • Enforces session boundaries                                │
│  • Talks to storage providers                                 │
│  • Issues presigned upload/download URLs                       │
└───────────────┬───────────────────────────┬────────────────────┘
                │                           │
                │ session lookup            │ read/write files
                │                           │
                ▼                           ▼
        ┌────────────┐              ┌────────────────────────────┐
        │  Sessions  │              │        Storage             │
        │  (Redis)   │              │ (Memory / FS / S3 / COS)   │
        └─────┬──────┘              └───────┬─────────┬──────────┘
              │                              │         │
              │ authz                        │         │
              │                              │         │
              ▼                              ▼         ▼
        (session_id)                  grid/{sandbox}/{session}/{artifact}

Caption: The application calls ArtifactStore; the store consults the session provider for authz and talks to the configured storage backend. Clients use short-lived presigned URLs for direct uploads/downloads.


Why This Exists

Most platforms offer object storage (S3, COS, FS)—but not a security boundary.

CHUK Artifacts ensures every object belongs to a session and is accessed only through that session.
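Conceptually, the boundary reduces to a per-object ownership check. A minimal sketch in plain Python (illustrative only; `_objects` and `retrieve` here are hypothetical stand-ins, not the library's internals):

```python
# Illustrative model: every object records its owning session, and every
# read is checked against the caller's session before storage is touched.
_objects = {"artifact-1": {"session": "sess-alice", "data": b"hi"}}

def retrieve(artifact_id: str, caller_session: str) -> bytes:
    record = _objects[artifact_id]
    if record["session"] != caller_session:
        raise PermissionError("cross-session access denied")
    return record["data"]

assert retrieve("artifact-1", "sess-alice") == b"hi"
```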

Highlights:

  • 🔒 Session isolation - Every file belongs to a session, preventing data leaks
  • 🏗️ Predictable grid paths - grid/{sandbox}/{session}/{artifact} for infinite scale
  • 🔗 Presigned URLs - Secure direct upload/download without exposing credentials
  • 🌐 Multiple backends - Memory, Filesystem, S3, IBM COS (same API)
  • ⚡ Async-first - Built for FastAPI, MCP servers, and modern Python apps
  • 🎯 Zero config - Works out of the box, configure only what you need

Install

pip install chuk-artifacts

or:

uv add chuk-artifacts

Quick Start

from chuk_artifacts import ArtifactStore

async with ArtifactStore() as store:
    # Store a file
    file_id = await store.store(
        data=b"Hello, world!",
        mime="text/plain",
        summary="greeting",
        filename="hello.txt",
        user_id="alice"
    )

    # Generate secure download URL (15 minutes)
    url = await store.presign_short(file_id)

    # Read file content
    text = await store.read_file(file_id, as_text=True)
    assert text == "Hello, world!"

    # Update the file
    await store.update_file(
        file_id,
        data=b"Hello, updated world!",
        summary="Updated greeting"
    )

That's it! No AWS credentials, no Redis setup, no configuration files. Perfect for development and testing.


Providers & Sessions

Feature             Memory      Filesystem       S3           IBM COS
------------------  ----------  ---------------  -----------  ----------
Persistence         No          Yes              Yes          Yes
Horizontal scale    No          Limited          Yes          Yes
Presigned URLs      Yes*        Yes              Yes          Yes
Setup complexity    None        Minimal          Moderate     Moderate
Best use            Dev/Test    Small deploys    Production   Enterprise

* Memory URLs are virtual.

Quick config:

# Development (default)
# No configuration needed!

# Filesystem
export ARTIFACT_PROVIDER=filesystem
export ARTIFACT_FS_ROOT=./my-artifacts

# S3
export ARTIFACT_PROVIDER=s3
export AWS_ACCESS_KEY_ID=AKIA...
export AWS_SECRET_ACCESS_KEY=...
export ARTIFACT_BUCKET=my-bucket

# IBM COS
export ARTIFACT_PROVIDER=ibm_cos
export IBM_COS_ACCESS_KEY=...
export IBM_COS_SECRET_KEY=...
export IBM_COS_ENDPOINT=https://s3.us-south.cloud-object-storage.appdomain.cloud
export ARTIFACT_BUCKET=my-bucket

Core Concepts

Grid Architecture = Infinite Scale

Files are organized in a predictable, hierarchical grid structure:

grid/
├── {sandbox_id}/          # Application/environment isolation
│   ├── {session_id}/      # User/workflow grouping
│   │   ├── {artifact_id}  # Individual files
│   │   └── {artifact_id}
│   └── {session_id}/
│       ├── {artifact_id}
│       └── {artifact_id}
└── {sandbox_id}/
    └── ...

Why Grid Architecture?

  • 🔒 Security: Natural isolation between applications and users
  • 📈 Scalability: Supports billions of files across thousands of sessions
  • 🌐 Federation: Easily distribute across multiple storage regions
  • 🛠️ Operations: Predictable paths for backup, monitoring, and cleanup
  • 🔍 Debugging: Clear hierarchical organization for troubleshooting

# Grid paths are generated automatically
file_id = await store.store(data, mime="text/plain", summary="Test")

# Inspect the grid path
metadata = await store.metadata(file_id)
print(metadata.key)  # grid/my-app/session-abc123/artifact-def456

# Parse any grid path
parsed = store.parse_grid_key(metadata.key)
print(f"Sandbox: {parsed.sandbox_id}")
print(f"Session: {parsed.session_id}")
print(f"Artifact: {parsed.artifact_id}")
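Predictable keys also keep operational tooling simple: everything in a session shares one key prefix, so backup, cleanup, and listing jobs can work on prefixes alone. A small plain-Python illustration (`session_prefix` is a hypothetical helper, not part of the library):

```python
def session_prefix(sandbox_id: str, session_id: str) -> str:
    # Every artifact stored for this session lives under this key prefix
    return f"grid/{sandbox_id}/{session_id}/"

keys = [
    "grid/my-app/session-a/art-1",
    "grid/my-app/session-a/art-2",
    "grid/my-app/session-b/art-3",
]

# Select one session's files by prefix alone, no metadata lookup required
session_files = [k for k in keys if k.startswith(session_prefix("my-app", "session-a"))]
assert session_files == ["grid/my-app/session-a/art-1", "grid/my-app/session-a/art-2"]
```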

Sessions = Security Boundaries

Every file belongs to a session. Sessions prevent users from accessing each other's files:

# Files are isolated by session
alice_file = await store.store(
    data=b"Alice's private data",
    mime="text/plain",
    summary="Private file",
    user_id="alice"  # Auto-creates session for Alice
)

bob_file = await store.store(
    data=b"Bob's private data",
    mime="text/plain",
    summary="Private file",
    user_id="bob"  # Auto-creates session for Bob
)

# Cross-session operations are blocked for security
try:
    await store.copy_file(alice_file, target_session_id="bob_session")
except ArtifactStoreError:
    print("🔒 Cross-session access denied!")  # Security enforced

Common Recipes

Upload with Presigned URL

For large files, let clients upload directly to storage:

# Generate presigned upload URL
url, temp_id = await store.presign_upload(
    session_id="alice",
    filename="photo.jpg",
    mime_type="image/jpeg",
    expires=1800  # 30 minutes
)

# Client uploads to URL (HTTP PUT)
# No server proxying needed!

# Register the uploaded file
await store.register_uploaded_artifact(
    temp_id,
    mime="image/jpeg",
    summary="Profile pic",
    filename="photo.jpg"
)

Batch Store

Upload multiple files in one operation:

files = [
    {
        "data": image_bytes,  # raw JPEG bytes for image i
        "mime": "image/jpeg",
        "filename": f"products/img-{i}.jpg",
        "summary": f"Product image {i}",
        "meta": {"product_id": "LPT-001"}
    }
    for i in range(10)
]

file_ids = await store.store_batch(files, session_id="catalog")
print(f"Uploaded {len([fid for fid in file_ids if fid])} images")

Directory-Like Operations

# List files in a session
files = await store.list_by_session("session-123")
for f in files:
    print(f"{f.filename}: {f.bytes} bytes")

# Get directory contents
docs = await store.get_directory_contents("session-123", "docs/")
images = await store.get_directory_contents("session-123", "images/")

# Copy within same session (security enforced)
backup_id = await store.copy_file(
    doc_id,
    new_filename="docs/README_backup.md"
)

Web Framework Integration

from fastapi import FastAPI, UploadFile
from chuk_artifacts import ArtifactStore

app = FastAPI()
store = ArtifactStore(storage_provider="s3", session_provider="redis")

@app.post("/upload")
async def handle_upload(file: UploadFile, user_id: str):
    content = await file.read()

    file_id = await store.store(
        data=content,
        mime=file.content_type,
        summary=f"Uploaded: {file.filename}",
        filename=file.filename,
        user_id=user_id
    )

    # Generate download URL
    url = await store.presign_medium(file_id)
    return {"file_id": file_id, "download_url": url}

@app.get("/files/{user_id}")
async def list_files(user_id: str):
    session_id = f"user_{user_id}"
    files = await store.list_by_session(session_id)
    return [
        {
            "id": f.artifact_id,
            "name": f.filename,
            "size": f.bytes,
            "created": f.stored_at
        }
        for f in files
    ]

MCP Server Integration

from mcp import Server
from chuk_artifacts import ArtifactStore
import base64

server = Server("artifacts-mcp")
store = ArtifactStore()

@server.tool("upload_file")
async def upload_file(data_b64: str, filename: str, session_id: str):
    """MCP tool for file uploads"""
    data = base64.b64decode(data_b64)

    file_id = await store.store(
        data=data,
        mime="application/octet-stream",
        summary=f"Uploaded: {filename}",
        filename=filename,
        session_id=session_id
    )

    url = await store.presign_medium(file_id)
    return {
        "file_id": file_id,
        "filename": filename,
        "size": len(data),
        "download_url": url
    }

@server.tool("list_files")
async def list_files(session_id: str):
    """List files in session"""
    files = await store.list_by_session(session_id)
    return {
        "files": [
            {
                "id": f.artifact_id,
                "name": f.filename,
                "size": f.bytes,
                "type": f.mime
            }
            for f in files
        ]
    }

Configuration

Development (Zero Config)

from chuk_artifacts import ArtifactStore

# Just works!
store = ArtifactStore()

Filesystem (Local Persistence)

from chuk_artifacts.config import configure_filesystem

configure_filesystem(root="./my-artifacts")
store = ArtifactStore()

S3 (Production)

from chuk_artifacts.config import configure_s3

configure_s3(
    access_key="AKIA...",
    secret_key="...",
    bucket="production-artifacts",
    region="us-east-1"
)
store = ArtifactStore()

Docker Compose

version: '3.8'
services:
  app:
    image: myapp
    environment:
      ARTIFACT_PROVIDER: s3
      AWS_ACCESS_KEY_ID: ${AWS_ACCESS_KEY_ID}
      AWS_SECRET_ACCESS_KEY: ${AWS_SECRET_ACCESS_KEY}
      ARTIFACT_BUCKET: myapp-artifacts
      SESSION_PROVIDER: redis
      SESSION_REDIS_URL: redis://redis:6379/0
    depends_on:
      - redis

  redis:
    image: redis:7-alpine
    volumes:
      - redis_data:/data

volumes:
  redis_data:

Advanced Features

Presigned URLs

# Different durations
url = await store.presign(file_id, expires=3600)  # Custom
short = await store.presign_short(file_id)        # 15 min
medium = await store.presign_medium(file_id)      # 1 hour
long = await store.presign_long(file_id)          # 24 hours

Rich Metadata

file_id = await store.store(
    data=image_bytes,
    mime="image/jpeg",
    summary="Product photo",
    filename="products/laptop-pro.jpg",
    user_id="marketing",
    meta={
        "product_id": "LPT-001",
        "tags": ["laptop", "professional"],
        "dimensions": {"width": 1920, "height": 1080}
    }
)

# Update metadata without changing content
await store.update_metadata(
    file_id,
    summary="Updated product photo",
    meta={"tags": ["laptop", "professional", "workspace"]},
    merge=True
)

File Operations

# Update file content
await store.update_file(file_id, data=new_content)

# Copy (same session only)
copy_id = await store.copy_file(file_id, new_filename="backup.txt")

# Move/rename
moved = await store.move_file(file_id, new_filename="renamed.txt")

# Check existence
if await store.exists(file_id):
    print("File exists!")

# Delete
deleted = await store.delete(file_id)

Monitoring

# Validate configuration
status = await store.validate_configuration()
print(f"Storage: {status['storage']['status']}")
print(f"Sessions: {status['session']['status']}")

# Get statistics
stats = await store.get_stats()
print(f"Provider: {stats['storage_provider']}")
print(f"Bucket: {stats['bucket']}")

# Cleanup expired sessions
cleaned = await store.cleanup_expired_sessions()

Error Handling

from chuk_artifacts import (
    ArtifactStoreError,
    ArtifactNotFoundError,
    ArtifactExpiredError,
    ProviderError
)

try:
    data = await store.retrieve(file_id)
except ArtifactNotFoundError:
    print("File not found")
except ArtifactExpiredError:
    print("File has expired")
except ProviderError as e:
    print(f"Storage error: {e}")
except ArtifactStoreError as e:
    print(f"Access denied: {e}")

Security Best Practices

Session Isolation

# ✅ Good: Each user gets their own session
user_session = f"user_{user.id}"
await store.store(data, mime="text/plain", session_id=user_session)

# ✅ Good: Organization-level isolation
org_session = f"org_{org.id}"

# ❌ Bad: Shared sessions across users
shared_session = "global"  # All users can see each other's files!

Access Control

async def secure_download(file_id: str, user_id: str):
    """Verify ownership before serving"""
    metadata = await store.metadata(file_id)
    expected_session = f"user_{user_id}"

    if metadata.session_id != expected_session:
        raise HTTPException(403, "Access denied")

    return await store.presign(file_id)

Secure Configuration

# ✅ Good: Environment variables
store = ArtifactStore(
    storage_provider=os.getenv("ARTIFACT_PROVIDER", "memory")
)

# ✅ Good: IAM roles (AWS/IBM)
# No credentials needed!

# ❌ Bad: Hardcoded credentials
store = ArtifactStore(
    access_key="AKIA123...",  # Never do this!
)

Performance

Typical benchmarks with S3 + Redis:

✅ File Storage:     3,083 files/sec
✅ File Retrieval:   4,693 reads/sec
✅ File Updates:     2,156 updates/sec
✅ Batch Operations: 1,811 batch items/sec
✅ Session Listing:  ~2ms for 20+ files
✅ Metadata Access:  <1ms with Redis

Performance tips:

  • Use batch operations for multiple files
  • Reuse store instances (connection pooling)
  • Use presigned URLs for large files
  • Choose appropriate TTL values

Testing

Run Smoke Tests

# Comprehensive test suite
python examples/smoke_run.py

# Expected: 32/33 tests passing (97%)

Integration Demo

python examples/artifact_grid_demo.py
python examples/grid_demo.py
python examples/usage_examples_demo.py

Unit Tests

import asyncio
from chuk_artifacts import ArtifactStore

async def test_basic_operations():
    async with ArtifactStore() as store:
        # Store
        file_id = await store.store(
            data=b"test",
            mime="text/plain",
            summary="Test"
        )

        # Verify
        assert await store.exists(file_id)
        content = await store.retrieve(file_id)
        assert content == b"test"

        # Metadata
        meta = await store.metadata(file_id)
        assert meta.bytes == 4

        print("✅ Tests passed!")

asyncio.run(test_basic_operations())

FAQ

Q: Do I need Redis for development?

A: No! Memory providers work well for development. Use Redis in production when you need persistence or multi-instance deployments.

Q: Can I switch storage providers without code changes?

A: Yes! Just change the ARTIFACT_PROVIDER environment variable. The API is identical across all providers.

Q: How do I handle large files?

A: Use presigned upload URLs for client-side uploads:

url, artifact_id = await store.presign_upload(
    session_id="user",
    filename="video.mp4",
    mime_type="video/mp4",
    expires=1800  # 30 min
)
# Client uploads directly to URL

Q: What happens when files expire?

A: Files and metadata are automatically cleaned up based on TTL:

# Set TTL when storing
await store.store(data, mime="text/plain", ttl=3600)  # 1 hour

# Manual cleanup
expired = await store.cleanup_expired_sessions()
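The expiry rule itself is simple. Sketched in plain Python (`is_expired` is an illustrative stand-in for however the library tracks expiry internally):

```python
import time

def is_expired(stored_at: float, ttl_seconds: int) -> bool:
    # An artifact is expired once ttl_seconds have elapsed since it was stored
    return time.time() - stored_at > ttl_seconds

assert is_expired(time.time() - 7200, 3600)       # stored 2h ago, 1h TTL
assert not is_expired(time.time(), 3600)          # just stored
```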

Q: Is it production ready?

A: Yes! Production-oriented features include:

  • High performance (3,000+ ops/sec)
  • Multiple storage backends (S3, IBM COS)
  • Session-based security
  • Redis support for distributed deployments
  • Comprehensive error handling
  • Health checks and monitoring
  • Docker/K8s ready

Roadmap

  • GCS backend - Google Cloud Storage support
  • Azure Blob Storage - Microsoft Azure support
  • Checksums - SHA-256 validation on all operations
  • Client-side encryption - Optional end-to-end encryption
  • Audit logging - Detailed access logs for compliance
  • Lifecycle policies - Automated archival and deletion rules
  • CDN integration - CloudFront/Cloudflare integration
  • Multi-region - Automatic replication across regions

Next Steps

  1. Install: pip install chuk-artifacts
  2. Try it: Copy the Quick Start example
  3. Development: Use default memory providers
  4. Production: Configure S3 + Redis
  5. Integration: Add to your FastAPI/MCP server

Ready to build with enterprise-grade file storage? 🚀


License

MIT License - see LICENSE file for details.
