Python SDK for creating verse-based content sites with AI translations, multimedia (images, audio), semantic search, RAG-grounded Puranic context, and deployment

Sanatan Verse SDK - Python SDK for Spiritual Verse Collections

Complete toolkit for generating rich multimedia content for spiritual text collections (Hanuman Chalisa, Sundar Kaand, etc.)

Features

🔄 Complete Workflow: Generate media and embeddings from canonical sources - all in one command
📖 Canonical Sources: Local YAML files ensure text accuracy and quality
🎨 AI Images: Generate themed images with DALL-E 3
🎵 Audio Pronunciation: Full and slow-speed audio with ElevenLabs
🔍 Semantic Search: Vector embeddings for intelligent verse discovery
📚 Multi-Collection: Organized support for multiple verse collections
🎨 Theme System: Customizable visual styles (modern, traditional, kids-friendly, etc.)

Quick Start

Start here: End-to-End Workflow

New Project Setup (Recommended)

# 1. Install
pip install sanatan-verse-sdk

# 2. Create project with collection templates
verse-init --project-name my-verse-project --collection hanuman-chalisa
cd my-verse-project

# 3. Configure API keys
cp .env.example .env
# Edit .env and add your API keys from:
# - OpenAI: https://platform.openai.com/api-keys
# - ElevenLabs: https://elevenlabs.io/app/settings/api-keys

# 4. Add canonical Devanagari text
# Edit data/verses/hanuman-chalisa.yaml with actual verse text

# 5. Validate setup
verse-validate

# 6. Generate multimedia content
verse-generate --collection hanuman-chalisa --verse 1

What you get: Verse file, AI-generated image, audio (full + slow speed), and search embeddings!

Existing Project

# Validate and fix structure
verse-validate --fix

# Generate content
verse-generate --collection hanuman-chalisa --verse 15

# Check status
verse-status --collection hanuman-chalisa

Advanced Usage

# Multiple collections at once
verse-init --collection hanuman-chalisa --collection sundar-kaand

# Custom number of sample verses
verse-init --collection my-collection --num-verses 10

# Generate specific components only
verse-generate --collection sundar-kaand --verse 3 --image
verse-generate --collection sundar-kaand --verse 3 --audio

# Skip embeddings update (faster)
verse-generate --collection hanuman-chalisa --verse 15 --no-update-embeddings
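
To generate several verses in a row, the commands above can be driven from a short script. The following is a minimal sketch and not part of the SDK: the helper names `build_generate_command` and `generate_range` are hypothetical, and only the `--collection`, `--verse`, and `--no-update-embeddings` flags shown above are used. It assumes `verse-generate` is on `PATH` and exits with a non-zero status on failure.

```python
import subprocess
from typing import List


def build_generate_command(collection: str, verse: int,
                           update_embeddings: bool = True) -> List[str]:
    """Build the argv list for one verse-generate call (hypothetical helper)."""
    cmd = ["verse-generate", "--collection", collection, "--verse", str(verse)]
    if not update_embeddings:
        # Flag taken from the examples above; skips the embeddings update.
        cmd.append("--no-update-embeddings")
    return cmd


def generate_range(collection: str, first: int, last: int) -> None:
    """Run verse-generate for verses first..last, stopping on the first failure."""
    for verse in range(first, last + 1):
        # check=True raises CalledProcessError if the command exits non-zero
        # (assumed behaviour of the CLI on failure).
        subprocess.run(build_generate_command(collection, verse), check=True)
```

Building the argument list separately from running it keeps the command easy to inspect or log before any API calls are made.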

What Gets Generated

Each verse generation creates:

🎨 Image: images/{collection}/{theme}/verse-01.png (DALL-E 3)
🎵 Audio (full): audio/{collection}/verse-01-full.mp3 (ElevenLabs)
🎵 Audio (slow): audio/{collection}/verse-01-slow.mp3 (0.75x speed)
🔍 Embeddings: data/embeddings/collections/{collection}.json + data/embeddings/collections/index.json (for semantic search)
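
The layout above can be turned into a quick check for which outputs exist for a verse. This is an illustrative sketch only: `expected_outputs` is a hypothetical helper, and it assumes the verse file stem (for example `verse-01`) and the theme directory name (for example `modern`) match what the SDK writes in your project.

```python
from pathlib import PurePosixPath
from typing import Dict


def expected_outputs(collection: str, theme: str, verse_id: str) -> Dict[str, PurePosixPath]:
    """Return the per-verse output paths, following the layout listed above."""
    return {
        "image": PurePosixPath("images") / collection / theme / f"{verse_id}.png",
        "audio_full": PurePosixPath("audio") / collection / f"{verse_id}-full.mp3",
        "audio_slow": PurePosixPath("audio") / collection / f"{verse_id}-slow.mp3",
        # One embeddings file per collection, shared by all of its verses.
        "embeddings": PurePosixPath("data/embeddings/collections") / f"{collection}.json",
    }


for kind, path in expected_outputs("hanuman-chalisa", "modern", "verse-01").items():
    print(f"{kind}: {path}")
```

Joining each path onto the project root with `pathlib.Path` and calling `.exists()` gives a simple report of missing files.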

Text Source: Canonical Devanagari text from data/verses/{collection}.yaml (Local Verses Guide)

Migration Note: Legacy combined embeddings (data/embeddings.json) are no longer written by default. Use verse-embeddings --legacy-output if you still need the combined file.

Puranic Context Generation

Enrich verse pages with grounded story references from indexed sacred texts. The workflow has two stages:

Stage 1 — Index a Source Text

verse-index-sources --file data/sources/ananda-ramayana.txt

This command:

1. Splits the source text into ~4000-char chunks
2. Parses each chunk into discrete named episodes (keywords, type, summary in English + Hindi)
3. Generates embeddings for each episode
4. Writes outputs:
   - data/puranic-index/{key}.yml: human-readable episode index with a _meta section
   - data/embeddings/puranic/{key}.json: embedding vectors for RAG retrieval
   - data/puranic-references.yml: registry of indexed sources

Only needs to run once per source, or when the source file changes.

# Use Bedrock Cohere for better Sanskrit/Hindi accuracy
verse-index-sources --file data/sources/shiv-puran.txt --provider bedrock-cohere

# If Bedrock input exceeds limits, set a truncation policy
verse-embeddings --provider bedrock-cohere --truncate-policy chunk

# Larger chunk size for dense Puranic prose
verse-index-sources --file data/sources/valmiki-ramayana.pdf --chunk-size 6000
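
To make the effect of the chunk size concrete, here is a minimal sketch of character-bounded chunking. It is not the SDK's implementation: the function name `chunk_text` and the choice to break on blank lines between paragraphs are assumptions, and the real command may split differently.

```python
from typing import List


def chunk_text(text: str, chunk_size: int = 4000) -> List[str]:
    """Greedily pack paragraphs into chunks of at most chunk_size characters."""
    chunks: List[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        # Hard-split any single paragraph that is longer than one chunk.
        while len(paragraph) > chunk_size:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(paragraph[:chunk_size])
            paragraph = paragraph[chunk_size:]
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= chunk_size:
            current = candidate
        else:
            # Adding this paragraph would overflow, so close the current chunk.
            chunks.append(current)
            current = paragraph
    if current:
        chunks.append(current)
    return chunks
```

A larger `chunk_size` yields fewer, longer chunks, so each parsing call sees more surrounding context at the cost of a longer input.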

Stage 2 — Generate Puranic Context per Verse

verse-puranic-context --collection hanuman-chalisa --all

For each verse this command:

1. Embeds the verse text using the same provider as the indexed source
2. Runs cosine similarity search across all indexed sources to find the most relevant episodes
3. Filters to episodes involving the collection's subject (configured in _data/collections.yml or, as a project default, in _data/verse-config.yml)
4. Passes the top episodes and the verse text to GPT-4o with citation constraints
5. Post-validates each entry: drops entries where the subject is not an active participant
6. Writes a puranic_context: block into the verse's .md frontmatter
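
The retrieval and subject-filter steps above can be sketched in a few lines. This is an illustration under stated assumptions, not the SDK's code: the episode dictionary shape (`name`, `keywords`, `embedding`) and the keyword-based subject match are hypothetical, and the real index files may be structured differently.

```python
import math
from typing import Any, Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_episodes(verse_vec: Sequence[float], episodes: List[Dict[str, Any]],
                 subject: str, k: int = 3) -> List[Tuple[float, str]]:
    """Rank episodes by similarity to the verse, then keep those mentioning the subject."""
    scored = [(cosine_similarity(verse_vec, ep["embedding"]), ep) for ep in episodes]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    wanted = subject.lower()
    matches = [
        (score, ep["name"])
        for score, ep in scored
        # Hypothetical subject test: a case-insensitive keyword match.
        if wanted in (kw.lower() for kw in ep["keywords"])
    ]
    return matches[:k]
```

Because the verse and the episodes must be compared in the same vector space, the verse embedding has to come from the same provider that indexed the source, as step 1 notes.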

# Skip verses that already have context (default)
verse-puranic-context --collection hanuman-chalisa --all

# Regenerate all existing entries
verse-puranic-context --collection hanuman-chalisa --all --regenerate

# Single verse
verse-puranic-context --collection hanuman-chalisa --verse chaupai-06

Collection Subject Configuration

The subject filter is resolved via a two-level hierarchy — no CLI flag needed:

Option A — Project-level default (single-subject projects): set once in _data/verse-config.yml, applies to all collections:

# _data/verse-config.yml
defaults:
  subject: Hanuman
  subject_type: deity

Option B — Collection-level override: set per collection in _data/collections.yml (takes priority over project default):

# _data/collections.yml
hanuman-chalisa:
  subject: Hanuman      # overrides or supplements project default
  subject_type: deity

krishna-bhajans:
  subject: Krishna      # different subject for this collection
  subject_type: deity

Resolution order: collection-level → project default → error if neither is set and indexed sources exist.
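
The resolution order can be expressed as a small function over the two parsed YAML files. This is a sketch only: `resolve_subject` and `SubjectNotConfiguredError` are hypothetical names, the configs are passed in as already-loaded dictionaries, and returning `None` when nothing is configured and no sources are indexed is an assumption about the unspecified case.

```python
from typing import Any, Dict, Optional


class SubjectNotConfiguredError(ValueError):
    """Hypothetical error for the case where no subject is set at either level."""


def resolve_subject(collection: str,
                    collections_cfg: Dict[str, Any],
                    project_cfg: Dict[str, Any],
                    has_indexed_sources: bool) -> Optional[Dict[str, str]]:
    """Resolve subject settings: collection-level first, then project defaults."""
    defaults = project_cfg.get("defaults") or {}
    overrides = collections_cfg.get(collection) or {}
    merged: Dict[str, str] = {}
    for key in ("subject", "subject_type"):
        # A collection-level value wins; otherwise fall back to the project default.
        value = overrides.get(key) or defaults.get(key)
        if value:
            merged[key] = value
    if "subject" in merged:
        return merged
    if has_indexed_sources:
        raise SubjectNotConfiguredError(
            f"No subject configured for collection {collection!r}"
        )
    return None  # Assumed: nothing to filter when no sources are indexed.
```

Merging field by field lets a collection override only `subject` while still inheriting `subject_type` from the project default.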

Multiple Sources

Multiple indexed sources are automatically combined in RAG retrieval:

_data/verse-config.yml         ← set defaults.subject here

data/sources/
  shiv-puran-part1.txt
  ananda-ramayana.txt        ← add new sources here

data/puranic-index/
  shiv-puran-part1.yml       ← auto-generated episode index
  ananda-ramayana.yml

data/embeddings/
  puranic/
    shiv-puran-part1.json      ← auto-generated embedding vectors
    ananda-ramayana.json

See verse-index-sources and verse-puranic-context for full documentation.

Migration Note: Puranic embeddings now live under data/embeddings/puranic/. If you have legacy files in data/embeddings/{source}.json, move them or re-run verse-index-sources to regenerate.
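
If you prefer to move the legacy files rather than re-index, the move can be scripted. The sketch below is an assumption-laden illustration, not an SDK command: `move_legacy_puranic_embeddings` is a hypothetical helper, and you must pass the source keys explicitly because the script cannot tell which JSON files under data/embeddings/ are Puranic sources.

```python
import shutil
from pathlib import Path
from typing import List


def move_legacy_puranic_embeddings(project_root: Path, source_keys: List[str]) -> List[Path]:
    """Move data/embeddings/{key}.json to data/embeddings/puranic/{key}.json."""
    legacy_dir = project_root / "data" / "embeddings"
    target_dir = legacy_dir / "puranic"
    moved: List[Path] = []
    for key in source_keys:
        legacy_file = legacy_dir / f"{key}.json"
        if not legacy_file.is_file():
            continue  # Nothing to migrate for this source.
        target = target_dir / legacy_file.name
        if target.exists():
            continue  # Never overwrite a file that was already regenerated.
        target_dir.mkdir(parents=True, exist_ok=True)
        shutil.move(str(legacy_file), str(target))
        moved.append(target)
    return moved
```

For example, `move_legacy_puranic_embeddings(Path("."), ["ananda-ramayana"])` run from the project root moves that one source and returns the new path.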

Installation

pip install sanatan-verse-sdk

Commands

Project Setup

verse-init - Initialize new project with recommended structure
verse-validate - Validate project structure and configuration

Content Generation

verse-parse-source - Parse canonical source text into YAML
verse-generate - Complete orchestrator for verse content (text fetching, multimedia generation, embeddings)
verse-translate - Translate verses into multiple languages (Hindi, Spanish, French, etc.)
verse-images - Generate images using DALL-E 3
verse-audio - Generate audio pronunciations using ElevenLabs
verse-embeddings - Generate vector embeddings for semantic search (multi-collection guide)

Puranic Context

verse-index-sources - Index Puranic source texts (PDFs, TXTs) into episodes and embeddings for RAG retrieval
verse-puranic-context - Generate Puranic context boxes for verses (RAG-grounded or GPT-4o free recall)

Project Management

verse-add - Add new verse entries to collections (supports multi-chapter formats)
verse-status - Check status, completion, and validate text against canonical source
verse-sync - Sync verse text with canonical source (fix mismatches)
verse-deploy - Deploy Cloudflare Worker for API proxy

Embeddings Config

embeddings.yml - Shared defaults and precedence (CLI > config > env > defaults)
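
The precedence chain (CLI > config > env > defaults) amounts to a first-match lookup. The following is a minimal sketch, not the SDK's resolver: the function name `resolve_setting` and the `VERSE_` environment-variable prefix are hypothetical, so check the Command Reference for the actual variable names.

```python
from typing import Any, Mapping


def resolve_setting(name: str,
                    cli: Mapping[str, Any],
                    config: Mapping[str, Any],
                    env: Mapping[str, str],
                    defaults: Mapping[str, Any],
                    env_prefix: str = "VERSE_") -> Any:
    """Return the first value found, checking CLI, config file, env, then defaults."""
    if cli.get(name) is not None:
        return cli[name]
    if config.get(name) is not None:
        return config[name]
    # Hypothetical naming scheme: setting "provider" maps to VERSE_PROVIDER.
    env_key = env_prefix + name.upper()
    if env_key in env:
        return env[env_key]
    return defaults.get(name)
```

In practice `cli` would come from parsed command-line arguments, `config` from the loaded embeddings.yml, and `env` from `os.environ`.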

Configuration

Copy the example environment file and add your API keys:

cp .env.example .env
# Edit .env and add your API keys

See the End-to-End Workflow for the full lifecycle, and the Usage Guide for advanced workflows and best practices.

Documentation

End-to-End Workflow - Initialize, generate, index, and deploy (full lifecycle)
Usage Guide - Advanced workflows, batch processing, and best practices
Local Verses Guide - Using local YAML files for verse text
Chapter-Based Formats - Multi-chapter collections (Bhagavad Gita, etc.)
Command Reference - Detailed documentation for all commands
Development Guide - Setup and contributing to verse-sdk
Troubleshooting - Common issues and solutions
Multi-Collection Guide - Working with multiple collections
Publishing Guide - For maintainers

Example Project

Hanuman GPT - Multi-collection project with Hanuman Chalisa, Sundar Kaand, and Sankat Mochan Hanumanashtak

Requirements

Python 3.8+
OpenAI API key (for text/images/embeddings)
ElevenLabs API key (for audio)

License

MIT License - See LICENSE file for details

Download files

Source Distribution

sanatan_verse_sdk-0.51.0.tar.gz (133.4 kB)

Uploaded Mar 1, 2026 Source

Built Distribution

sanatan_verse_sdk-0.51.0-py3-none-any.whl (152.1 kB)

Uploaded Mar 1, 2026 Python 3

File details

Details for the file sanatan_verse_sdk-0.51.0.tar.gz.

File metadata

Download URL: sanatan_verse_sdk-0.51.0.tar.gz
Upload date: Mar 1, 2026
Size: 133.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for sanatan_verse_sdk-0.51.0.tar.gz
Algorithm	Hash digest
SHA256	`c1e39391993a23f007be78af3cc790cfc3d771a24df50b1e3bfbd5d7dbc98905`
MD5	`561af9a04e3ea46af3e4da9a8de4d329`
BLAKE2b-256	`13e6976df5b68e970fc40df4825fd218d031a685537c1a4767994e405e82fb00`

File details

Details for the file sanatan_verse_sdk-0.51.0-py3-none-any.whl.

File metadata

Download URL: sanatan_verse_sdk-0.51.0-py3-none-any.whl
Upload date: Mar 1, 2026
Size: 152.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for sanatan_verse_sdk-0.51.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f8d8ae78dfe787c6f284c2da108993676416adde5cd7192d2194af97dcdbf244`
MD5	`03e656dc39d3a058e26989ab5459106d`
BLAKE2b-256	`8afbfc5762718ff897d95fd5c3f5fe5755844102945d3142b002a30723791d0a`
