Branch AI reasoning into parallel thought paths

These details have not been verified by PyPI

Project links

Project description

🔀 Thought Fork

Branch your AI's reasoning like Git branches.

Thought Fork is a Python framework that introduces Attributed Convergence to AI reasoning. Instead of asking a single AI model to give you one generic answer, Thought Fork dynamically invents $N$ distinct reasoning perspectives (stances) tailored to your specific prompt, branches the AI into parallel reasoning streams, and then merges them back together into a single, synthesized, highly-structured answer.

The result? The AI explicitly credits which perspective contributed what insight, providing unprecedented transparency and depth.

(Scroll down for Arabic | انزل تحت للشرح بالعربي)

🚀 Why Thought Fork?

Standard LLM queries suffer from "averaging"—they give you the safest, most generic middle-ground answer. Thought Fork solves this by forcing the model to argue with itself from distinct angles before concluding.

Features:

🧠 Dynamic Stance Selection: The engine analyzes your prompt and automatically invents the perfect reasoning personas (e.g., a pragmatic-startup-builder, a long-term-code-quality-advocate, and a performance-focused-engineer).
⚡ Parallel Execution: Forks are resolved concurrently using asyncio.
🧬 Attributed Convergence (Synthesis): The final output weaves the parallel thoughts together, explicitly crediting the source of each idea and resolving contradictions.
📡 Server-Sent Events (SSE) Streaming: Fully built-in SSE support for real-time frontend streaming of parallel forks.

📦 Installation

pip install thought-fork

Requirements

Python 3.10+
openai (used as the universal async client)
pydantic

🔑 Configuration (Bring Your Own API)

Thought Fork is provider-agnostic. Because it relies on the universal AsyncOpenAI client, you can use OpenAI, Anthropic (via OpenRouter), Gemini, or Local Models (like Ollama).

By default, the library expects your API key in the environment variables:

export OPENROUTER_API_KEY="your-key-here"
# or
export OPENAI_API_KEY="your-key-here"

You can also pass the configuration explicitly in code using ForkConfig:

from thought_fork import ForkConfig
from openai import AsyncOpenAI

# 1. OpenAI Directly
config = ForkConfig(
    api_key="sk-proj-...",
    api_base_url="https://api.openai.com/v1",
    fork_model="gpt-4o-mini",
    synthesis_model="gpt-4o",
    stance_selector_model="gpt-4o-mini"
)

# 2. Google Gemini (via OpenAI compatibility)
config = ForkConfig(
    api_key="your-google-api-key",
    api_base_url="https://generativelanguage.googleapis.com/v1beta/openai",
    fork_model="gemini-2.0-flash",
    synthesis_model="gemini-2.0-pro-exp",
    stance_selector_model="gemini-2.0-flash"
)

# 3. Local Models (Ollama / vLLM)
# API key is automatically bypassed for localhost URLs
config = ForkConfig(
    api_base_url="http://localhost:11434/v1",
    fork_model="llama3",
    synthesis_model="llama3",
    stance_selector_model="llama3"
)

# 4. Anthropic / Any Provider via OpenRouter (Default)
config = ForkConfig(
    api_key="sk-or-v1-...",
    api_base_url="https://openrouter.ai/api/v1",
    fork_model="anthropic/claude-haiku-4.5",
    synthesis_model="anthropic/claude-sonnet-4-6",
    stance_selector_model="anthropic/claude-haiku-4.5"
)

# 5. Enterprise BYOC (Bring Your Own Client)
# If you have custom proxies, headers, or httpx settings:
my_client = AsyncOpenAI(api_key="...", default_headers={"X-Custom": "123"})
config = ForkConfig(client=my_client)

Production Resilience

Thought Fork v0.6+ is designed for production reliability:

Concurrency Limiting: max_concurrent_forks=5 protects against 429 rate limits.
Automatic Retries: max_retries=2 silently handles 502 Bad Gateway errors with exponential backoff.
Graceful Degradation: If a fork completely fails, synthesis continues with the remaining successful forks.

💻 Quickstart (Python SDK)

The easiest way to use Thought Fork is via the synthesize() helper function.

import asyncio
from thought_fork import synthesize, ForkConfig

async def main():
    config = ForkConfig(api_key="your-api-key")
    prompt = "Should I build a monolith or microservices for my new startup?"
    
    # This automatically invents 3 custom stances, runs them in parallel, 
    # and synthesizes the final answer.
    result = await synthesize(prompt, fork_count=3, config=config)
    
    print("--- Synthesis ---")
    print(result.synthesis)
    
    print("\n--- Individual Forks ---")
    for detail in result.fork_details:
        print(f"Fork {detail['id']} ({detail['stance']}): {detail['token_count']} tokens")
    
    print(f"\nTotal tokens: {result.token_usage['total']}")

if __name__ == "__main__":
    asyncio.run(main())

Advanced Usage (Manual Forks)

If you want to manually specify the stances instead of letting the AI invent them dynamically:

from thought_fork import synthesize

stances = ["cautious", "creative", "pragmatic"]
result = await synthesize(prompt, stances=stances)

Or you can use the ForkManager directly to build your own custom pipeline (see the examples/ directory in the repository).

🌐 The API Engine (FastAPI + SSE)

Thought Fork isn't just a Python library; it ships with a production-ready FastAPI streaming engine.

To start the API server locally:

uvicorn api.main:app --port 8000

The server exposes a POST /fork endpoint that streams Server-Sent Events (SSE).

Event Flow:

stances_selected — Emitted first with the AI's chosen personas for the prompt.
fork_start — Emitted when a specific reasoning path begins.
fork_chunk — Real-time tokens streamed from parallel forks (interleaved).
fork_done — Emitted when a fork finishes.
synthesis_chunk — Real-time tokens for the final convergence.
synthesis_done — Final event with token usage, timing metrics, and a session_id.

(See api/streaming.py for the full implementation.)

"A tool is only as good as the perspectives it considers."

🔀 Thought Fork (بالعربي)

فرّع تفكير الذكاء الاصطناعي حقك زي ما تفرّع فروع Git.

Thought Fork هو فريم وورك (إطار عمل) بالبايثون يقدم فكرة التجميع المنسوب (Attributed Convergence) للذكاء الاصطناعي. بدال ما تطلب من الذكاء الاصطناعي يعطيك إجابة وحدة عامة، النظام هذا يبتكر بشكل ديناميكي $N$ وجهات نظر مختلفة (زوايا تفكير) مفصلة على مقاس سؤالك بالضبط، ويقسم الذكاء الاصطناعي لمسارات تفكير متوازية (تشتغل مع بعض)، وبعدين يدمجها كلها في إجابة نهائية وحدة ومرتبة.

النتيجة؟ الذكاء الاصطناعي يوضح لك صراحةً أي زاوية تفكير جابت أي فكرة، وهالشي يعطيك شفافية وعمق مستحيل تشوفه بالوضع العادي.

🚀 ليه تستخدم Thought Fork؟

الأسئلة العادية للنماذج اللغوية (LLMs) تعاني من مشكلة "أخذ المتوسط"—يعني دايماً يعطونك الإجابة الأكثر أماناً واللي تناسب الكل. هنا يجي Thought Fork يحل هالمشكلة بانه يجبر النموذج إنه يتجادل مع نفسه من زوايا مختلفة ومتباينة قبل لا يعطيك الخلاصة.

المميزات:

🧠 اختيار ديناميكي لزوايا التفكير: المحرك يحلل سؤالك ومباشرة يبتكر شخصيات تفكير مناسبة (مثلاً: مبرمج-حريص-على-الجودة، مؤسس-شركة-عملي، مهندس-مهووس-بالأداء).
⚡ تشغيل متوازي: كل الفروع تشتغل بنفس الوقت باستخدام asyncio عشان ما نضيع وقتك.
🧬 تجميع منسوب (Synthesis): الإجابة النهائية تدمج كل الأفكار المتوازية مع بعض، وتعطي الكريدت (الفضل) لصاحب الفكرة، وتحل أي تناقضات بينهم.
📡 بث حي (SSE Streaming): دعم كامل للـ Server-Sent Events عشان تقدر تبث الأفكار المتوازية لايف على الواجهة الأمامية (Frontend).

📦 التثبيت

pip install thought-fork

المتطلبات

بايثون 3.10+
مكتبة openai (نستخدمها كعميل أساسي لكل الموديلات)
مكتبة pydantic

🔑 الإعدادات (جيب الـ API حقك)

إطار Thought Fork ما يهمه وش مزود الخدمة اللي تستخدمه. لأنه يعتمد على AsyncOpenAI، تقدر تستخدم OpenAI، أو Anthropic (عن طريق OpenRouter)، أو Gemini، أو حتى النماذج المحلية (زي Ollama).

افتراضياً، المكتبة تتوقع إن مفتاح الـ API حقك موجود في متغيرات البيئة (Environment variables):

export OPENROUTER_API_KEY="your-key-here"
# أو
export OPENAI_API_KEY="your-key-here"

وتقدر طبعاً تمرر الإعدادات مباشرة في الكود باستخدام ForkConfig:

from thought_fork import ForkConfig
from openai import AsyncOpenAI

# 1. استخدام OpenAI مباشرة
config = ForkConfig(
    api_key="sk-proj-...",
    api_base_url="https://api.openai.com/v1",
    fork_model="gpt-4o-mini",
    synthesis_model="gpt-4o",
    stance_selector_model="gpt-4o-mini"
)

# 2. استخدام Google Gemini
config = ForkConfig(
    api_key="your-google-api-key",
    api_base_url="https://generativelanguage.googleapis.com/v1beta/openai",
    fork_model="gemini-2.0-flash",
    synthesis_model="gemini-2.0-pro-exp",
    stance_selector_model="gemini-2.0-flash"
)

# 3. النماذج المحلية (Ollama / vLLM)
# ما يحتاج API key للروابط المحلية (localhost)
config = ForkConfig(
    api_base_url="http://localhost:11434/v1",
    fork_model="llama3",
    synthesis_model="llama3",
    stance_selector_model="llama3"
)

# 4. Anthropic أو أي موديل عبر OpenRouter (وهذا الافتراضي)
config = ForkConfig(
    api_key="sk-or-v1-...",
    api_base_url="https://openrouter.ai/api/v1",
    fork_model="anthropic/claude-haiku-4.5",
    synthesis_model="anthropic/claude-sonnet-4-6",
    stance_selector_model="anthropic/claude-haiku-4.5"
)

# 5. للشركات المتقدمة (استخدم عميلك الخاص)
# لو عندك بروكسي أو هيدرز مخصصة:
my_client = AsyncOpenAI(api_key="...", default_headers={"X-Custom": "123"})
config = ForkConfig(client=my_client)

الاعتمادية للإنتاج (Production Resilience)

نسخة Thought Fork v0.6+ مصممة عشان تكون قوية في بيئة الإنتاج:

تحديد التزامن (Concurrency Limiting): max_concurrent_forks=5 تحميك من حظر الـ 429 Too Many Requests.
إعادة المحاولة التلقائية: max_retries=2 تتعامل بصمت مع أخطاء 502 وترجع تحاول (Exponential backoff).
التعامل المرن مع الأخطاء: لو فشل فرع من الفروع تماماً، عملية التجميع (Synthesis) تكمل شغلها وتدمج الفروع الباقية اللي نجحت بدون ما يكرش النظام.

💻 البداية السريعة (Python SDK)

أسهل طريقة تستخدم فيها Thought Fork هي دالة synthesize().

import asyncio
from thought_fork import synthesize, ForkConfig

async def main():
    config = ForkConfig(api_key="your-api-key")
    prompt = "هل الأفضل أبني مشروعي كـ Monolith أو Microservices؟"
    
    # هالسطر بيبتكر 3 زوايا تفكير، ويشغلهم بنفس الوقت، ويعطيك الخلاصة.
    result = await synthesize(prompt, fork_count=3, config=config)
    
    print("--- الخلاصة (Synthesis) ---")
    print(result.synthesis)
    
    print("\n--- الفروع الفردية (Individual Forks) ---")
    for detail in result.fork_details:
        print(f"الفرع {detail['id']} ({detail['stance']}): استخدم {detail['token_count']} توكنز")
    
    print(f"\nإجمالي التوكنز: {result.token_usage['total']}")

if __name__ == "__main__":
    asyncio.run(main())

استخدام متقدم (حدد الزوايا بنفسك)

لو كنت تبي تحدد زوايا التفكير يدوياً بدال ما تخلي الذكاء الاصطناعي يبتكرها لك:

from thought_fork import synthesize

stances = ["cautious", "creative", "pragmatic"]
result = await synthesize(prompt, stances=stances)

أو تقدر تستخدم ForkManager مباشرة عشان تبني البايبلاين الخاص فيك (شيك على مجلد examples/ في الريبو).

🌐 محرك الـ API (استخدام FastAPI + SSE)

مشروع Thought Fork مو بس مكتبة بايثون؛ هو يجي مع محرك بث FastAPI جاهز للإنتاج.

عشان تشغل السيرفر المحلي:

uvicorn api.main:app --port 8000

السيرفر يوفر نقطة نهاية POST /fork تبث البيانات حية بنظام (SSE).

كيف يمشي التسلسل؟

stances_selected — تنرسل أول شيء وتعلمك وش الشخصيات اللي اختارها الذكاء الاصطناعي.
fork_start — تنرسل أول ما يبدأ مسار تفكير معين.
fork_chunk — نصوص حية (Tokens) تجي من الفروع المتوازية (مدموجة مع بعض).
fork_done — تنرسل لا خلص الفرع.
synthesis_chunk — نصوص حية لخلاصة الأفكار.
synthesis_done — آخر حدث يرسلك إحصائيات التوكنز والوقت والـ session_id.

(تقدر تشوف api/streaming.py عشان تفهم التفاصيل الكاملة.)

"قوة الأداة ما تتجاوز قوة الزوايا اللي تفكر منها."

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.6.1

Jul 3, 2026

0.6.0

Jul 3, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thought_fork-0.6.1.tar.gz (36.6 kB view details)

Uploaded Jul 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

thought_fork-0.6.1-py3-none-any.whl (33.5 kB view details)

Uploaded Jul 3, 2026 Python 3

File details

Details for the file thought_fork-0.6.1.tar.gz.

File metadata

Download URL: thought_fork-0.6.1.tar.gz
Upload date: Jul 3, 2026
Size: 36.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for thought_fork-0.6.1.tar.gz
Algorithm	Hash digest
SHA256	`f133a0a81c1bb0cb549f56a589473b9c556b2f31edae550cd469ce3a9fb6cf42`
MD5	`44b8237772f25d8ac0a7eec37dc64a77`
BLAKE2b-256	`4e51b86fa580607772509b529d4f6af855867d73688d92fae8fd9f84ae33d527`

See more details on using hashes here.

File details

Details for the file thought_fork-0.6.1-py3-none-any.whl.

File metadata

Download URL: thought_fork-0.6.1-py3-none-any.whl
Upload date: Jul 3, 2026
Size: 33.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for thought_fork-0.6.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9912ca71428cb5da48f185d9798bb4d4940c19f2854d4e7590c51fcb54b35945`
MD5	`7ab2152a9385c58f6fd0d5f1501f2f64`
BLAKE2b-256	`4e1daaaba2dc750093ad3c71d29967cbd0ec7c19f32a9f57f8f6aea3b3056673`

See more details on using hashes here.

thought-fork 0.6.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🔀 Thought Fork

🚀 Why Thought Fork?

📦 Installation

Requirements

🔑 Configuration (Bring Your Own API)

Production Resilience

💻 Quickstart (Python SDK)

Advanced Usage (Manual Forks)

🌐 The API Engine (FastAPI + SSE)

🔀 Thought Fork (بالعربي)

🚀 ليه تستخدم Thought Fork؟

📦 التثبيت

المتطلبات

🔑 الإعدادات (جيب الـ API حقك)

الاعتمادية للإنتاج (Production Resilience)

💻 البداية السريعة (Python SDK)

استخدام متقدم (حدد الزوايا بنفسك)

🌐 محرك الـ API (استخدام FastAPI + SSE)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes