Skip to main content

Free multi-provider AI SDK — drop-in OpenAI & Anthropic compatibility with 27 free providers

Project description

AI Synapse — Free Multi-Provider AI SDK

Tests Providers Python License: MIT OpenAI Compatible Docker

Stop paying for AI inference. Drop-in from ai_engine import OpenAI — routes through 27+ free providers with automatic failover and intelligent routing.


Free Providers

No API Key Required (Truly Free)

Provider Model Endpoint
Pollinations openai text.pollinations.ai/openai
Hermes Hermes-3-Llama hermes.ai.unturf.com/v1
G4F Groq llama-3.3-70b g4f.space/api/groq
G4F Gemini gemini-2.5-flash g4f.space/api/gemini
G4F NVIDIA nemotron-3 g4f.space/api/nvidia
OpenCode Zen north-mini-code opencode.ai/zen/v1

Free Tier APIs (Signup Required)

Provider Free Tier Signup
Groq 30 RPM, 14,400 RPD console.groq.com
OpenRouter 23 free models openrouter.ai
Gemini 5-30 RPM aistudio.google.com
NVIDIA 40 RPM build.nvidia.com
Cerebras 30 RPM cloud.cerebras.ai
Cloudflare 10K neurons/day dash.cloudflare.com
GitHub Varies github.com/marketplace
Vercel $5/month free vercel.com
Cohere 20 RPM, 1K/month cohere.com
Mistral 1 RPS, 500K tokens/min console.mistral.ai
HuggingFace $0.10/month huggingface.co
Kilo Auto free routing app.kilo.ai

Custom Providers (Your Keys)

Provider Models Signup
hcnsec Various api.hcnsec.cn
LLM7 Various llm7.io
PaxSenix Various api.paxsenix.org

Self-Hosted Options

Provider Setup Models
GPT4Free docker run -p 8080:8080 hlohaus789/g4f GPT-4o, Claude, Gemini
Ollama curl -fsSL https://ollama.com/install.sh | sh Llama, Mistral, etc.

Quick Start

Option 1: Free Tier APIs (Recommended)

# 1. Copy environment template
cp .env.example .env

# 2. Get free API keys (see guide below)
# 3. Add keys to .env file
# 4. Start server
python server.py

Option 2: Self-Hosted (No API Keys Needed)

# Start g4f server
docker run -d -p 8080:8080 hlohaus789/g4f

# Start AI Engine
python server.py

For Developers

OpenAI SDK Compatible

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy"  # Not needed for free providers
)

# Chat completion
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True
)
for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

cURL

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4", "messages": [{"role": "user", "content": "Hello!"}]}'

JavaScript

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: 'http://localhost:8000/v1',
    apiKey: 'dummy'
});

const response = await client.chat.completions.create({
    model: 'gpt-4',
    messages: [{ role: 'user', content: 'Hello!' }]
});

Features

Core

  • OpenAI-compatible API with streaming
  • Automatic provider failover
  • Intelligent key rotation
  • Response caching with TTL

Intelligence

  • Task-based model selection
  • Cost optimization
  • Latency tracking
  • A/B testing

Enterprise

  • Multi-tenancy with quotas
  • RBAC (Admin/User/Viewer)
  • Audit logging
  • Billing tracking

Platform

  • Plugin system
  • Workflow engine
  • CLI tool
  • Docker deployment

API Endpoints

Endpoint Method Description
/v1/chat/completions POST Chat completions (OpenAI-compatible, supports stream: true)
/v1/models GET/POST List all models
/api/providers GET List providers
/api/health/{name}/ping POST Live health ping for a provider
/api/capabilities GET Provider/model capabilities (vision, etc.)
/api/status GET Engine status
/api/statistics GET Usage statistics
/health GET Health check
/metrics GET Prometheus metrics
/docs GET Swagger UI (interactive API explorer)
/redoc GET ReDoc API documentation

Rate Limit Headers

All API responses include:

  • X-RateLimit-Limit — Max requests per minute
  • X-RateLimit-Remaining — Remaining requests in current window

Documentation


Contributing

See CONTRIBUTING.md for guidelines.


License

MIT License - See LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_synapse-4.1.5.tar.gz (229.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ai_synapse-4.1.5-py3-none-any.whl (219.7 kB view details)

Uploaded Python 3

File details

Details for the file ai_synapse-4.1.5.tar.gz.

File metadata

  • Download URL: ai_synapse-4.1.5.tar.gz
  • Upload date:
  • Size: 229.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ai_synapse-4.1.5.tar.gz
Algorithm Hash digest
SHA256 95dfb80e2ab08abbe3ba3cd06e7fbe724ef9ffc11a707eef5e9c7593a364e4d8
MD5 86794d4bb7131757b9630b3931eef138
BLAKE2b-256 643789c45ff48ae9dc98a022baf8c9507c0d22ff7590af91b4c7e9efc5335b50

See more details on using hashes here.

File details

Details for the file ai_synapse-4.1.5-py3-none-any.whl.

File metadata

  • Download URL: ai_synapse-4.1.5-py3-none-any.whl
  • Upload date:
  • Size: 219.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ai_synapse-4.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 3606932f305a8c4ba8645817081090dfba7ae88aaf166f09c6146076f7b488f4
MD5 1f2d1e6b73ec9c2591ce67aa9593bdc6
BLAKE2b-256 50eac2ec0ab0b9622a61bc3a27853d5099bb77f5e51b0a9d09da10a57fb2e0d3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page