Skip to main content

Free multi-provider AI SDK — drop-in OpenAI & Anthropic compatibility with 27 free providers

Project description

AI Synapse — Free Multi-Provider AI SDK

Tests Providers Python License: MIT OpenAI Compatible Docker

Stop paying for AI inference. Drop-in from ai_engine import OpenAI — routes through 27+ free providers with automatic failover and intelligent routing.


Free Providers

No API Key Required (Truly Free)

Provider Model Endpoint
Pollinations openai text.pollinations.ai/openai
Hermes Hermes-3-Llama hermes.ai.unturf.com/v1
G4F Groq llama-3.3-70b g4f.space/api/groq
G4F Gemini gemini-2.5-flash g4f.space/api/gemini
G4F NVIDIA nemotron-3 g4f.space/api/nvidia
OpenCode Zen north-mini-code opencode.ai/zen/v1

Free Tier APIs (Signup Required)

Provider Free Tier Signup
Groq 30 RPM, 14,400 RPD console.groq.com
OpenRouter 23 free models openrouter.ai
Gemini 5-30 RPM aistudio.google.com
NVIDIA 40 RPM build.nvidia.com
Cerebras 30 RPM cloud.cerebras.ai
Cloudflare 10K neurons/day dash.cloudflare.com
GitHub Varies github.com/marketplace
Vercel $5/month free vercel.com
Cohere 20 RPM, 1K/month cohere.com
Mistral 1 RPS, 500K tokens/min console.mistral.ai
HuggingFace $0.10/month huggingface.co
Kilo Auto free routing app.kilo.ai

Custom Providers (Your Keys)

Provider Models Signup
hcnsec Various api.hcnsec.cn
LLM7 Various llm7.io
PaxSenix Various api.paxsenix.org

Self-Hosted Options

Provider Setup Models
GPT4Free docker run -p 8080:8080 hlohaus789/g4f GPT-4o, Claude, Gemini
Ollama curl -fsSL https://ollama.com/install.sh | sh Llama, Mistral, etc.

Quick Start

Option 1: Free Tier APIs (Recommended)

# 1. Copy environment template
cp .env.example .env

# 2. Get free API keys (see guide below)
# 3. Add keys to .env file
# 4. Start server
python server.py

Option 2: Self-Hosted (No API Keys Needed)

# Start g4f server
docker run -d -p 8080:8080 hlohaus789/g4f

# Start AI Engine
python server.py

For Developers

OpenAI SDK Compatible

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy"  # Not needed for free providers
)

# Chat completion
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True
)
for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

cURL

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4", "messages": [{"role": "user", "content": "Hello!"}]}'

JavaScript

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: 'http://localhost:8000/v1',
    apiKey: 'dummy'
});

const response = await client.chat.completions.create({
    model: 'gpt-4',
    messages: [{ role: 'user', content: 'Hello!' }]
});

Features

Core

  • OpenAI-compatible API with streaming
  • Automatic provider failover
  • Intelligent key rotation
  • Response caching with TTL

Intelligence

  • Task-based model selection
  • Cost optimization
  • Latency tracking
  • A/B testing

Enterprise

  • Multi-tenancy with quotas
  • RBAC (Admin/User/Viewer)
  • Audit logging
  • Billing tracking

Platform

  • Plugin system
  • Workflow engine
  • CLI tool
  • Docker deployment

API Endpoints

Endpoint Method Description
/v1/chat/completions POST Chat completions (OpenAI-compatible, supports stream: true)
/v1/models GET/POST List all models
/api/providers GET List providers
/api/health/{name}/ping POST Live health ping for a provider
/api/capabilities GET Provider/model capabilities (vision, etc.)
/api/status GET Engine status
/api/statistics GET Usage statistics
/health GET Health check
/metrics GET Prometheus metrics
/docs GET Swagger UI (interactive API explorer)
/redoc GET ReDoc API documentation

Rate Limit Headers

All API responses include:

  • X-RateLimit-Limit — Max requests per minute
  • X-RateLimit-Remaining — Remaining requests in current window

Documentation


Contributing

See CONTRIBUTING.md for guidelines.


License

MIT License - See LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ai_synapse-4.2.0.tar.gz (229.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ai_synapse-4.2.0-py3-none-any.whl (219.8 kB view details)

Uploaded Python 3

File details

Details for the file ai_synapse-4.2.0.tar.gz.

File metadata

  • Download URL: ai_synapse-4.2.0.tar.gz
  • Upload date:
  • Size: 229.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ai_synapse-4.2.0.tar.gz
Algorithm Hash digest
SHA256 64c54bfaebccefba20f9b45aff54c0d69e2332ec7249886c9c3d12d0a6d9614a
MD5 89bb3b176375cf834ce9fb7255b276ec
BLAKE2b-256 94d18c33a5b8fdd26effc9c5bcf4eb7c85a3e14d1ed1b601875edb5ebf9852c2

See more details on using hashes here.

File details

Details for the file ai_synapse-4.2.0-py3-none-any.whl.

File metadata

  • Download URL: ai_synapse-4.2.0-py3-none-any.whl
  • Upload date:
  • Size: 219.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for ai_synapse-4.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a40be249bf97be364b54f399b3b2828106f28155a8c1bf8dcdec87824042cf7c
MD5 5a8911f5712ca46a9619943f4ee82f6b
BLAKE2b-256 eb4fdae58893b10c5a2960d0e6e37275708db2b96b100a0f85bffb3a20f0b9ba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page