Free multi-provider AI SDK — drop-in OpenAI & Anthropic compatibility with 27 free providers

AI Synapse — Free Multi-Provider AI SDK

Stop paying for AI inference. Replace your OpenAI import with "from ai_engine import OpenAI" and requests are routed through 27+ free providers with automatic failover and intelligent routing.


Free Providers

No API Key Required (Truly Free)

Provider      Model             Endpoint
Pollinations  openai            text.pollinations.ai/openai
Hermes        Hermes-3-Llama    hermes.ai.unturf.com/v1
G4F Groq      llama-3.3-70b     g4f.space/api/groq
G4F Gemini    gemini-2.5-flash  g4f.space/api/gemini
G4F NVIDIA    nemotron-3        g4f.space/api/nvidia
OpenCode Zen  north-mini-code   opencode.ai/zen/v1

Free Tier APIs (Signup Required)

Provider     Free Tier               Signup
Groq         30 RPM, 14,400 RPD      console.groq.com
OpenRouter   23 free models          openrouter.ai
Gemini       5-30 RPM                aistudio.google.com
NVIDIA       40 RPM                  build.nvidia.com
Cerebras     30 RPM                  cloud.cerebras.ai
Cloudflare   10K neurons/day         dash.cloudflare.com
GitHub       Varies                  github.com/marketplace
Vercel       $5/month free           vercel.com
Cohere       20 RPM, 1K/month        cohere.com
Mistral      1 RPS, 500K tokens/min  console.mistral.ai
HuggingFace  $0.10/month             huggingface.co
Kilo         Auto free routing       app.kilo.ai

RPM, RPD, and RPS are requests per minute, per day, and per second.
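
Per-minute and per-day limits interact in ways that are easy to misjudge. As a rough sketch using the Groq figures from the table (30 RPM, 14,400 RPD), the helper below works out the minimum spacing between requests and how long the daily quota lasts at the full per-minute rate. The function name quota_summary is hypothetical and is not part of AI Synapse.

```python
def quota_summary(rpm: int, rpd: int) -> dict:
    """Summarize what a requests-per-minute / requests-per-day pair allows.

    Hypothetical helper for illustration; not part of AI Synapse.
    """
    return {
        # Minimum gap between requests to stay under the per-minute cap.
        "min_interval_s": 60 / rpm,
        # Average rate that spreads the daily quota evenly over 24 hours.
        "sustainable_rpm": rpd / (24 * 60),
        # Hours until the daily quota is used up when sending at the full RPM.
        "hours_at_full_rate": rpd / rpm / 60,
    }


groq = quota_summary(rpm=30, rpd=14_400)
print(groq)
# {'min_interval_s': 2.0, 'sustainable_rpm': 10.0, 'hours_at_full_rate': 8.0}
```

In other words, a client that sends one request every 2 seconds around the clock would exhaust the daily quota after 8 hours, while 10 requests per minute can be sustained all day.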

Custom Providers (Your Keys)

Provider  Models   Signup
hcnsec    Various  api.hcnsec.cn
LLM7      Various  llm7.io
PaxSenix  Various  api.paxsenix.org

Self-Hosted Options

Provider  Setup                                          Models
GPT4Free  docker run -p 8080:8080 hlohaus789/g4f         GPT-4o, Claude, Gemini
Ollama    curl -fsSL https://ollama.com/install.sh | sh  Llama, Mistral, etc.

Quick Start

Option 1: Free Tier APIs (Recommended)

# 1. Copy environment template
cp .env.example .env

# 2. Get free API keys (signup links are in the Free Tier APIs table above)
# 3. Add keys to .env file
# 4. Start server
python server.py

Option 2: Self-Hosted (No API Keys Needed)

# Start g4f server
docker run -d -p 8080:8080 hlohaus789/g4f

# Start AI Engine
python server.py

For Developers

OpenAI SDK Compatible

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="dummy"  # Not needed for free providers
)

# Chat completion
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)
print(response.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True
)
for chunk in stream:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
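
When the full text is needed after streaming (for logging or caching, say), the deltas can be accumulated instead of printed. The sketch below assumes chunks shaped like those in the streaming loop above; collect_stream and fake_chunk are hypothetical helpers, and the fake chunks stand in for a live server so the example runs on its own.

```python
from types import SimpleNamespace


def collect_stream(stream) -> str:
    """Join the text deltas of a chat-completion stream into one string."""
    parts = []
    for chunk in stream:
        # Some chunks may carry no choices at all; skip them.
        if not chunk.choices:
            continue
        text = chunk.choices[0].delta.content
        # The delta content can be None or empty (for example on the last chunk).
        if text:
            parts.append(text)
    return "".join(parts)


def fake_chunk(text):
    """Build a stand-in object shaped like a streamed chunk (demo only)."""
    delta = SimpleNamespace(content=text)
    return SimpleNamespace(choices=[SimpleNamespace(delta=delta)])


demo = [fake_chunk("Once "), fake_chunk(None), fake_chunk("upon a time")]
story = collect_stream(demo)
print(story)  # Once upon a time
```

With a real client, the stream object returned by client.chat.completions.create(..., stream=True) would be passed to collect_stream in place of demo.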

cURL

curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-4", "messages": [{"role": "user", "content": "Hello!"}]}'
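
The same request can be made from Python without installing the OpenAI SDK. This is a minimal sketch using only the standard library; build_chat_request is a hypothetical helper, and the response shape in the commented-out part assumes the server returns the usual OpenAI-compatible JSON.

```python
import json
import urllib.request


def build_chat_request(prompt: str, model: str = "gpt-4",
                       base_url: str = "http://localhost:8000/v1") -> urllib.request.Request:
    """Build the same POST request as the cURL example (hypothetical helper)."""
    payload = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_chat_request("Hello!")
print(request.get_method(), request.full_url)

# Sending it requires the server to be running:
# with urllib.request.urlopen(request, timeout=30) as resp:
#     body = json.load(resp)
#     print(body["choices"][0]["message"]["content"])
```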

JavaScript

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: 'http://localhost:8000/v1',
    apiKey: 'dummy'
});

const response = await client.chat.completions.create({
    model: 'gpt-4',
    messages: [{ role: 'user', content: 'Hello!' }]
});
console.log(response.choices[0].message.content);

Features

Core

  • OpenAI-compatible API with streaming
  • Automatic provider failover
  • Intelligent key rotation
  • Response caching with TTL
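
To illustrate what automatic provider failover means in practice, here is a minimal sketch of the general technique: try each provider in order and return the first successful reply. It is not AI Synapse's actual implementation; call_with_failover, AllProvidersFailed, and the two demo providers are hypothetical names.

```python
from typing import Callable, Sequence


class AllProvidersFailed(Exception):
    """Raised when every provider in the chain has errored."""


def call_with_failover(providers: Sequence[tuple[str, Callable[[str], str]]],
                       prompt: str) -> tuple[str, str]:
    """Return (provider_name, reply) from the first provider that succeeds."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real router would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


def flaky(prompt: str) -> str:
    """Demo provider that always fails."""
    raise TimeoutError("upstream timed out")


def healthy(prompt: str) -> str:
    """Demo provider that always answers."""
    return f"echo: {prompt}"


used, reply = call_with_failover([("primary", flaky), ("backup", healthy)], "Hello!")
print(used, reply)  # backup echo: Hello!
```

Collecting the per-provider errors before raising keeps the final exception useful for debugging when the whole chain is down.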

Intelligence

  • Task-based model selection
  • Cost optimization
  • Latency tracking
  • A/B testing
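
Latency tracking can feed routing decisions. One common approach, sketched here as an assumption rather than a description of AI Synapse's internals, is to keep an exponential moving average of response time per provider and prefer the lowest. LatencyTracker and the provider names are hypothetical.

```python
class LatencyTracker:
    """Track an exponential moving average of latency per provider."""

    def __init__(self, alpha: float = 0.5):
        # alpha is the weight given to the newest sample (0 < alpha <= 1).
        self.alpha = alpha
        self.averages: dict[str, float] = {}

    def record(self, provider: str, seconds: float) -> None:
        """Fold a new latency sample into the provider's running average."""
        previous = self.averages.get(provider)
        if previous is None:
            self.averages[provider] = seconds
        else:
            self.averages[provider] = self.alpha * seconds + (1 - self.alpha) * previous

    def fastest(self) -> str:
        """Return the provider with the lowest average (needs at least one sample)."""
        return min(self.averages, key=self.averages.get)


tracker = LatencyTracker(alpha=0.5)
tracker.record("provider_a", 1.0)
tracker.record("provider_a", 3.0)  # average becomes 0.5 * 3.0 + 0.5 * 1.0 = 2.0
tracker.record("provider_b", 1.5)
print(tracker.fastest())  # provider_b
```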

Enterprise

  • Multi-tenancy with quotas
  • RBAC (Admin/User/Viewer)
  • Audit logging
  • Billing tracking

Platform

  • Plugin system
  • Workflow engine
  • CLI tool
  • Docker deployment

API Endpoints

Endpoint                 Method    Description
/v1/chat/completions     POST      Chat completions (OpenAI-compatible, supports stream: true)
/v1/models               GET/POST  List all models
/api/providers           GET       List providers
/api/health/{name}/ping  POST      Live health ping for a provider
/api/capabilities        GET       Provider/model capabilities (vision, etc.)
/api/status              GET       Engine status
/api/statistics          GET       Usage statistics
/health                  GET       Health check
/metrics                 GET       Prometheus metrics
/docs                    GET       Swagger UI (interactive API explorer)
/redoc                   GET       ReDoc API documentation
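
The /api/health/{name}/ping route takes the provider name as a path segment. The sketch below shows one way to build that URL safely; ping_url is a hypothetical helper, and the provider identifiers used here are guesses, since the exact names the server expects are not listed in this document.

```python
from urllib.parse import quote

BASE_URL = "http://localhost:8000"  # same host and port as the Quick Start examples


def ping_url(provider_name: str, base_url: str = BASE_URL) -> str:
    """Fill the {name} slot of /api/health/{name}/ping, escaping unsafe characters."""
    return f"{base_url}/api/health/{quote(provider_name, safe='')}/ping"


print(ping_url("groq"))      # http://localhost:8000/api/health/groq/ping
print(ping_url("g4f groq"))  # http://localhost:8000/api/health/g4f%20groq/ping
```

Per the table, the resulting URL should be requested with POST, not GET.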

Rate Limit Headers

All API responses include:

  • X-RateLimit-Limit — Max requests per minute
  • X-RateLimit-Remaining — Remaining requests in current window
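
A client can read these two headers to slow down before it hits the limit. The following sketch parses them from any mapping of response headers; remaining_fraction is a hypothetical helper and the sample values (60 and 15) are made up for the demonstration.

```python
from typing import Mapping, Optional


def remaining_fraction(headers: Mapping[str, str]) -> Optional[float]:
    """Return remaining/limit from the rate-limit headers, or None if unusable."""
    # HTTP header names are case-insensitive, so normalize before lookup.
    lowered = {key.lower(): value for key, value in headers.items()}
    try:
        limit = int(lowered["x-ratelimit-limit"])
        remaining = int(lowered["x-ratelimit-remaining"])
    except (KeyError, ValueError):
        return None
    if limit <= 0:
        return None
    return remaining / limit


sample = {"X-RateLimit-Limit": "60", "X-RateLimit-Remaining": "15"}
fraction = remaining_fraction(sample)
print(fraction)  # 0.25
```

A caller might, for example, start pausing between requests once the fraction drops below some threshold of its choosing.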

Contributing

See CONTRIBUTING.md for guidelines.


License

MIT License - See LICENSE
