统一的LLM调用库，支持多种LLM提供商（OpenAI、DeepSeek、通义千问、智谱GLM、百度文心、MiniCPM、Kimi、Gemini、Claude、Ollama、llama.cpp）

These details have not been verified by PyPI

Project links

Project description

LLM Hub - Unified LLM Calling Library

A unified calling library supporting multiple LLM providers (International + Domestic Chinese AI), providing consistent API interfaces, command-line tools, and graphical interface.

✨ Features

🔌 Unified API - One interface for all LLM providers
🌍 Multi-Provider Support - OpenAI, DeepSeek, Kimi, Gemini, Claude, and more
🇨🇳 Domestic Chinese AI - Qwen (Tongyi Qianwen), GLM (Zhipu), ERNIE (Baidu), MiniCPM
💻 Local Deployment - Ollama, llama.cpp
🤔 Thinking Mode - Display reasoning process (DeepSeek support)
📊 Accurate Token Counting - Official tokenizers for each provider
🖥️ CLI & GUI - Command-line and graphical interfaces
📡 Streaming Output - Real-time response generation
💾 Config Management - JSON/YAML configs, profiles, environment variables

📋 Supported Providers

Provider	Type	Token Counter	Thinking Mode
OpenAI	Cloud	✅ tiktoken	❌
DeepSeek	Cloud	✅ deepseek-tokenizer	✅
Kimi (Moonshot)	Cloud	✅ API	❌
Google Gemini	Cloud	⚠️ Estimate	❌
Anthropic Claude	Cloud	⚠️ Estimate	❌
Qwen (Tongyi)	Cloud	✅ qwen-tokenizer	❌
GLM (Zhipu)	Cloud	✅ transformers/API	❌
ERNIE (Baidu)	Cloud	✅ transformers	❌
MiniCPM	Cloud/Local	✅ transformers	❌
Ollama	Local	✅ API tokenize	❌
llama.cpp	Local	✅ Subprocess	❌

🚀 Quick Start

Installation

# Basic installation
pip install gracefox-llm-hub

# Install with UI support
pip install gracefox-llm-hub[ui]

# Install with accurate token counting
pip install gracefox-llm-hub[tokenizers]

# Install all features
pip install gracefox-llm-hub[all]

Basic Usage

from llm_hub import LLMClient, LLMConfig, Provider

# Create client for DeepSeek
client = LLMClient(LLMConfig(
    provider=Provider.DEEPSEEK,
    model="deepseek-chat",
    api_key="your-api-key"
))

# Single turn chat
response = client.chat("What is artificial intelligence?")
print(response.content)

# Streaming output
for chunk in client.stream_chat("Tell me a story"):
    print(chunk, end="", flush=True)

Multi-turn Chat

from llm_hub import LLMClient, LLMConfig, Provider
from llm_hub.models.message import Conversation

client = LLMClient(LLMConfig(
    provider=Provider.OPENAI,
    model="gpt-3.5-turbo",
    api_key="your-api-key"
))

conv = Conversation()
conv.add_user("My name is Xiao Ming")
response = client.chat(conv)
conv.add_assistant(response.content)

conv.add_user("Do you remember my name?")
response = client.chat(conv)
print(response.content)  # Will remember "Xiao Ming"

Thinking Mode (DeepSeek)

from llm_hub import LLMClient, LLMConfig, Provider
from llm_hub.models.response import StreamEventType

config = LLMConfig(
    provider=Provider.DEEPSEEK,
    model="deepseek-chat",
    api_key="your-api-key",
    reasoning=True,      # Enable thinking mode
    show_thinking=True,  # Display thinking process
    stream=True
)

client = LLMClient(config)

for chunk in client.stream_chat("Solve 15 + 27 step by step"):
    if chunk.type == StreamEventType.THINKING:
        print(f"\n[Thinking] {chunk.delta}")
    elif chunk.type == StreamEventType.CONTENT:
        print(chunk.delta, end="", flush=True)

Token Counting

from llm_hub import LLMClient, LLMConfig, Provider
from llm_hub.utils.token_counter import get_token_counter

client = LLMClient(LLMConfig(
    provider=Provider.QWEN,
    model="qwen-turbo",
    api_key="your-api-key"
))

# Count tokens for text
tokens = client.count_tokens("Hello, 你好！")
print(f"Token count: {tokens}")

# Count conversation tokens
total = client.count_conversation_tokens(conversation)
print(f"Total tokens: {total}")

# Truncate by token limit
truncated = client.truncate_to_tokens(long_text, max_tokens=1000)

Domestic Chinese AI Examples

from llm_hub import LLMClient, LLMConfig, Provider

# Qwen (Tongyi Qianwen)
client = LLMClient(LLMConfig(
    provider=Provider.QWEN,
    model="qwen-turbo",
    api_key="your-dashscope-api-key"
))

# GLM (Zhipu)
client = LLMClient(LLMConfig(
    provider=Provider.GLM,
    model="glm-4-flash",
    api_key="your-zhipu-api-key"
))

# ERNIE (Baidu Wenxin)
client = LLMClient(LLMConfig(
    provider=Provider.ERNIE,
    model="ernie-3.5-8k",
    api_key="your-api-key",
    secret_key="your-secret-key"  # Required for Baidu
))

# MiniCPM (local via Ollama)
client = LLMClient(LLMConfig(
    provider=Provider.MINICPM,
    model="minicpm",
    ollama_url="http://localhost:11434"
))

🖥️ Command Line Interface

# Basic usage
llm-hub --provider deepseek --api-key YOUR_KEY "Hello"

# Use JSON config string
llm-hub --json '{"provider": "deepseek", "model": "deepseek-chat", "api_key": "xxx"}' "Hello"

# Use config file
llm-hub --config-file config.json "Hello"

# Use config profile
llm-hub --profile my-deepseek "Hello"

# Interactive mode
llm-hub --provider ollama --interactive

# Streaming output
llm-hub --provider openai --stream "Tell a story"

# List all config profiles
llm-hub --list-profiles

# Save current config as profile
llm-hub --provider deepseek --api-key xxx --save-profile my-ds

🖼️ Graphical User Interface

# Launch UI
llm-hub-ui

# Or
python run_ui.py

The UI supports:

Provider switching (International + Domestic AI)
Thinking mode display
Real-time streaming output
Token statistics
Conversation export
Config loading/saving

⚙️ Configuration

Configuration File (JSON)

{
    "provider": "deepseek",
    "model": "deepseek-chat",
    "api_key": "your-api-key",
    "temperature": 0.7,
    "max_tokens": 2000,
    "reasoning": true,
    "stream": true
}

Environment Variables

export LLM_PROVIDER=deepseek
export LLM_MODEL=deepseek-chat
export LLM_API_KEY=your-key
export LLM_TEMPERATURE=0.7

llm-hub "Hello"

Configuration Priority

Lowest to highest:

Default configuration
Environment variables
Config profile
Config file
JSON string
Command line arguments

📦 Project Structure

llm-hub/
├── llm_hub/
│   ├── core/           # Core functionality
│   ├── models/         # Data models
│   ├── providers/      # LLM provider implementations
│   ├── utils/          # Utilities (token counter, logger)
│   └── ui/             # GUI application
├── examples/           # Example code
├── tests/              # Unit tests
├── pyproject.toml      # Project configuration
└── README.md           # Documentation

🔧 Development

# Clone repository
git clone https://gitee.com/SteHub/llm-hub.git
cd llm-hub

# Install in development mode
pip install -e .[dev,all]

# Run tests
pytest

# Format code
black llm_hub/

# Run CLI
llm-hub --help

# Run UI
llm-hub-ui

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing)
Open a Pull Request

📄 License

GNU General Public License v3.0 - See LICENSE file for details

GPL-3.0 Core Requirements

Freedom to Use - Anyone can freely use this software
Source Code Availability - Source code must be provided when distributing
Same License - Modified versions must be released under the same GPL-3.0 license
Copyright Notice - Original copyright and license notices must be retained

🙏 Acknowledgements

OpenAI - GPT models
DeepSeek - DeepSeek chat models
阿里云 - Tongyi Qianwen (Qwen)
智谱AI - GLM models
百度 - ERNIE (Wenxin)
Ollama - Local model runner
llama.cpp - Local GGUF model runner

📧 Contact

Author: GraceFox
Email: 948743980@qq.com
Homepage: https://gitee.com/SteHub/llm-hub
Issues: https://gitee.com/SteHub/llm-hub/issues

Quick Reference

Common Commands

# Install
pip install gracefox-llm-hub

# Run CLI
llm-hub --provider deepseek --api-key KEY "Hello"

# Run UI
llm-hub-ui

# Test token counting
python -c "from llm_hub.utils.token_counter import get_token_counter; c = get_token_counter('openai'); print(c.count('Hello'))"

Common Providers

Provider	Provider Value	Model Example
OpenAI	`openai`	`gpt-3.5-turbo`
DeepSeek	`deepseek`	`deepseek-chat`
Kimi	`kimi`	`moonshot-v1-8k`
Qwen	`qwen`	`qwen-turbo`
GLM	`glm`	`glm-4-flash`
ERNIE	`ernie`	`ernie-3.5-8k`
Ollama	`ollama`	`llama2`

⭐ Star this project on Gitee if you find it useful!

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.2

Apr 12, 2026

0.2.0

Apr 7, 2026

This version

0.1.4

Apr 6, 2026

0.1.3

Apr 6, 2026

0.1.1

Apr 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gracefox_llm_hub-0.1.4.tar.gz (92.9 kB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gracefox_llm_hub-0.1.4-py3-none-any.whl (85.4 kB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file gracefox_llm_hub-0.1.4.tar.gz.

File metadata

Download URL: gracefox_llm_hub-0.1.4.tar.gz
Upload date: Apr 6, 2026
Size: 92.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for gracefox_llm_hub-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`32157799fd78684709c3ad5850dde0e1641abe2e80f177da4ae0d961ba5ec6a8`
MD5	`e053f11470cf68a8b483e2f80903024e`
BLAKE2b-256	`a727dce3e4cca27f54851745bb24196f67fe8bed10f81ff01c9af44c0ef64b7a`

See more details on using hashes here.

File details

Details for the file gracefox_llm_hub-0.1.4-py3-none-any.whl.

File metadata

Download URL: gracefox_llm_hub-0.1.4-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 85.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.5

File hashes

Hashes for gracefox_llm_hub-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4ee13e41a1df9b5082841b5b06768de71fdc38c12ba3384d04b4152e6f111341`
MD5	`7c6195f48a04a799fd58f6175479ca2c`
BLAKE2b-256	`692bd16da92f8b9e97a4de212847ea2f4910522d688426746637e5779fddb098`

See more details on using hashes here.

gracefox-llm-hub 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LLM Hub - Unified LLM Calling Library

✨ Features

📋 Supported Providers

🚀 Quick Start

Installation

Basic Usage

Multi-turn Chat

Thinking Mode (DeepSeek)

Token Counting

Domestic Chinese AI Examples

🖥️ Command Line Interface

🖼️ Graphical User Interface

⚙️ Configuration

Configuration File (JSON)

Environment Variables

Configuration Priority

📦 Project Structure

🔧 Development

🤝 Contributing

📄 License

GPL-3.0 Core Requirements

🙏 Acknowledgements

📧 Contact

Quick Reference

Common Commands

Common Providers

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes