Lightweight OpenAI API server on top of Llama local models. Suitable for combined use with Microsoft AutoGen.
Project description
Llama_CPP OpenAI API Server Project Overview
Introduction
The llama_cpp_openai
module provides a lightweight implementation of an OpenAI API server on top of
Llama CPP models. This implementation is particularly designed for use with Microsoft AutoGen and includes support for function calls. The project is structured around the llama_cpp_python
module and is aimed at facilitating the integration of AI models in applications using OpenAI clients or API.
Project Structure
The project is organized into several key directories and files:
-
llama_cpp_openai: Contains the core implementation of the API server.
__init__.py
: Initialization file for the module._api_server.py
: Defines the OpenAPI server, using FastAPI for handling requests._llama_cpp_functions_chat_handler.py
: Implements thellama-2-functionary
chat handler that supports function calling.
-
examples: Provides example scripts demonstrating the usage of the API server.
README.md
: Overview and description of example scripts.autogen_basic.py
: Basic integration of AutoGen with Llama_CPP using the OpenAI API server.autogen_functions.py
: Sets up an AutoGen chatbot with function calls capabilities.basic.py
: Demonstrates the setup and start of an API server using the Llama library.
Key Features
- FastAPI Integration: Utilizes FastAPI for efficient and easy-to-use API endpoints.
- Llama Library Usage: Leverages the Llama library for handling AI model interactions.
- Function Call Support: Includes capabilities for function calls in chatbot environments.
- Examples for Quick Start: Provides example scripts for easy understanding and implementation.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for llama_cpp_openai-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 52beb6e5e0ccbca99f54a329ca1d5fad1ca66a225a7123fb73cbfa502f10b0c6 |
|
MD5 | 2f7fe17caf0b27d5672f83359f88266c |
|
BLAKE2b-256 | adc18e320c49fe8b3236ff01347bc6aad915bbebd5731e36570e6f56e3fb4432 |