llama-api-server
This project is under active development. Breaking changes may be made at any time.
Llama as a Service! This project tries to build a RESTful API server compatible with the OpenAI API, using open source backends like llama.
Tested with
- openai-python
  - OPENAI_API_TYPE=default
  - OPENAI_API_TYPE=azure
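
As a rough illustration of what the two openai-python modes mean for a compatible server, the sketch below builds the completions URL a client would request in each mode. The `<api_base>/completions` form and the Azure-style `/openai/deployments/<name>/completions?api-version=...` form follow the public OpenAI and Azure OpenAI conventions. The helper name `completion_url`, the host, the port, the deployment name, and the API version are placeholders for illustration, not values taken from this project.

```python
from urllib.parse import urlencode


def completion_url(api_base, api_type="default", deployment=None, api_version=None):
    """Return the completions URL an OpenAI-style client would call.

    Hypothetical helper: it only mirrors the URL conventions of the two
    API types and does not contact any server.
    """
    base = api_base.rstrip("/")
    if api_type == "azure":
        # Azure-style routing addresses a named deployment and requires
        # an api-version query parameter.
        if not deployment or not api_version:
            raise ValueError("azure mode needs a deployment name and an api_version")
        query = urlencode({"api-version": api_version})
        return f"{base}/openai/deployments/{deployment}/completions?{query}"
    if api_type == "default":
        # Default routing appends the resource name to the API base,
        # which conventionally already ends in /v1.
        return f"{base}/completions"
    raise ValueError(f"unsupported api_type: {api_type}")
```

For example, `completion_url("http://127.0.0.1:5000/v1")` gives the default-style URL, while passing `api_type="azure"` with a deployment name and an API version gives the Azure-style one.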
Supported APIs
- Completions
  - set temperature, top_p, and top_k
  - set max_tokens
  - set stop
  - set stream
  - set n
  - set presence_penalty and frequency_penalty
  - set logit_bias
  - set
- Embeddings
  - batch process
- Chat
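
To make the parameters above concrete, here is a minimal sketch of the JSON bodies a client might send for a completions request and for a batched embeddings request. The field names `model`, `prompt`, and `input` follow the OpenAI API shape; `top_k` is not part of the OpenAI API and is included only because the list above mentions it. The helper names and the model name used in the usage note are hypothetical, and which fields this server actually accepts is an assumption.

```python
import json


def completion_body(model, prompt, temperature=None, top_p=None, top_k=None, max_tokens=None):
    """Build a JSON body for a completions request, omitting unset options."""
    options = {
        "temperature": temperature,
        "top_p": top_p,
        "top_k": top_k,  # not an OpenAI field; assumed server-side extension
        "max_tokens": max_tokens,
    }
    body = {"model": model, "prompt": prompt}
    # Only send options the caller set, so server defaults apply otherwise.
    body.update({key: value for key, value in options.items() if value is not None})
    return json.dumps(body)


def embedding_body(model, texts):
    """Build a JSON body for an embeddings request.

    A single string is wrapped in a list so that one text and a batch of
    texts produce the same request shape.
    """
    if isinstance(texts, str):
        texts = [texts]
    return json.dumps({"model": model, "input": list(texts)})
```

A call such as `completion_body("my-model", "Hello", temperature=0.2, max_tokens=16)` produces a body with only those two options set, and `embedding_body("my-model", ["first text", "second text"])` sends both texts in one request.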
Supported backends
Others
- Documents
- Token auth
- Integration tests
- Performance parameters like n_batch and n_thread
- A tool to download/prepare pretrained models
Get started