No project description provided
Project description
Imitater
A unified language model server built upon vllm and infinity.
Usage
Install
pip install -e .
Launch Server
imitater -c config/example.yaml
Show configuration instruction.
Add an OpenAI model
- name: OpenAI model name
- token: OpenAI token
Add a chat model
- name: Display name
- path: Model name on hub or local model path
- device: Device IDs
- port: Port ID
- maxlen: Maximum model length (optional)
- agent_type: Agent type (optional) {react, aligned}
- template: Template jinja file (optional)
- gen_config: Generation config folder (optional)
Add an embedding model
- name: Display name
- path: Model name on hub or local model path
- device: Device IDs (does not support multi-gpus)
- port: Port ID
- batch_size: Batch size (optional)
[!NOTE] Chat template is required for the chat models.
Use
export USE_MODELSCOPE_HUB=1
to download model from modelscope.
Test Server
python tests/test_openai.py -c config/example.yaml
Roadmap
- Response choices.
- Rerank model support.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
imitater-0.2.4.tar.gz
(20.5 kB
view details)
Built Distribution
imitater-0.2.4-py3-none-any.whl
(25.2 kB
view details)
File details
Details for the file imitater-0.2.4.tar.gz
.
File metadata
- Download URL: imitater-0.2.4.tar.gz
- Upload date:
- Size: 20.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1639829832dabe14196d6f653de4208c67767c7d2e29a4d21d05e8cec160f389 |
|
MD5 | 783295942eca2f2dc0435fed4df8c5f1 |
|
BLAKE2b-256 | 08bbe381e84631c56f2ad21609ec50995daf9796590eae6896c94e8521375843 |
File details
Details for the file imitater-0.2.4-py3-none-any.whl
.
File metadata
- Download URL: imitater-0.2.4-py3-none-any.whl
- Upload date:
- Size: 25.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 34012bf16e724955bda91f78f7ec68303d7b6a236f3f0d9e81ed5cf7c9b64f91 |
|
MD5 | 5b30948267f0754af77df1c3a186366a |
|
BLAKE2b-256 | fb9b492bb87c063dd65d27ebd2df0196023702a011c851d1db191131422b3c4d |