FastMindAPI
An easy-to-use, high-performance(?) backend for serving LLMs and other AI models, built on FastAPI.
🚀 Quick Start
Install
pip install fastmindapi
Use
Run the server
```shell
# in Shell
fastmindapi-server --port 8000
```

```python
# in Python
import fastmindapi as FM

server = FM.Server()
server.run()
```
Access via client / HTTP requests

```shell
curl http://IP:PORT/docs#/
```

```python
import fastmindapi as FM

client = FM.Client(IP="x.x.x.x", PORT=xxx)  # 127.0.0.1:8000 by default
client.add_model_info_list(model_info_list)
client.load_model(model_name)
client.generate(model_name, generation_request)
```
🪧 We primarily maintain the backend server; the client is provided for reference only. The main usage is through sending HTTP requests. (We might release FM-GUI in the future.)
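Since the main usage is plain HTTP requests, here is a minimal sketch of assembling a generation request. The route (`/model/generate/{model_name}`) and payload fields (`input_text`, `max_new_tokens`) are illustrative assumptions, not FastMindAPI's documented schema; check the server's `/docs#/` page for the real one.

```python
import json

# Hypothetical request builder: the endpoint path and payload field names
# below are assumptions for illustration, not the documented API schema.
def build_generation_request(base_url: str, model_name: str, prompt: str,
                             max_new_tokens: int = 128):
    """Return the URL and JSON body for a generation call."""
    url = f"{base_url}/model/generate/{model_name}"  # assumed route
    body = json.dumps({
        "input_text": prompt,              # assumed field name
        "max_new_tokens": max_new_tokens,  # assumed field name
    })
    return url, body

url, body = build_generation_request("http://127.0.0.1:8000",
                                     "my-model", "Hello")
print(url)   # the endpoint to POST the JSON body to
```

The returned pair can then be sent with any HTTP client, e.g. `requests.post(url, data=body, headers={"Content-Type": "application/json"})`.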
✨ Features
Model: Support models with various backends

- TransformersCausalLM (AutoModelForCausalLM)
- PeftCausalLM (PeftModelForCausalLM)
- LlamacppLM (Llama)
- ...
Modules: More than just chatting with models
- Function Calling (extra tools in Python)
- Retrieval
- Agent
- ...
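The Function Calling module exposes extra Python tools to the model. As a hedged illustration of the general pattern (not FastMindAPI's actual interface), a tiny registry can dispatch a model-emitted call by name:

```python
import json

# Minimal tool-calling sketch (illustrative only; FastMindAPI's real
# module may differ). Tools are plain Python functions registered by name.
TOOLS = {}

def tool(fn):
    """Register a function so a model can call it by name."""
    TOOLS[fn.__name__] = fn
    return fn

@tool
def add(a: int, b: int) -> int:
    """Add two integers."""
    return a + b

def dispatch(call_json: str):
    """Execute a model-emitted call like {"name": ..., "arguments": {...}}."""
    call = json.loads(call_json)
    return TOOLS[call["name"]](**call["arguments"])

result = dispatch('{"name": "add", "arguments": {"a": 2, "b": 3}}')
print(result)  # → 5
```

The server-side module would parse the model's output into such a call object and return the tool's result back into the conversation.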
Flexibility: Easy to Use & Highly Customizable

- Load models at coding time or at runtime
- Add any APIs you want
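Loading a model at runtime boils down to lazy loading: register cheap model info up front and materialize the weights only on first use. A minimal sketch of that idea (illustrative names, not FastMindAPI's internals):

```python
# Lazy model registry sketch (illustrative; not FastMindAPI's internals).
# Models are registered cheaply up front and loaded on first access.
class ModelRegistry:
    def __init__(self):
        self._info = {}    # name -> loader callable
        self._loaded = {}  # name -> loaded model object

    def register(self, name, loader):
        """Record how to build the model without loading it yet."""
        self._info[name] = loader

    def get(self, name):
        """Load on first use, then serve the cached instance."""
        if name not in self._loaded:
            self._loaded[name] = self._info[name]()
        return self._loaded[name]

registry = ModelRegistry()
registry.register("demo", lambda: "<fake model object>")  # stand-in loader
assert "demo" not in registry._loaded   # nothing loaded yet
model = registry.get("demo")            # loading happens here, at runtime
print(model)
```

This is why `client.add_model_info_list(...)` and `client.load_model(...)` are separate steps: registration is cheap, loading is deferred until requested.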