A Python client for Monster API v2

These details have not been verified by PyPI

Project links

Homepage

Project description

Monsterapi v2

A Python client for interacting with Monster API v2 in .

Installation

pip install monsterapi

Note: For detailed documentation please visit here

Has support to following MonsterAPI services:

Beta Next Gen LLM Services

Supported Models:

1. "TinyLlama/TinyLlama-1.1B-Chat-v1.0", 
2. "microsoft/phi-2"
3. "mistralai/Mistral-7B-Instruct-v0.2" 
4. "HuggingFaceH4/zephyr-7b-beta"

Highlights:

1. 99% Cheaper 
2. Synchronous results
3. Token based Pricing.

Service Introduction: here API Reference: here Usage Guide: here

GA LLM Models Old Generation

Supported Models:

1. falcon-7b-instruct
2. mpt-7b-instruct
3. llama2-7b-chat
4. falcon-40b-instruct
5. mpt-30b-instruct
6. codellama-13b-instruct
7. zephyr-7b-beta
8. Monster Deploy LLMs (deploy-llm)

Detailed payloads of models that are supported ? here

QuickServe API: New service from monsterapi deploy popular LLM models into monsterapi compute infrastructure with one request.

a. How to use client to launch and manage a quickserve deployment ? here

Additional Information link: here

Code Documentation:

Client module code documentation can be found here

Basic Usage to access Hosted AI-Models

Import Module

from monsterapi import client

set `MONSTER_API_KEY` env variable to your API key.

os.environ["MONSTER_API_KEY"] = <your_api_key>
client = client() # Initialize client

pass `api_key` parameter to client constructor.

client = client(<api_key>) # pass api_key as parameter

Use generate method

result = client.generate(model='falcon-7b-instruct', data={
    "prompt": "Your prompt here",
    # ... other parameters
})

Quick Serve LLM

Launch a llama2-7b model using QuickServe API

Prepare and send payload to launch a LLM deployment. a. Choose Per_GPU_VRAM and GPU_Count based on your model size and batch size. Please see here for detailed list of supported model and infrastructure matrix.

launch_payload = {
    "basemodel_path": "meta-llama/Llama-2-7b-chat",
    "loramodel_path": "",
    "prompt_template": "{prompt}{completion}",
    "api_auth_token": "b6a97d3b-35d0-4720-a44c-59ee33dbc25b",
    "per_gpu_vram": 24,
    "gpu_count": 1
}

# Launch a deployment
ret = client.deploy("llm", launch_payload) 
deployment_id = ret.get("deployment_id")
print(ret)

# Get deployment status
status_ret = client.get_deployment_status(deployment_id)
print(status_ret)

logs_ret = client.get_deployment_logs(deployment_id)
print(logs_ret)

# Terminate Deployment
terminate_return = client.terminate_deployment(deployment_id)
print(terminate_return)

Run tests

Install test dependencies

pip install monsterapi[tests]

Run functional tests involving actual API key

export MONSTER_API_KEY=<your_api_key>
python3 -m pytest tests/ # Run all tests includes functional tests using actual API key

Run unit tests

export MONSTER_API_KEY="dummy"
python3 -m pytest tests/ -m "not slow" # Run only unit tests

PIP package push Instructions

pip install --upgrade setuptools wheel

python setup.py sdist bdist_wheel

pip install twine

twine upload dist/*

LLama Index CLient Usage

pip install llama_index llama-index-core llama-parse llama-index-readers-file

from monsterapi.LLamaIndexClient import MonsterLLM

model = "meta-llama/Meta-Llama-3-8B-Instruct"
llm = MonsterLLM(
        model=model, temperature=0.1, max_tokens=256
    )

see examples/llama_index_and_chainlit/ for detailed chainlit example

About us

Check us out at monsterapi.ai

Checkout our new MonsterAPI Deploy service here

Check out new no-code finetuning service here

Checkout our Monster-SD Stable Diffusion v1.5 vs XL Comparison space here

Checkout our Monster API LLM comparison space here

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.9.3

Sep 10, 2024

1.0.9.2

Aug 2, 2024

1.0.9.1

Aug 2, 2024

This version

1.0.9

Jun 21, 2024

1.0.8.post1

Jun 4, 2024

1.0.8.post0

Jun 4, 2024

1.0.8

Apr 12, 2024

1.0.7.post1

Mar 19, 2024

1.0.7

Mar 1, 2024

1.0.6

Jan 26, 2024

1.0.6b0 pre-release

Mar 1, 2024

1.0.5.post2

Dec 14, 2023

1.0.5.post1

Dec 7, 2023

1.0.5

Dec 7, 2023

1.0.4

Dec 4, 2023

1.0.3

Nov 14, 2023

1.0.2

Sep 13, 2023

1.0.2b3 pre-release

Nov 28, 2023

1.0.2b2.post3 pre-release

Nov 1, 2023

1.0.2b2.post2 pre-release

Oct 16, 2023

1.0.2b1 pre-release

Oct 14, 2023

1.0.1.post2

Sep 10, 2023

1.0.1

Sep 6, 2023

1.0.0

Aug 28, 2023

0.1.3

Aug 21, 2023

0.1.2.dev0 pre-release

Aug 19, 2023

0.1.1.dev0 pre-release

Aug 7, 2023

0.1.0.dev0 pre-release

Aug 7, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

monsterapi-1.0.9.tar.gz (29.1 kB view hashes)

Uploaded Jun 21, 2024 Source

Built Distribution

monsterapi-1.0.9-py3-none-any.whl (28.9 kB view hashes)

Uploaded Jun 21, 2024 Python 3

Hashes for monsterapi-1.0.9.tar.gz

Hashes for monsterapi-1.0.9.tar.gz
Algorithm	Hash digest
SHA256	`7886c7c652126d1aab1d2d0ac241304aaf26ce39955de0419bb40ddd96ca7069`
MD5	`210b6a1d8b2ae75e9233b90a1bff6af8`
BLAKE2b-256	`4ff029e2c15b20d6c88252a0c2f4e30bb42cbd78063a54773d0da16ae47c9b70`

Hashes for monsterapi-1.0.9-py3-none-any.whl

Hashes for monsterapi-1.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b09a6a0ddd9a618ad01f6b8653b24f0b8ae0a0a11c60356a6eeb87161f2e65b3`
MD5	`33cd9c670b89f0cf3a18634af0763b77`
BLAKE2b-256	`1b43323045105e5475ec0c10f19652f8101b0ecb8637ddea1c1267b3964f86b2`

monsterapi 1.0.9

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Monsterapi v2

Installation

Has support to following MonsterAPI services:

Beta Next Gen LLM Services

Supported Models:

Highlights:

GA LLM Models Old Generation

Supported Models:

Code Documentation:

Basic Usage to access Hosted AI-Models

Import Module

set MONSTER_API_KEY env variable to your API key.

pass api_key parameter to client constructor.

Use generate method

Quick Serve LLM

Launch a llama2-7b model using QuickServe API

Run tests

Install test dependencies

Run functional tests involving actual API key

Run unit tests

PIP package push Instructions

LLama Index CLient Usage

About us

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

set `MONSTER_API_KEY` env variable to your API key.

pass `api_key` parameter to client constructor.