Skip to main content

Sych LLM Playground is a command line tool deploy and interact with language models on the cloud.

Project description

Sych LLM Playground

PyPI Status Python Version License

Read the documentation at https://sych-llm-playground.readthedocs.io/ Tests Codecov

pre-commit Black

Requirements

  • Python Version: 3.10 or higher. Ensure that Python is properly installed on your system.

Installation

You can install Sych LLM Playground via pip from PyPI:

$ pip install sych-llm-playground

Usage

Please see the Command-line Reference for details.

Features

Sych LLM Playground offers a streamlined experience for managing and interacting with language models in the cloud. Its capabilities include:

Configure Cloud Credentials

  • A guided setup to input and securely store credentials for your preferred cloud platforms.
> sych-llm-playground configure

                 ******
            *******  ******
        *******          *******
      ****,      *****       *****
      ***    **************    ***   @@@@@@@@@                    @@@
      ***    ***       ****    ***   @@@       @@@   @@@  @@@@@@@ @@@@@@@@
      ***    ***       ****    ***    @@@@@@@@  @@@ @@@  @@@      @@@   @@@
      **     *****    *****    ***  @&     @@@   @@@@@   @@@      @@@   @@@
          **************       ***   @@@@@@@      @@@      @@@@@@ @@@   @@@
      *******              *******               @@@
          *******      ******
               **********
                   **

Welcome to the Sych LLM Playground CLI.
This tool is part of our efforts to contribute to the open-source community.
Explore more at https://sych.io

For detailed documentation, visit https://sych-llm-playground.readthedocs.io

Let's begin with the configuration.

[?] Please choose a provider:: AWS
 > AWS

Please Provide your AWS Access Key: xxxxxxxx
Please provide your AWS Secret Key: xxxxx
Please provide your ARN of the IAM role for SageMaker: xxxxxx
Please provide the AWS Region you want to deploy in [us-west-2]:
Configuration successful!

Deploy Models

  • Easily deploy various language models to supported cloud platforms.
> sych-llm-playground deploy

[?] Please choose a provider:: AWS
 > AWS

✓ Cloud Credentials validated.

✓ Cloud Credentials loaded.

[?] Select a model id to deploy:: Llama-2-7b - v2.0.0
 > Llama-2-7b - v2.0.0
   Llama-2-7b-chat - v1.1.0
   Llama-2-13b - v2.0.0
   Llama-2-13b-chat - v1.1.0
   Llama-2-70b - v1.1.0
   Llama-2-70b-chat v1.1.0

Deploying... Why not grab a cup of coffee? /|\

✓ Model and Endpoint Deployed

Endpoint name: sych-llm-pg-meta-textgeneration-llama-2-7b-e-1692399247

✓ Created REST API

✓ Fetched REST API

✓ Created API resources

✓ Created a POST method

✓ Created API Integration with SageMaker endpoint

✓ API Deployed

Public API HTTP (POST) URL: https://dhdb1mu9w1.execute-api.us-west-2.amazonaws.com/prod/predict

Deployment successful!

List Resources

  • Get an overview of all the deployed resources, including models, endpoints, and API Gateways.
> sych-llm-playground list

[?] Please choose a provider:: AWS
 > AWS

✓ Cloud Credentials validated.

✓ Cloud Credentials loaded.

Deployed Models:
{'name': 'sych-llm-pg-meta-textgeneration-llama-2-7b-f-m-1692586488'}

Deployed Endpoints:
{'name': 'sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692586488', 'url': 'https://runtime.sagemaker.us-west-2.amazonaws.com/endpoints/sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692586488/invocations'}

Deployed API Gateways:
{'name': 'sych-llm-pg-api-sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692558825', 'id': 'dhdb1mu9w1', 'method': 'POST', 'url': 'https://dhdb1mu9w1.execute-api.us-west-2.amazonaws.com/prod/predict'}

Interact with Models

  • Utilize a simple interface to communicate with deployed models, sending queries and receiving responses including a chat interface with conversation history for chat models:
> sych-llm-playground interact

[?] Please choose a provider:: AWS
 > AWS

✓ Cloud Credentials validated.

✓ Cloud Credentials loaded.

[?] Select an endpoint to interact with:: sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398
 > sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398

Provide a system instruction to guide the model's behavior (optional, e.g., 'Please talk in riddles.'): Be professional
Your desired Max new tokens? (default 256): 70
Your desired top_p? (default 0.9):
Your desired Temperature? (default 0.6) :

Type 'exit' to end the chat.

You: Hi my name is Ryan

Model:  Hello Ryan,

It's a pleasure to meet you. How are you today?

You: What is my name?

Model:  Ryan, it's nice to meet you. How are you today?

You: exit
Exiting chat...
Chat ended.

  • Interact via Public HTTP API
curl -X POST \
    -H 'Content-Type: application/json' \
    -H 'custom_attributes: accept_eula=true' \
    -d '{"inputs": [[{"role": "system", "content": "Talk profession"}, {"role": "user", "content": "Hi my name is Ryan"}]], "parameters": {"max_new_tokens": 256, "top_p": 0.9, "temperature": 0.6}}' \
    'https://valauuhvic.execute-api.us-west-2.amazonaws.com/prod/predict'

[
  {
    "generation":{
      "role":"assistant",
      "content":" Hello Ryan, it's a pleasure to meet you. How may I assist you today? Is there something specific you need help with or would you like to discuss a particular topic? I'm here to listen and provide guidance to the best of my abilities. Please feel free to ask me anything."
    }
  }
]%

Cleanup Resources

  • Safely remove deployed models, endpoints and API Gateways to manage costs and maintain a clean environment.
> sych-llm-playground cleanup

[?] Please choose a provider:: AWS
 > AWS

✓ Cloud Credentials validated.

✓ Cloud Credentials loaded.

[?] What would you like to cleanup?: Endpoint
  Model
> Endpoint
  API Gateway

[?] Select a endpoint to cleanup:: sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398
 > sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398

Endpoint sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398 cleaned up successfully.

Coming Soon

  • Fine-tuning via CLI: Direct fine-tuning of models using the CLI will be available soon.

  • Interaction via GUI: Playround will soon support interaction via a graphical user interface.

  • Local Playground Downloading and interacting selected models will soon be available.

Supported Models

Sych LLM Playground currently supports the following language models, with more to be added soon:

Llama Models

  • Llama-2-7b: Version 2.0.0
  • Llama-2-7b-chat: Version 1.1.0
  • Llama-2-13b: Version 2.0.0
  • Llama-2-13b-chat: Version 1.1.0
  • Llama-2-70b: Version 1.1.0
  • Llama-2-70b-chat: Version 1.1.0

Stay tuned for updates as we expand support to include additional language models.

Supported Cloud Platforms

Playground currently supports the following platforms:

Amazon Web Services (AWS) SageMaker

Amazon SageMaker is a managed service that provides developers and data scientists with the ability to build, train, and deploy machine learning (ML) models quickly and easily. Here are some key concepts to understand:

  • Model: A trained machine learning model that can be deployed to an endpoint for making predictions.
  • Endpoint: A hosted deployment of your model, facilitating real-time predictions.
  • API Gateway: A gateway that allows you to call your endpoints. In the context of this tool, it enables interaction with models via a publicly accessible HTTP URL. This tool automatically creates a publicly available POST API endpoint upon successful deployment.

The naming conventions for Models, Endpoints, and API Gateways deployed by this Playground follow these formats:

  • Model: "sych-llm-pg-{model_id}-m-{timestamp}"
  • Endpoint: sych-llm-pg-{model_id}-e-{timestamp}
  • API Gateway: "sych-llm-pg-api-{endpoint_name}"

The timestamp in the naming convention helps in matching endpoints with their corresponding models, especially when two or more models exist with the same ID.

AWS SageMaker Instance Types and Cloud Costs

AWS SageMaker allows running models on specific hardware instance types, such as ml.g5.2xlarge. It's essential to be aware of the associated costs and quotas:

  • It is common to have an applied default quota value of 0 for specific instance types on AWS.

  • To enable them, you need to:

    1. Go to your AWS Console > Service Quotas.
    2. Navigate to AWS Services -> Amazon SageMaker -> Apply Quotas for specific instance types.
    3. Apply for the required quota. Please note, it can sometimes take over 1 day to get a quota approved.
  • Here's a link to search for specific instance types used by a model, which you can apply quotas for. If you can't find the instance type for your model, and do not have a quota assigned, the CLI will display an error message with the exact instance type that needs an assigned quota value.

  • For more information about the costs associated with SageMaker and the specific instance types, you can refer to the AWS SageMaker Pricing Page.

Requirements

1. Register for an AWS Account

If you don't have one already, you can create an AWS account here.

2. Create an IAM Role for SageMaker and API Gateway

Follow these steps to set up the role:

a. Create a New IAM Role: Navigate to IAM in the AWS Console, and create a new role.

b. Add Trust Policy: Use the following custom trust policy to allow SageMaker and API Gateway to assume this role:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": {
        "Service": ["sagemaker.amazonaws.com", "apigateway.amazonaws.com"]
      },
      "Action": "sts:AssumeRole"
    }
  ]
}

c. Attach Permission Policy: Under the newly created role, attach the AmazonSageMakerFullAccess managed policy.

3. Create an IAM User with Necessary Permissions

Here's how to create the user:

a. Create IAM User: In the IAM section of the AWS Console, create a new user.

b. Attach Managed Policies: Attach the AmazonSageMakerFullAccess and AmazonAPIGatewayAdministrator managed policies to the user.

c. Add Custom Inline Policy: Add the following custom inline policy, replacing YOUR_IAM_ROLE_ARN with the ARN of the IAM role you created earlier:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": "iam:PassRole",
      "Resource": "YOUR_IAM_ROLE_ARN"
    }
  ]
}

d. Create an Access Key: In the user's security credentials tab, create a new access key. Be sure to store the generated Access Key ID and Secret Access Key in a safe place.

4. Configure the CLI

Use the Access Key ID, Secret Access Key and your IAM role ARN to configure the CLI as shown in the examples above.

Other Cloud Providers

Other popular cloud platforms are on our roadmap and will be supported soon. Stay tuned for updates, and don't hesitate to contribute or request support for your preferred platforms.

Contributing

Contributions are very welcome. To learn more, see the Contributor Guide.

License

Distributed under the terms of the Apache 2.0 license, Sych LLM Playground is free and open source software.

Issues

If you encounter any problems, please file an issue along with a detailed description.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sych-llm-playground-0.3.0.tar.gz (27.1 kB view details)

Uploaded Source

Built Distribution

sych_llm_playground-0.3.0-py3-none-any.whl (28.7 kB view details)

Uploaded Python 3

File details

Details for the file sych-llm-playground-0.3.0.tar.gz.

File metadata

  • Download URL: sych-llm-playground-0.3.0.tar.gz
  • Upload date:
  • Size: 27.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for sych-llm-playground-0.3.0.tar.gz
Algorithm Hash digest
SHA256 239c7a6d268480fb1811d8cffebe29cba72dd1ece10a4dea0ef90eb83a942192
MD5 3553b266e146edfc90cc14a8a1ebaa56
BLAKE2b-256 3ab7e83b928985755b8dba2f24abb4df8bc5bd2c514eb94982c60b6a166de658

See more details on using hashes here.

File details

Details for the file sych_llm_playground-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for sych_llm_playground-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 86ba26c6a494c1ff805237196460378787d5b2dc318de701acd14488f80d0114
MD5 e566f5262a855c3babc4337f3d08448b
BLAKE2b-256 cfdd8346bbd198008d06763f314a6d58d338ca4e67a2ada9b7ed39ae334f45ce

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page