Sych LLM Playground is a command line tool deploy and interact with language models on the cloud.
Project description
Sych LLM Playground
Requirements
- Python Version: 3.10 or higher. Ensure that Python is properly installed on your system.
Installation
You can install Sych LLM Playground via pip from PyPI:
$ pip install sych-llm-playground
Usage
Please see the Command-line Reference for details.
Features
Sych LLM Playground offers a streamlined experience for managing and interacting with language models in the cloud. Its capabilities include:
Configure Cloud Credentials
- A guided setup to input and securely store credentials for your preferred cloud platforms.
> sych-llm-playground configure
******
******* ******
******* *******
****, ***** *****
*** ************** *** @@@@@@@@@ @@@
*** *** **** *** @@@ @@@ @@@ @@@@@@@ @@@@@@@@
*** *** **** *** @@@@@@@@ @@@ @@@ @@@ @@@ @@@
** ***** ***** *** @& @@@ @@@@@ @@@ @@@ @@@
************** *** @@@@@@@ @@@ @@@@@@ @@@ @@@
******* ******* @@@
******* ******
**********
**
Welcome to the Sych LLM Playground CLI.
This tool is part of our efforts to contribute to the open-source community.
Explore more at https://sych.io
For detailed documentation, visit https://sych-llm-playground.readthedocs.io
Let's begin with the configuration.
[?] Please choose a provider:: AWS
> AWS
Please Provide your AWS Access Key: xxxxxxxx
Please provide your AWS Secret Key: xxxxx
Please provide your ARN of the IAM role for SageMaker: xxxxxx
Please provide the AWS Region you want to deploy in [us-west-2]:
Configuration successful!
Deploy Models
- Easily deploy various language models to supported cloud platforms.
> sych-llm-playground deploy
[?] Please choose a provider:: AWS
> AWS
✓ Cloud Credentials validated.
✓ Cloud Credentials loaded.
[?] Select a model id to deploy:: Llama-2-7b - v2.0.0
> Llama-2-7b - v2.0.0
Llama-2-7b-chat - v1.1.0
Llama-2-13b - v2.0.0
Llama-2-13b-chat - v1.1.0
Llama-2-70b - v1.1.0
Llama-2-70b-chat v1.1.0
Deploying... Why not grab a cup of coffee? /|\
✓ Model and Endpoint Deployed
Endpoint name: sych-llm-pg-meta-textgeneration-llama-2-7b-e-1692399247
✓ Created REST API
✓ Fetched REST API
✓ Created API resources
✓ Created a POST method
✓ Created API Integration with SageMaker endpoint
✓ API Deployed
Public API HTTP (POST) URL: https://dhdb1mu9w1.execute-api.us-west-2.amazonaws.com/prod/predict
Deployment successful!
List Resources
- Get an overview of all the deployed resources, including models, endpoints, and API Gateways.
> sych-llm-playground list
[?] Please choose a provider:: AWS
> AWS
✓ Cloud Credentials validated.
✓ Cloud Credentials loaded.
Deployed Models:
{'name': 'sych-llm-pg-meta-textgeneration-llama-2-7b-f-m-1692586488'}
Deployed Endpoints:
{'name': 'sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692586488', 'url': 'https://runtime.sagemaker.us-west-2.amazonaws.com/endpoints/sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692586488/invocations'}
Deployed API Gateways:
{'name': 'sych-llm-pg-api-sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692558825', 'id': 'dhdb1mu9w1', 'method': 'POST', 'url': 'https://dhdb1mu9w1.execute-api.us-west-2.amazonaws.com/prod/predict'}
Interact with Models
- Utilize a simple interface to communicate with deployed models, sending queries and receiving responses including a chat interface with conversation history for chat models:
> sych-llm-playground interact
[?] Please choose a provider:: AWS
> AWS
✓ Cloud Credentials validated.
✓ Cloud Credentials loaded.
[?] Select an endpoint to interact with:: sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398
> sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398
Provide a system instruction to guide the model's behavior (optional, e.g., 'Please talk in riddles.'): Be professional
Your desired Max new tokens? (default 256): 70
Your desired top_p? (default 0.9):
Your desired Temperature? (default 0.6) :
Type 'exit' to end the chat.
You: Hi my name is Ryan
Model: Hello Ryan,
It's a pleasure to meet you. How are you today?
You: What is my name?
Model: Ryan, it's nice to meet you. How are you today?
You: exit
Exiting chat...
Chat ended.
- Interact via Public HTTP API
curl -X POST \
-H 'Content-Type: application/json' \
-H 'custom_attributes: accept_eula=true' \
-d '{"inputs": [[{"role": "system", "content": "Talk profession"}, {"role": "user", "content": "Hi my name is Ryan"}]], "parameters": {"max_new_tokens": 256, "top_p": 0.9, "temperature": 0.6}}' \
'https://valauuhvic.execute-api.us-west-2.amazonaws.com/prod/predict'
[
{
"generation":{
"role":"assistant",
"content":" Hello Ryan, it's a pleasure to meet you. How may I assist you today? Is there something specific you need help with or would you like to discuss a particular topic? I'm here to listen and provide guidance to the best of my abilities. Please feel free to ask me anything."
}
}
]%
Cleanup Resources
- Safely remove deployed models, endpoints and API Gateways to manage costs and maintain a clean environment.
> sych-llm-playground cleanup
[?] Please choose a provider:: AWS
> AWS
✓ Cloud Credentials validated.
✓ Cloud Credentials loaded.
[?] What would you like to cleanup?: Endpoint
Model
> Endpoint
API Gateway
[?] Select a endpoint to cleanup:: sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398
> sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398
Endpoint sych-llm-pg-meta-textgeneration-llama-2-7b-f-e-1692383398 cleaned up successfully.
Coming Soon
-
Fine-tuning via CLI: Direct fine-tuning of models using the CLI will be available soon.
-
Interaction via GUI: Playround will soon support interaction via a graphical user interface.
-
Local Playground Downloading and interacting selected models will soon be available.
Supported Models
Sych LLM Playground currently supports the following language models, with more to be added soon:
Llama Models
- Llama-2-7b: Version 2.0.0
- Llama-2-7b-chat: Version 1.1.0
- Llama-2-13b: Version 2.0.0
- Llama-2-13b-chat: Version 1.1.0
- Llama-2-70b: Version 1.1.0
- Llama-2-70b-chat: Version 1.1.0
Stay tuned for updates as we expand support to include additional language models.
Supported Cloud Platforms
Playground currently supports the following platforms:
Amazon Web Services (AWS) SageMaker
Amazon SageMaker is a managed service that provides developers and data scientists with the ability to build, train, and deploy machine learning (ML) models quickly and easily. Here are some key concepts to understand:
- Model: A trained machine learning model that can be deployed to an endpoint for making predictions.
- Endpoint: A hosted deployment of your model, facilitating real-time predictions.
- API Gateway: A gateway that allows you to call your endpoints. In the context of this tool, it enables interaction with models via a publicly accessible HTTP URL. This tool automatically creates a publicly available POST API endpoint upon successful deployment.
The naming conventions for Models, Endpoints, and API Gateways deployed by this Playground follow these formats:
- Model:
"sych-llm-pg-{model_id}-m-{timestamp}"
- Endpoint:
sych-llm-pg-{model_id}-e-{timestamp}
- API Gateway:
"sych-llm-pg-api-{endpoint_name}"
The timestamp in the naming convention helps in matching endpoints with their corresponding models, especially when two or more models exist with the same ID.
AWS SageMaker Instance Types and Cloud Costs
AWS SageMaker allows running models on specific hardware instance types, such as ml.g5.2xlarge
. It's essential to be aware of the associated costs and quotas:
-
It is common to have an applied default quota value of 0 for specific instance types on AWS.
-
To enable them, you need to:
- Go to your AWS Console > Service Quotas.
- Navigate to AWS Services -> Amazon SageMaker -> Apply Quotas for specific instance types.
- Apply for the required quota. Please note, it can sometimes take over 1 day to get a quota approved.
-
Here's a link to search for specific instance types used by a model, which you can apply quotas for. If you can't find the instance type for your model, and do not have a quota assigned, the CLI will display an error message with the exact instance type that needs an assigned quota value.
-
For more information about the costs associated with SageMaker and the specific instance types, you can refer to the AWS SageMaker Pricing Page.
Requirements
1. Register for an AWS Account
If you don't have one already, you can create an AWS account here.
2. Create an IAM Role for SageMaker and API Gateway
Follow these steps to set up the role:
a. Create a New IAM Role: Navigate to IAM in the AWS Console, and create a new role.
b. Add Trust Policy: Use the following custom trust policy to allow SageMaker and API Gateway to assume this role:
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Principal": {
"Service": ["sagemaker.amazonaws.com", "apigateway.amazonaws.com"]
},
"Action": "sts:AssumeRole"
}
]
}
c. Attach Permission Policy: Under the newly created role, attach the AmazonSageMakerFullAccess
managed policy.
3. Create an IAM User with Necessary Permissions
Here's how to create the user:
a. Create IAM User: In the IAM section of the AWS Console, create a new user.
b. Attach Managed Policies: Attach the AmazonSageMakerFullAccess
and AmazonAPIGatewayAdministrator
managed policies to the user.
c. Add Custom Inline Policy: Add the following custom inline policy, replacing YOUR_IAM_ROLE_ARN
with the ARN of the IAM role you created earlier:
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Action": "iam:PassRole",
"Resource": "YOUR_IAM_ROLE_ARN"
}
]
}
d. Create an Access Key: In the user's security credentials tab, create a new access key. Be sure to store the generated Access Key ID and Secret Access Key in a safe place.
4. Configure the CLI
Use the Access Key ID, Secret Access Key and your IAM role ARN to configure the CLI as shown in the examples above.
Other Cloud Providers
Other popular cloud platforms are on our roadmap and will be supported soon. Stay tuned for updates, and don't hesitate to contribute or request support for your preferred platforms.
Contributing
Contributions are very welcome. To learn more, see the Contributor Guide.
License
Distributed under the terms of the Apache 2.0 license, Sych LLM Playground is free and open source software.
Issues
If you encounter any problems, please file an issue along with a detailed description.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file sych-llm-playground-0.3.0.tar.gz
.
File metadata
- Download URL: sych-llm-playground-0.3.0.tar.gz
- Upload date:
- Size: 27.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.17
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 239c7a6d268480fb1811d8cffebe29cba72dd1ece10a4dea0ef90eb83a942192 |
|
MD5 | 3553b266e146edfc90cc14a8a1ebaa56 |
|
BLAKE2b-256 | 3ab7e83b928985755b8dba2f24abb4df8bc5bd2c514eb94982c60b6a166de658 |
File details
Details for the file sych_llm_playground-0.3.0-py3-none-any.whl
.
File metadata
- Download URL: sych_llm_playground-0.3.0-py3-none-any.whl
- Upload date:
- Size: 28.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.17
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 86ba26c6a494c1ff805237196460378787d5b2dc318de701acd14488f80d0114 |
|
MD5 | e566f5262a855c3babc4337f3d08448b |
|
BLAKE2b-256 | cfdd8346bbd198008d06763f314a6d58d338ca4e67a2ada9b7ed39ae334f45ce |