Skip to main content

A Python package for replicating Gradio applications

Project description

sambanova_gradio

is a Python package that makes it very easy for developers to create machine learning apps that are powered by sambanova's Inference API.

Installation

Install this package: pip install sambanova-gradio

That's it!

Basic Usage

Just like if you were to use the sambanova API, you should first save your sambanova API token to this environment variable:

export SAMBANOVA_API_KEY=<your token>

Then in a Python file, write:

import gradio as gr
import sambanova_gradio

gr.load(
    name='Meta-Llama-3.1-405B-Instruct',
    src=sambanova_gradio.registry,
).launch()

Run the Python file, and you should see a Gradio Interface connected to the model on sambanova!

ChatInterface

Customization

Once you can create a Gradio UI from a sambanova endpoint, you can customize it by setting your own input and output components, or any other arguments to gr.Interface. For example, the screenshot below was generated with:

import gradio as gr
import sambanova_gradio

gr.load(
    name='Meta-Llama-3.1-405B-Instruct',
    src=sambanova_gradio.registry,
    title='Sambanova-Gradio Integration',
    description="Chat with Meta-Llama-3.1-405B-Instruct model.",
    examples=["Explain quantum gravity to a 5-year old.", "How many R are there in the word Strawberry?"]
).launch()

ChatInterface with customizations

Composition

Or use your loaded Interface within larger Gradio Web UIs, e.g.

import gradio as gr
import sambanova_gradio

with gr.Blocks() as demo:
    with gr.Tab("405B"):
        gr.load('Meta-Llama-3.1-405B-Instruct', src=sambanova_gradio.registry)
    with gr.Tab("70B"):
        gr.load('Meta-Llama-3.1-70B-Instruct-8k', src=sambanova_gradio.registry)

demo.launch()

Under the Hood

The sambanova-gradio Python library has two dependencies: openai and gradio. It defines a "registry" function sambanova_gradio.registry, which takes in a model name and returns a Gradio app.

Supported Models in Sambanova Cloud

Model Context Length Output Length Dtype / Precision
Meta-Llama-3.1-8B-Instruct 4096 1000 BF16
Meta-Llama-3.1-8B-Instruct-8k 8192 1000 BF16
Meta-Llama-3.1-70B-Instruct 4096 1000 BF16
Meta-Llama-3.1-70B-Instruct-8k 8192 1000 BF16
Meta-Llama-3.1-405B-Instruct 4096 1000 BF16
Meta-Llama-3.1-405B-Instruct-8k 8192 1000 BF16

Note: if you are getting a 401 authentication error, then the sambanova API Client is not able to get the API token from the environment variable. This happened to me as well, in which case save it in your Python session, like this:

import os

os.environ["SAMBANOVA_API_KEY"] = ...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sambanova_gradio-0.1.6.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sambanova_gradio-0.1.6-py3-none-any.whl (3.1 kB view details)

Uploaded Python 3

File details

Details for the file sambanova_gradio-0.1.6.tar.gz.

File metadata

  • Download URL: sambanova_gradio-0.1.6.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.0

File hashes

Hashes for sambanova_gradio-0.1.6.tar.gz
Algorithm Hash digest
SHA256 740e0307b5ba51c038cf1aebd71a56a499fa51e45682932dc579481129f25a8a
MD5 186c1a2eba93bee6ab5e2ee5f0c3658e
BLAKE2b-256 9c2a93a5d4fdcbdb2ed07841c7a0a0ab352c64b13e2d07ad69dab687ab90aad3

See more details on using hashes here.

File details

Details for the file sambanova_gradio-0.1.6-py3-none-any.whl.

File metadata

File hashes

Hashes for sambanova_gradio-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 dbc029fa02d54a19e9870aa3b61a59f1d38d24c1ba4606f237cf4bc2f9ddf8a0
MD5 73239b7396bef4e28846ab7c09940e8d
BLAKE2b-256 8b7b99266914148e480c7afb07829018f41a40e7f3103a30f4e0b1d5a39b9a38

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page