Skip to main content

An LLM CLI plugin for Cloudflare Workers AI models.

Project description

llm-cloudflare

A plugin for the llm CLI that allows you to use the text generation models (LLMs) running on globally on Cloudflare Workers AI, including models like Llama 3.1, Mistral 7B, Gemma and a number of task-specific fine tunes.

llm-cloudflare is useful for:

  • Using and building with LLMs that may not efficiently run on your local machine (limited GPU, memory, etc) vs. having Workers AI run it on a GPU near you.
  • Validating the performance of and/or comparing multiple models.
  • Experimenting without needing to download models ahead-of-time.

Usage

Prerequisite: You'll need the llm CLI installed first.

Install and setup the plugin:

# Install the plugin from pip
llm install llm-cloudflare

# Provide a valid Workers AI token
# Docs: https://developers.cloudflare.com/workers-ai/get-started/rest-api/#1-get-api-token-and-account-id
llm keys set cloudflare

# Set your Cloudflare account ID
# Docs: https://developers.cloudflare.com/workers-ai/get-started/rest-api/#1-get-api-token-and-account-id
export CLOUDFLARE_ACCOUNT_ID="33charlonghexstringhere"

Use it by specifying a Workers AI model:

llm -m "@cf/meta/llama-3.1-8b-instruct" "Write a Cloudflare Worker in ESM format that returns an empty JSON object as a response. Show only the code."

You can set a Workers AI model as the default model in llm:

# Set Llama 3.1 8B as the default
llm models default "@cf/meta/llama-3.1-8b-instruct" 
# See what model is set as the default
llm models default
# @cf/meta/llama-3.1-8b-instruct

Available models

This plugin provides access to the text generation models (LLMs) provided by Workers AI.

To see what models are available, invoke llm models. Models prefixed with Cloudflare Workers AI are provided by this plugin.

The supported models are generated by scripts. New models thus rely on this plugin being updated periodically.

In the future, this plugin may also add support for Workers AI's embedding models for use with llm embed.

Credits

Credit to @hex for https://github.com/hex/llm-perplexity, which heavily inspired the design of this plugin.

License

Copyright Cloudflare, Inc (2024). Apache-2.0 licensed. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_cloudflare-0.5.6.tar.gz (8.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llm_cloudflare-0.5.6-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file llm_cloudflare-0.5.6.tar.gz.

File metadata

  • Download URL: llm_cloudflare-0.5.6.tar.gz
  • Upload date:
  • Size: 8.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for llm_cloudflare-0.5.6.tar.gz
Algorithm Hash digest
SHA256 f60299962418ef5121e7a5d05b1a83a48c5f677e18090a5840d2c237f4dcccdd
MD5 9c80d0d32bc391a51a5c6ec9c510ca64
BLAKE2b-256 a00041d48a86ffdcd5f5891aab7e00d8fd9621f39d7b219b60d8c81f09dfcde8

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_cloudflare-0.5.6.tar.gz:

Publisher: workflow.yml on elithrar/llm-cloudflare

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file llm_cloudflare-0.5.6-py3-none-any.whl.

File metadata

  • Download URL: llm_cloudflare-0.5.6-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for llm_cloudflare-0.5.6-py3-none-any.whl
Algorithm Hash digest
SHA256 d856430d189535e9a0adc88e4ad16ee90c7683fa78aa9d4875da2db6094a943f
MD5 5b0a6498635316b4361bf2186315d856
BLAKE2b-256 b4ae445206a1f3731f2f81b7a5ac06c0c60855bd40ab4e342e20a1b4830b33f8

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_cloudflare-0.5.6-py3-none-any.whl:

Publisher: workflow.yml on elithrar/llm-cloudflare

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page