
llm-replicate


LLM plugin for models hosted on Replicate

Installation

First, install the LLM command-line utility.

Now install this plugin in the same environment as LLM.

llm install llm-replicate

Configuration

You will need an API key from Replicate. You can create one from your Replicate account settings.

You can set that as an environment variable called REPLICATE_API_TOKEN, or add it to llm's collection of saved keys using:

llm keys set replicate
Enter key: <paste key here>
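If you prefer the environment-variable route, it looks like this (the token value shown is a placeholder, not a real key):

```shell
# Placeholder token: substitute your real Replicate API key
export REPLICATE_API_TOKEN="r8_xxxxxxxxxxxx"
```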

To fetch and save details of the default collection of language models hosted on Replicate, run this:

llm replicate fetch-models

To add specific models that aren't listed in that collection, use the llm replicate add command.

For the Llama 2 model from a16z-infra/llama13b-v2-chat, run this:

llm replicate add a16z-infra/llama13b-v2-chat \
  --chat --alias llama2

The --chat flag indicates that this is a chat model, which means it will work with the -c option for continuing conversations.

Here's the 70b version of that model:

llm replicate add \
  replicate/llama70b-v2-chat \
  --chat --alias llama70b

Usage

To run a prompt against a model, pass its name or an alias to llm -m:

llm -m llama2 "Ten great names for a pet pelican"

Sure, here are ten great names for a pet pelican:

  1. Pelty
  2. Peanut
  3. Puddles
  4. Nibbles
  5. Fuzzy
  6. Gizmo
  7. Hank
  8. Luna
  9. Scooter
  10. Splishy

I hope these suggestions help you find the perfect name for your pet pelican! Do you have any other questions?

Chat models can support continuing conversations, for example:

llm -c "Five more and make them more nautical"

Ahoy matey! Here be five more nautical-themed names for yer pet pelican:

  1. Captain Hook
  2. Anchoryn
  3. Seadog
  4. Plunder
  5. Pointe Pelican

I hope these suggestions help ye find the perfect name for yer feathered friend! Do ye have any other questions, matey?

Run llm models list to see the full list of models:

llm models list

You should see something like this:

Replicate: replicate-flan-t5-xl
Replicate: replicate-llama-7b
Replicate: replicate-gpt-j-6b
Replicate: replicate-dolly-v2-12b
Replicate: replicate-oasst-sft-1-pythia-12b
Replicate: replicate-stability-ai-stablelm-tuned-alpha-7b
Replicate: replicate-vicuna-13b
Replicate: replicate-replit-code-v1-3b
Replicate: replicate-replit-replit-code-v1-3b
Replicate: replicate-joehoover-falcon-40b-instruct (aliases: falcon)
Replicate (chat): replicate-a16z-infra-llama13b-v2-chat (aliases: llama2)

Then run a prompt through a specific model like this:

llm -m replicate-vicuna-13b "Five great names for a pet llama"

Registering extra models

To register additional models that are not included in the default Language models collection, find their ID on Replicate and use the llm replicate add command.

For example, to add the joehoover/falcon-40b-instruct model, run this:

llm replicate add joehoover/falcon-40b-instruct \
  --alias falcon

This adds the model with the alias falcon - a model can have zero or more aliases.

Now you can run it like this:

llm -m replicate-joehoover-falcon-40b-instruct \
  "Three reasons to get a pet falcon"

Or using the alias like this:

llm -m falcon "Three reasons to get a pet falcon"

You can edit the list of models you have registered using the default $EDITOR like this:

llm replicate edit-models

If you register a model using the --chat option that model will be treated slightly differently. Prompts sent to the model will be formatted like this:

User: user input here
Assistant:

If you use -c conversation mode the prompt will include previous messages in the conversation, like this:

User: Ten great names for a pet pelican
Assistant: Sure, here are ten great names for a pet pelican:

1. Pelty
2. Peanut
3. Puddles
4. Nibbles
5. Fuzzy
6. Gizmo
7. Hank
8. Luna
9. Scooter
10. Splishy

I hope these suggestions help you find the perfect name for your pet pelican! Do you have any other questions?
User: Five more and make them more nautical
Assistant:
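As a rough illustration, the prompt formatting described above can be sketched in Python. This is a hypothetical helper, not the plugin's actual implementation:

```python
def build_chat_prompt(history, user_input):
    """Format a conversation into the User:/Assistant: prompt style
    used for models registered with --chat.

    history: list of (user_message, assistant_response) pairs from
    earlier turns in the conversation.
    """
    parts = []
    for user_message, assistant_response in history:
        parts.append(f"User: {user_message}")
        parts.append(f"Assistant: {assistant_response}")
    # The current prompt goes last, with a trailing "Assistant:" line
    # so the model continues from there.
    parts.append(f"User: {user_input}")
    parts.append("Assistant:")
    return "\n".join(parts)
```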

Fetching all Replicate predictions

Replicate logs all predictions made against models. You can fetch all of these predictions using the llm replicate fetch-predictions command:

llm replicate fetch-predictions

This will create or populate a table in your LLM logs.db database called replicate_predictions.

The data in this table will cover ALL Replicate models, not just language models that have been queried using this tool.

Running llm replicate fetch-predictions multiple times will only fetch predictions that have been created since the last time the command was run.

To browse the resulting data in Datasette, run this:

datasette "$(llm logs path)"

The schema for that table will look like this:

CREATE TABLE [replicate_predictions] (
   [id] TEXT PRIMARY KEY,
   [_model_guess] TEXT,
   [completed_at] TEXT,
   [created_at] TEXT,
   [error] TEXT,
   [input] TEXT,
   [logs] TEXT,
   [metrics] TEXT,
   [output] TEXT,
   [started_at] TEXT,
   [status] TEXT,
   [urls] TEXT,
   [version] TEXT,
   [webhook_completed] TEXT
)

This schema may change if the Replicate API adds new fields in the future.
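To illustrate querying that table programmatically, here is a sketch using Python's sqlite3 module against an in-memory copy of the schema above. The real database lives at the path reported by `llm logs path`, and the inserted row here is made up; finding the most recent created_at is the kind of query an incremental fetch would use to resume:

```python
import sqlite3

# In-memory stand-in for the llm logs.db database; the schema
# mirrors the replicate_predictions table shown above.
conn = sqlite3.connect(":memory:")
conn.execute("""
CREATE TABLE replicate_predictions (
   id TEXT PRIMARY KEY,
   _model_guess TEXT,
   completed_at TEXT,
   created_at TEXT,
   error TEXT,
   input TEXT,
   logs TEXT,
   metrics TEXT,
   output TEXT,
   started_at TEXT,
   status TEXT,
   urls TEXT,
   version TEXT,
   webhook_completed TEXT
)""")
# Fabricated example row for demonstration purposes
conn.execute(
    "INSERT INTO replicate_predictions (id, created_at, status) "
    "VALUES (?, ?, ?)",
    ("abc123", "2023-07-20T00:00:00Z", "succeeded"),
)
# The most recent created_at marks where an incremental fetch resumes
(latest,) = conn.execute(
    "SELECT max(created_at) FROM replicate_predictions"
).fetchone()
print(latest)
```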

Development

To set up this plugin locally, first check out the code. Then create a new virtual environment:

cd llm-replicate
python3 -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

pip install -e '.[test]'

To run the tests:

pytest
