A simple yet effective command-line client for chatting with LLMs using the LiteLLM wrapper or Ollama models
ChatLLM CLI
Overview
This project provides a command-line interface (CLI) for interacting with various large language models (LLMs) using the LiteLLM wrapper. It supports multiple providers, including OpenAI, Anthropic, Azure, and Gemini. The CLI allows users to chat with these models, manage budgets, and handle API keys efficiently.
Inspired by the simple single-script app by Marcolardera.
Big Refactor
The project has undergone a significant refactor to improve modularity and maintainability. The codebase is now split into multiple modules:
- `llm_api`: Handles interactions with the LLMs.
- `prompt`: Manages user prompts and input handling.
- `logs`: Manages logging and console output.
- `config`: Handles configuration, API keys, and budget management.
Configuration
The configuration is managed through a `config.yaml` file. Below is an example configuration:

```yaml
provider: "anthropic"
model: "claude-3-sonnet-20240229"
temperature: 0.7
markdown: true
easy_copy: true
non_interactive: false
json_mode: false
use_proxy: false
proxy: "socks5://127.0.0.1:2080"
storage_format: "markdown"
embedding_model: "text-embedding-ada-002"
embedding_dimension: 1536
max_context_tokens: 3500
show_spinner: true
max_tokens: 1024
budget_enabled: true
budget_amount: 10.0
budget_duration: "monthly"
budget_user: "default_user"

# API keys (uncomment and fill in as needed)
# openai_api_key: "<INSERT YOUR OPENAI API KEY HERE>"
# anthropic_api_key: "<INSERT YOUR ANTHROPIC API KEY HERE>"
# azure_api_key: "<INSERT YOUR AZURE API KEY HERE>"
# gemini_api_key: "<INSERT YOUR GEMINI API KEY HERE>"

# Azure-specific settings (uncomment if using Azure)
# azure_endpoint: "https://xxxx.openai.azure.com/"
# azure_api_version: "2023-07-01-preview"
# azure_deployment_name: "gpt-35-turbo"
# azure_deployment_name_eb: "text-embedding-ada-002"
```
Installation and Usage
- Install the CLI:

  ```bash
  pipx install git+https://github.com/tadeasf/chat_llm_cli.git@styling
  ```

- Configure the CLI: Edit the `config.yaml` file to set your preferred provider, model, and other settings.
- Run the CLI:

  ```bash
  chat_llm_cli
  ```

- Commands:
  - `/q`: Quit the application.
  - `/c <n>`: Copy the nth code block to the clipboard.
  - `/e`: Open the last response in the editor.
How It Works
Interacting with LLMs
The `llm_api` module is responsible for sending messages to the LLM with the given context and returning the response. It handles different providers and ensures all messages have valid roles. The module uses LiteLLM's `completion` function to interact with the models.
Key Functions and Flow
- chat_with_context:
  - This function sends a message to the LLM with the given context and returns the response.
  - It takes in the configuration dictionary, a list of messages, a prompt session, a proxy configuration, and a flag to show a spinner.
  - It ensures all messages have valid roles and handles provider-specific requirements, such as prefix handling for Anthropic models.
  - It uses LiteLLM's `completion` function to get the response from the LLM; the response is then processed by the `handle_response` function (see the sketch below).
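A minimal sketch of how such a call can be assembled with LiteLLM, assuming the configuration keys from the `config.yaml` example above; the simplified signature (no prompt session, proxy, or spinner handling) is an illustration, not the project's exact code.

```python
# Sketch of a chat_with_context-style call. Names and signature are
# illustrative; the project's actual function takes more parameters.
import litellm

VALID_ROLES = {"system", "user", "assistant"}

def chat_with_context(config: dict, messages: list[dict]) -> str:
    # Keep only messages that carry a valid role before sending them.
    safe_messages = [m for m in messages if m.get("role") in VALID_ROLES]

    response = litellm.completion(
        model=config["model"],  # e.g. "claude-3-sonnet-20240229"
        messages=safe_messages,
        temperature=config.get("temperature", 0.7),
        max_tokens=config.get("max_tokens", 1024),
    )
    # LiteLLM returns an OpenAI-style response object.
    return response.choices[0].message.content
```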
LiteLLM Integration
- Completion Function: The `completion` function from LiteLLM is used to interact with the models. It takes in parameters such as the model, messages, and API key. For more details, refer to the LiteLLM documentation on completion.
- Provider-Specific Parameters: The module handles provider-specific parameters, such as prefix handling for Anthropic models. For more details, refer to the LiteLLM documentation on provider-specific parameters.
- Response Handling: The response from the LLM is processed to extract the content and usage statistics (see the example below). For more details, refer to the LiteLLM documentation on response output.
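For illustration, the snippet below shows the OpenAI-style response object that LiteLLM returns and how the content and usage statistics can be pulled out of it; attribute names follow LiteLLM's documented response format.

```python
# Extracting content and usage statistics from a LiteLLM response.
import litellm

response = litellm.completion(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello!"}],
)

content = response.choices[0].message.content  # the model's reply
usage = response.usage                         # OpenAI-style usage block
print(content)
print(f"prompt={usage.prompt_tokens} completion={usage.completion_tokens} total={usage.total_tokens}")
```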
Managing User Prompts
The `prompt` module manages user prompts and input handling. It provides features such as key bindings for quitting the application and opening the last response in an editor.
- start_prompt:
  - This function starts the prompt loop and handles user input.
  - It takes in the prompt session, configuration dictionary, list of messages, token counts, and code blocks.
  - It handles special commands like quitting the application, copying code blocks, and opening the last response in an editor (see the sketch after this list).
- add_markdown_system_message:
  - This function adds a system message to instruct the model to use Markdown formatting.
- get_usage_stats:
  - This function retrieves usage statistics for all users, including current cost, model costs, and total budget.
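A hedged sketch of such a prompt loop using `prompt_toolkit`'s `PromptSession`; the command names come from the Commands section above, while the dispatch body is illustrative and stubs out the clipboard and editor handling.

```python
# Illustrative start_prompt-style loop with prompt_toolkit. Only the
# dispatch structure is shown; clipboard and editor handling are stubbed.
from prompt_toolkit import PromptSession

def start_prompt(session: PromptSession, messages: list[dict]) -> None:
    while True:
        user_input = session.prompt(">>> ").strip()
        if user_input == "/q":
            break                              # quit the application
        elif user_input.startswith("/c"):
            print("copying code block...")     # clipboard handling omitted
        elif user_input == "/e":
            print("opening editor...")         # editor handling omitted
        else:
            messages.append({"role": "user", "content": user_input})
            # hand the updated message list to chat_with_context here

if __name__ == "__main__":
    start_prompt(PromptSession(), [])
```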
Managing Expenses
The `expenses` module provides functions to display and calculate expenses based on token usage and pricing rates.
- display_expense:
  - This function displays the current cost, total budget, and remaining budget for the user.
  - It uses the `BudgetManager` from the `config` module to get the current cost and total budget.
- calculate_expense:
  - This function calculates the expense based on the number of tokens and pricing rates (see the sketch below).
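The calculation itself reduces to token counts multiplied by per-token rates. The sketch below assumes separate prompt and completion rates quoted per 1,000 tokens; the rates in the example are placeholders, not real prices.

```python
# Token-based expense calculation. Rates are hypothetical placeholders.
def calculate_expense(
    prompt_tokens: int,
    completion_tokens: int,
    prompt_rate_per_1k: float,
    completion_rate_per_1k: float,
) -> float:
    return (
        prompt_tokens / 1000 * prompt_rate_per_1k
        + completion_tokens / 1000 * completion_rate_per_1k
    )

# Example: 1,500 prompt tokens and 500 completion tokens.
cost = calculate_expense(1500, 500, 0.003, 0.015)
print(f"${cost:.4f}")  # $0.0120
```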
Managing History
The `history` module provides functions to load and save conversation history.
- load_history_data:
  - This function loads history data from a file. It supports both JSON and Markdown formats.
- save_history:
  - This function saves the history data to a file. It supports both JSON and Markdown formats (see the sketch below for the JSON case).
- calculate_tokens_and_cost:
  - This function calculates the number of tokens and cost for a conversation.
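A minimal sketch of the JSON side of these helpers, assuming the history is a list of role/content message dicts; the Markdown branch and the project's actual file layout are omitted.

```python
# Illustrative JSON load/save helpers for conversation history.
import json
from pathlib import Path

def load_history_data(path: Path) -> list[dict]:
    # Return an empty history if no file has been written yet.
    if not path.exists():
        return []
    return json.loads(path.read_text(encoding="utf-8"))

def save_history(path: Path, messages: list[dict]) -> None:
    path.write_text(json.dumps(messages, indent=2), encoding="utf-8")
```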
Configuration and Budget Management
The `config` module loads the configuration from the `config.yaml` file, initializes the budget manager, and provides utility functions for managing the configuration and budget. It uses LiteLLM's `BudgetManager` to handle budget constraints and track usage.
- load_config:
  - This function loads the configuration from the config file and ensures all necessary keys are present.
- initialize_budget_manager:
  - This function initializes the budget manager with the specified configuration.
- check_budget:
  - This function checks if the current cost is within the budget limit (see the sketch below).
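A hedged sketch of how `initialize_budget_manager` and `check_budget` can be built on LiteLLM's `BudgetManager`, using the `budget_*` keys from the example `config.yaml`; the method calls follow LiteLLM's documented `BudgetManager` API, though details may vary between versions.

```python
# Budget setup and checking with LiteLLM's BudgetManager. Exact method
# behavior may differ across LiteLLM versions; treat this as a sketch.
from litellm import BudgetManager

def initialize_budget_manager(config: dict) -> BudgetManager:
    manager = BudgetManager(project_name="chat_llm_cli")
    user = config.get("budget_user", "default_user")
    if not manager.is_valid_user(user):
        manager.create_budget(
            total_budget=config.get("budget_amount", 10.0),
            user=user,
            duration=config.get("budget_duration", "monthly"),
        )
    return manager

def check_budget(manager: BudgetManager, user: str) -> bool:
    # True while spending stays within the configured limit.
    return manager.get_current_cost(user=user) <= manager.get_total_budget(user)
```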
Main Application
The `chatgpt.py` file is the main entry point of the application. It initializes the CLI, handles user input, and coordinates interactions between different modules.
- Initialization:
  - The script initializes global variables, sets up logging, and configures the CLI using `rich_click`.
- Model and Path Completion:
  - The `ModelCompleter` and `PathCompleter` classes provide autocompletion for model names and file paths, respectively.
- CLI Options:
  - The script defines various CLI options using `click`, such as setting the model, temperature, max tokens, API key, and more (see the sketch below).
- Main Function:
  - The `main` function handles the main logic of the application. It loads the configuration, validates the model, and starts the prompt session.
  - It checks the budget before making API calls and updates the budget after receiving responses.
  - It saves the conversation history and displays usage statistics.
I will eventually refactor the main function further.
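For illustration, a reduced sketch of declaring such options with `rich_click`, which acts as a drop-in wrapper around `click` with rich-formatted help output; the option subset and the body of `main` are hypothetical.

```python
# Illustrative CLI option declarations. rich_click wraps click, so the
# standard click decorators work unchanged.
import rich_click as click

@click.command()
@click.option("--model", default=None, help="Override the configured model.")
@click.option("--temperature", type=float, default=None, help="Sampling temperature.")
@click.option("--max-tokens", type=int, default=None, help="Maximum response tokens.")
def main(model, temperature, max_tokens):
    """Entry point: load config, validate the model, start the prompt session."""
    click.echo(f"model={model} temperature={temperature} max_tokens={max_tokens}")

if __name__ == "__main__":
    main()
```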
Contributing
Please read CONTRIBUTING.md for guidelines on contributing to this project.
Logging
The `logs` module manages logging and console output. It uses the `loguru` library for logging. To enable verbose output for LiteLLM, set the `LITELLM_LOG` environment variable to `DEBUG`, for example via `os.environ["LITELLM_LOG"] = "DEBUG"`.
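A minimal example combining the two, assuming the environment variable is set before the first LiteLLM request:

```python
# Enable verbose LiteLLM output and route application logs through loguru.
import os

os.environ["LITELLM_LOG"] = "DEBUG"  # set before LiteLLM issues any requests

from loguru import logger

logger.info("ChatLLM CLI logging configured")
```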
BLAKE2b-256 | 8f98751784b7df9dd31d94b08d3a5f155b0e7bf43874f25a145c7ac9e3283d45 |