Chat with your local models — Transformers, llama.cpp, MLX, vMLX (JANG), and Ollama

froggy

A terminal-based chat tool for running local AI models. Supports HuggingFace Transformers (with LoRA adapters), GGUF models via llama.cpp, Apple MLX on Apple Silicon, JANG models via vMLX, and Ollama.

Features

Auto-discovery - Scans directories to find local models, LoRA adapters, and GGUF files
Model management - Download, list, inspect, and remove models from the command line
Hardware-matched recommendations - Get model suggestions that fit your GPU/CPU/RAM via llmfit
Persistent configuration - YAML-based settings for device, format, and host preferences
Ollama integration - Auto-detects models from a running Ollama server and merges them into the selection menu
Streaming chat - Real-time token streaming with rich markdown rendering, automatic thinking-block filtering, and end-of-turn detection
LoRA support - Automatically detects and applies LoRA adapters, downloading base models as needed
Apple MLX - Native acceleration on Apple Silicon Macs via mlx-lm, auto-detected when available
vMLX for JANG models - Auto-routes JANG/unsupported MLX checkpoints (like Gemma 4 JANG) through a local vMLX OpenAI-compatible server
GPU acceleration - Auto-detects CUDA and selects optimal dtype (bfloat16/float32)
In-session controls - Switch models, adjust temperature, set system prompts, and more without restarting
Tool use - LLM-driven function calling (read/write files, run shell commands, web search) with a 3-tier safety model and custom plugin support

Installation

One-line install (recommended)

curl -fsSL https://raw.githubusercontent.com/overtimepog/froggy/main/install.sh | bash

This auto-detects your platform (Apple Silicon → MLX, CUDA → GPU, otherwise CPU) and installs froggy with the right backends. Customize the install location with FROGGY_INSTALL_DIR:

curl -fsSL https://raw.githubusercontent.com/overtimepog/froggy/main/install.sh | FROGGY_INSTALL_DIR=~/my-froggy bash
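
As a rough illustration of the platform detection described above, the sketch below maps a machine to one of the extras used in the manual install. It is not taken from install.sh: the function names and the use of nvidia-smi as a CUDA probe are assumptions.

```python
import platform
import shutil


def pick_extra(system: str, machine: str, has_nvidia: bool) -> str:
    """Map a platform description to a pip extra name ('' means CPU only)."""
    if system == "Darwin" and machine == "arm64":
        return "mlx"  # Apple Silicon -> MLX backend
    if has_nvidia:
        return "gpu"  # CUDA-capable machine -> GPU/Transformers backend
    return ""         # everything else -> plain CPU install


def detect_extra() -> str:
    # Assumption: nvidia-smi on PATH is treated as a hint that CUDA is usable.
    has_nvidia = shutil.which("nvidia-smi") is not None
    return pick_extra(platform.system(), platform.machine(), has_nvidia)


if __name__ == "__main__":
    extra = detect_extra()
    print(f'pip install ".[{extra}]"' if extra else "pip install .")
```

Keeping the decision in a pure function (pick_extra) makes the mapping easy to check without needing each kind of hardware.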

Manual install

# Clone the repo
git clone https://github.com/overtimepog/froggy.git
cd froggy

# Install (CPU only)
pip install .

# Install with GPU/Transformers support
pip install ".[gpu]"

# Install with Apple MLX + vMLX support (Apple Silicon only)
pip install ".[mlx]"

# Install with tool-use support (includes duckduckgo_search)
pip install ".[tools]"

Usage

# Start an interactive chat session
froggy chat

# Point to a specific models directory
froggy chat --models-dir /path/to/models

# Force CPU inference
froggy chat --device cpu

# Download a model from HuggingFace
froggy download mlx-community/Llama-3-8B-4bit

# List downloaded models
froggy list

# Show model details
froggy info Llama-3-8B-4bit

# Remove a model
froggy remove Llama-3-8B-4bit

# Get hardware-matched model recommendations
froggy recommend

# View or change configuration
froggy config

On launch, froggy chat scans for models and presents a selection menu. Pick a model and start chatting.

JANG checkpoints that include a jang_config.json are labeled JANG/vMLX in the picker and are launched automatically through a local vmlx serve subprocess. For families that need parser hints (for example Gemma 4), froggy also passes the matching --tool-call-parser / --reasoning-parser flags to vMLX.
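
The JANG check can be pictured as a simple file probe. This is a minimal sketch, assuming the only signal is the presence of jang_config.json in the checkpoint directory; the function name and the "other" label are hypothetical, and froggy's real discovery code may look at more than this.

```python
from pathlib import Path


def backend_label(model_dir: Path) -> str:
    """Return 'JANG/vMLX' when the checkpoint ships a jang_config.json."""
    if (model_dir / "jang_config.json").is_file():
        return "JANG/vMLX"
    return "other"  # hypothetical placeholder for every non-JANG backend
```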

Commands

`froggy chat`

Start an interactive chat session with a local model.

froggy chat                              # Auto-discover models and pick one
froggy chat --models-dir /path/to/models # Scan a specific directory
froggy chat --device cpu                 # Force CPU inference
froggy chat --tools-dir ./my_tools       # Load custom tool plugins

`froggy download`

Download a HuggingFace model to ~/.froggy/models/.

# Download by repo ID
froggy download mlx-community/Llama-3-8B-4bit

# Download by full HuggingFace URL
froggy download https://huggingface.co/TheBloke/Mistral-7B-GGUF

# Download a specific format
froggy download TheBloke/Mistral-7B --format gguf

# Interactively pick from available variants
froggy download TheBloke/Mistral-7B --pick

# Download a JANG model that needs vMLX at runtime
froggy download https://huggingface.co/dealignai/Gemma-4-31B-JANG_4M-CRACK --format safetensors

For large safetensors/JANG downloads, froggy disables the Hugging Face Xet transfer backend during the snapshot step, which avoids a hang commonly seen on macOS with very large files.
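
If you hit the same hang in your own scripts, a similar workaround is available through huggingface_hub. The sketch below assumes the HF_HUB_DISABLE_XET environment variable and the snapshot_download function; whether froggy uses exactly this mechanism is an assumption, and download_snapshot is a hypothetical helper name.

```python
import os

# Set this before huggingface_hub is imported: the library reads the
# setting from the environment when it is first imported.
os.environ["HF_HUB_DISABLE_XET"] = "1"


def download_snapshot(repo_id: str, target_dir: str) -> str:
    """Download a whole model repo with Xet disabled; returns the local path."""
    # Imported lazily so the environment variable above is already in place.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=target_dir)
```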

Option	Description
`--format [auto|mlx|gguf|safetensors]`	Model format to download (default: `auto`)
`--pick`	Interactively pick from available variants
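
The repo-ID and full-URL forms accepted by froggy download both boil down to an owner/name pair. A minimal sketch of that normalization, assuming a hypothetical helper called to_repo_id (froggy's own parsing in download.py may differ):

```python
from urllib.parse import urlparse


def to_repo_id(arg: str) -> str:
    """Accept 'owner/name' or a huggingface.co URL and return 'owner/name'."""
    if arg.startswith(("http://", "https://")):
        parsed = urlparse(arg)
        if parsed.netloc != "huggingface.co":
            raise ValueError(f"not a Hugging Face URL: {arg}")
        parts = [p for p in parsed.path.split("/") if p]
    else:
        parts = arg.split("/")
    if len(parts) < 2 or not all(parts[:2]):
        raise ValueError(f"expected owner/name, got: {arg}")
    # Anything after owner/name (for example /tree/main) is dropped.
    return "/".join(parts[:2])
```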

`froggy list`

List downloaded models in ~/.froggy/models/.

froggy list          # Pretty-printed table
froggy list --json   # Machine-readable JSON output

Option	Description
`--json`	Output as JSON

`froggy remove`

Remove a downloaded model.

froggy remove Llama-3-8B-4bit       # Prompts for confirmation
froggy remove Llama-3-8B-4bit -y    # Skip confirmation

Option	Description
`-y, --yes`	Skip confirmation prompt

`froggy info`

Show detailed information about a model (format, size, parameters, etc.).

froggy info Llama-3-8B-4bit

`froggy recommend`

Recommend models that match your hardware (GPU, CPU, RAM) via llmfit.

froggy recommend                          # Default recommendations
froggy recommend --limit 5               # Show top 5
froggy recommend --use-case coding        # Filter by use case
froggy recommend --json                   # Machine-readable JSON

Option	Description
`--limit INTEGER`	Max number of recommendations
`--use-case TEXT`	Target use case (e.g. `coding`, `chat`)
`--json`	Output as JSON

`froggy config`

View or modify froggy configuration. Settings are stored in ~/.froggy/config.yaml.

# Show all current settings
froggy config

# Get a specific setting
froggy config get device

# Set a value
froggy config set device mps
froggy config set format gguf
froggy config set ollama_host http://localhost:11434

Available config keys:

Key	Description	Example values
`device`	Inference device	`cpu`, `cuda`, `mps`
`format`	Preferred model format	`auto`, `mlx`, `gguf`, `safetensors`
`ollama_host`	Ollama server URL	`http://localhost:11434`

Chat Commands

Command	Description
`/help`	Show available commands
`/model`	Switch to a different model
`/system [prompt]`	Set or show the system prompt
`/temp [value]`	Set or show temperature (0.0 - 2.0)
`/tokens [value]`	Set or show max output tokens
`/info`	Show current session settings
`/clear`	Clear conversation history
`/quit`	Exit froggy
`/tools`	List available tools and their active state
`/tools on|off`	Enable or disable the tool system
`/tools add <name>`	Activate a specific tool for this session
`/tools remove <name>`	Deactivate a specific tool for this session
`/autorun`	Toggle auto-approve for tool calls (skip confirmation prompts)
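
To make the command syntax concrete, here is a small sketch of how a line such as /temp 0.7 could be split and validated. The helper names are hypothetical and this is not froggy's actual parser in session.py; only the 0.0 - 2.0 range comes from the table.

```python
from typing import Optional, Tuple


def parse_command(line: str) -> Optional[Tuple[str, str]]:
    """Split '/temp 0.7' into ('temp', '0.7'); return None for normal chat text."""
    if not line.startswith("/"):
        return None
    name, _, arg = line[1:].strip().partition(" ")
    return name.lower(), arg.strip()


def parse_temperature(arg: str) -> float:
    """Validate a /temp argument against the documented 0.0 - 2.0 range."""
    value = float(arg)  # raises ValueError on non-numeric input
    if not 0.0 <= value <= 2.0:
        raise ValueError("temperature must be between 0.0 and 2.0")
    return value
```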

Tool System

froggy includes a function-calling tool system that lets the model read files, run commands, and search the web. Tools use Hermes XML format for reliable detection across model families.
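
In the Hermes convention, the model wraps each call in <tool_call> tags around a JSON object with name and arguments keys. The sketch below extracts such calls from a finished response. It is a simplified, non-streaming illustration rather than froggy's tool_parser.py, and extract_tool_calls is a hypothetical name.

```python
import json
import re

# Each call looks like: <tool_call>{"name": ..., "arguments": {...}}</tool_call>
TOOL_CALL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)


def extract_tool_calls(text: str) -> list:
    """Return a list of {'name', 'arguments'} dicts found in the model output."""
    calls = []
    for match in TOOL_CALL_RE.finditer(text):
        try:
            payload = json.loads(match.group(1).strip())
        except json.JSONDecodeError:
            continue  # ignore malformed calls instead of failing the turn
        if isinstance(payload, dict) and "name" in payload:
            calls.append(
                {"name": payload["name"], "arguments": payload.get("arguments", {})}
            )
    return calls
```

For example, a response containing `<tool_call>{"name": "read_file", "arguments": {"path": "todo.txt"}}</tool_call>` yields one call named read_file; the path argument name here is illustrative only.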

Built-in Tools

Tool	Description	Safety tier
`read_file`	Read a file's contents	Auto-approve
`write_file`	Write or create a file	Confirm
`edit_file`	Replace a string in a file	Confirm
`run_shell`	Execute a shell command	Auto-approve / Confirm / Blocked (depends on the command)
`web_search`	Search the web (requires `duckduckgo_search`)	Auto-approve
`python_eval`	Evaluate Python code	Confirm

Safety Model

Tools run under a 3-tier safety model:

Auto-approve — low-risk operations run without prompting (read_file, web_search, safe shell commands)
Confirm — medium/high-risk operations prompt for user approval before running
Blocked — destructive commands (rm, sudo, curl, etc.) are never executed

On macOS, shell commands are wrapped in sandbox-exec to prevent filesystem writes outside /tmp.
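
A tier decision for shell commands could be sketched as follows. The blocked names are the examples listed above; the SAFE set, the function name, and the rule that any chaining or redirection operator forces a confirmation are assumptions made for illustration, not froggy's actual policy.

```python
import os
import shlex

BLOCKED = {"rm", "sudo", "curl"}     # examples named in the Blocked tier
SAFE = {"ls", "pwd", "cat", "echo"}  # hypothetical auto-approve set
OPERATORS = set(";&|<>()")           # chaining and redirection characters


def classify_shell(command: str) -> str:
    """Return 'blocked', 'auto', or 'confirm' for a shell command line."""
    lexer = shlex.shlex(command, posix=True, punctuation_chars=True)
    lexer.whitespace_split = True
    try:
        tokens = list(lexer)
    except ValueError:
        return "confirm"  # unbalanced quotes: leave the decision to the user
    if not tokens:
        return "confirm"
    # Compare base names so that /bin/rm is treated like rm.
    if any(os.path.basename(tok) in BLOCKED for tok in tokens):
        return "blocked"
    has_operator = any(tok[:1] in OPERATORS for tok in tokens)
    if tokens[0] in SAFE and not has_operator:
        return "auto"
    return "confirm"
```

The check is deliberately conservative: a blocked name anywhere on the line blocks the whole command, so something like `ls; sudo reboot` cannot slip through behind a safe first word.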

Custom Tool Plugins

Place .py files in a tools/ directory (or pass --tools-dir) to add custom tools. Each file should export either:

TOOL: a single ToolDef object
TOOLS: a list of ToolDef objects

# tools/my_tool.py
from froggy.tools import ToolDef, ToolParam

TOOL = ToolDef(
    name="list_todos",
    description="Return the current TODO list from todo.txt",
    params=[],
)

Files starting with _ are ignored. Broken files are skipped with a warning rather than crashing froggy.
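
The loading rules above can be sketched with importlib. This is an illustrative loader, not the one froggy ships: load_plugins is a hypothetical name, and the plugins are treated as opaque objects rather than real ToolDef instances.

```python
import importlib.util
import warnings
from pathlib import Path


def load_plugins(tools_dir: Path) -> list:
    """Collect TOOL / TOOLS objects from every plugin file in tools_dir."""
    found = []
    for path in sorted(tools_dir.glob("*.py")):
        if path.name.startswith("_"):
            continue  # files starting with _ are ignored
        try:
            spec = importlib.util.spec_from_file_location(path.stem, path)
            module = importlib.util.module_from_spec(spec)
            spec.loader.exec_module(module)
        except Exception as exc:  # broken plugin: warn and keep going
            warnings.warn(f"skipping {path.name}: {exc}")
            continue
        if hasattr(module, "TOOL"):
            found.append(module.TOOL)
        found.extend(getattr(module, "TOOLS", []))
    return found
```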

Environment Variables

Variable	Default	Description
`FROGGY_HOME`	`~/.froggy`	Base directory for models, config, and data
`FROGGY_TOOLS`	`0`	Set to `1` to enable tools at startup
`FROGGY_AUTORUN`	`0`	Set to `1` to auto-approve all tool calls
`FROGGY_PROJECT_ROOT`	`""`	Project root used to locate the `tools/` plugin directory
`FROGGY_MAX_TOOL_ROUNDS`	`5`	Maximum tool-call/response rounds per user message

# Use a custom home directory
FROGGY_HOME=~/my-models froggy chat

# Enable tools and autorun in one command
FROGGY_TOOLS=1 FROGGY_AUTORUN=1 froggy chat
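
Read together, the table amounts to a handful of lookups with defaults. A minimal sketch, assuming a hypothetical Settings dataclass and load_settings helper (froggy's internal handling may differ, for example in how it treats values other than 0 and 1):

```python
import os
from dataclasses import dataclass
from pathlib import Path


@dataclass
class Settings:
    home: Path
    tools_enabled: bool
    autorun: bool
    project_root: str
    max_tool_rounds: int


def load_settings(env=os.environ) -> Settings:
    """Read the FROGGY_* variables, falling back to the documented defaults."""
    return Settings(
        home=Path(env.get("FROGGY_HOME", "~/.froggy")).expanduser(),
        tools_enabled=env.get("FROGGY_TOOLS", "0") == "1",
        autorun=env.get("FROGGY_AUTORUN", "0") == "1",
        project_root=env.get("FROGGY_PROJECT_ROOT", ""),
        max_tool_rounds=int(env.get("FROGGY_MAX_TOOL_ROUNDS", "5")),
    )
```

Passing the environment as a parameter keeps the function easy to test with a plain dictionary.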

Supported Backends

Backend	Status	Formats
HuggingFace Transformers	Working	SafeTensors, PyTorch bins, LoRA adapters
Apple MLX	Working	SafeTensors (Apple Silicon only, requires `mlx-lm`)
llama.cpp	Working	GGUF (requires `llama-cli` on PATH)
Ollama	Working	Any model available on your Ollama server

MLX Setup (Apple Silicon)

On Apple Silicon Macs, froggy automatically uses MLX when mlx-lm is installed — no configuration needed. MLX models from HuggingFace (e.g., from mlx-community) work out of the box:

# Install MLX support
pip install ".[mlx]"

# Download an MLX model
froggy download mlx-community/Llama-3-8B-4bit

# Launch froggy — MLX backend is auto-selected
froggy chat

Ollama Setup

If you have Ollama running, froggy will automatically discover its models:

# Start Ollama (if not already running)
ollama serve

# Pull a model
ollama pull llama3

# Launch froggy — Ollama models appear automatically
froggy chat
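
Discovery of this kind can be done through Ollama's /api/tags endpoint, which lists locally available models. The sketch below is an assumption about how such a lookup could work, not froggy's discovery.py; the function names are hypothetical.

```python
import json
import urllib.request


def names_from_tags(payload) -> list:
    """Pull model names out of a decoded /api/tags response body."""
    if not isinstance(payload, dict):
        return []
    models = payload.get("models") or []
    return [m["name"] for m in models if isinstance(m, dict) and "name" in m]


def list_ollama_models(host="http://localhost:11434", timeout=2.0) -> list:
    """Return model names from a running Ollama server, or [] if unreachable."""
    url = host.rstrip("/") + "/api/tags"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return names_from_tags(json.load(resp))
    except (OSError, ValueError):
        # OSError covers connection failures (URLError is a subclass);
        # ValueError covers malformed JSON.
        return []
```

Returning an empty list when the server is down matches the behavior described above, where Ollama models simply appear in the menu when a server is running.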

Project Structure

froggy/
  __init__.py         # Package init
  __main__.py         # Entry point (python -m froggy)
  cli.py              # CLI interface and model selection
  config.py           # YAML config persistence (load/save/get/set)
  paths.py            # Path helpers (~/.froggy resolution, FROGGY_HOME)
  download.py         # HuggingFace model downloader
  models.py           # Model listing, removal, and info
  llmfit.py           # Hardware-matched model recommendations
  backends.py         # Inference backends (Transformers, MLX, llama.cpp, Ollama)
  discovery.py        # Local + Ollama model discovery and validation
  session.py          # Chat session, tool loop, and command handling
  tools.py            # Tool registry and core tool definitions
  tool_parser.py      # Streaming-aware tool-call parser (Hermes XML + JSON)
  tool_executor.py    # Tool executor with 3-tier safety model
  tool_selector.py    # Tool filtering helper
tests/
  test_backends.py          # Backend selection and loading tests
  test_cli_config.py        # Config CLI command tests
  test_cli_group.py         # CLI group structure tests
  test_cli_management.py    # Model management CLI tests (download/list/remove/info/recommend)
  test_commands.py          # Chat command parsing tests
  test_config.py            # Config module unit tests
  test_custom_tools.py      # Custom tool plugin loader tests
  test_discovery.py         # Model discovery tests
  test_download.py          # Download module tests
  test_llmfit.py            # Recommendation engine tests
  test_mlx.py               # MLX backend and platform detection tests
  test_models.py            # Model list/remove/info tests
  test_ollama.py            # Ollama backend and discovery tests
  test_paths.py             # Path helper tests
  test_streaming.py         # Thinking filter and stop-string tests
  test_tool_executor.py     # Tool executor tests
  test_tool_integration.py  # Tool loop integration tests
  test_tool_parser.py       # Tool parser tests
  test_tool_safety.py       # Tool safety model tests
  test_tools.py             # Tool definition tests
tools/
  (place custom tool plugins here)

Requirements

Python 3.11+
click and rich (installed automatically)
pyyaml (installed automatically — used for config persistence)
huggingface-hub (installed automatically — used for model downloads)
For GPU inference: PyTorch, Transformers, Accelerate, PEFT
For Apple Silicon: mlx, mlx-lm
For tool use with web search: duckduckgo_search (pip install ".[tools]")

Running Tests

pip install pytest
pytest

License

MIT

Project details

Release history

Version	Release date
0.2.0 (this version)	Apr 7, 2026
0.1.1	Apr 6, 2026
0.1.0	Apr 6, 2026

Download files

Source Distribution

froggy-0.2.0.tar.gz (85.0 kB), uploaded Apr 7, 2026, Source

Built Distribution

froggy-0.2.0-py3-none-any.whl (53.8 kB), uploaded Apr 7, 2026, Python 3

File details

Details for the file froggy-0.2.0.tar.gz.

File metadata

Download URL: froggy-0.2.0.tar.gz
Upload date: Apr 7, 2026
Size: 85.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for froggy-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`d849d24cb4cce378058e890cd477829b30831810898b37546fa6ab767dad329a`
MD5	`092ffd3bbede98cb1d67fdcfb6914704`
BLAKE2b-256	`7fb0dc60d125deb3a9dba0d46d260ec6ec405c4be550a3786b81c050e5aa02b2`

File details

Details for the file froggy-0.2.0-py3-none-any.whl.

File metadata

Download URL: froggy-0.2.0-py3-none-any.whl
Upload date: Apr 7, 2026
Size: 53.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for froggy-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`97f005ad5b27359f63ea2f67a3594bcbf0a85df3799d94b8cc7fe70e65f45d32`
MD5	`179631ded6b797ce90f225fc78ce2010`
BLAKE2b-256	`a3f14172c855556fb0e520dc63af9909d86670d18f8da7499eaa4ef1c9217ff3`
