WANI — Local & Cloud AI inference. Slash commands, tools, zero bloat.
Project description
wfw-ai · WANI v3.1
Local and cloud AI inference. Slash commands. Zero bloat.
pip install wfw-ai
wani
Features
- Local GGUF models via llama-cpp-python
- Cloud: Groq (free) · OpenAI · Anthropic · Gemini · HuggingFace · Ollama
- Auto hardware detection (Oppo A6x, Snapdragon, desktop)
- KV cache quantization — 50% less RAM
- Flash Attention + mmap
- HuggingFace direct download with resume
- Slash commands:
/model /download /connect /system /edit /run /read /write /stats - Shell shortcut:
//ls -la
Install
pip install wfw-ai # core (cloud only)
pip install "wfw-ai[local]" # + local GGUF support
pip install "wfw-ai[all]" # everything
Usage
wani # auto mode
wani --cloud groq --key gsk_... # groq cloud
wani --model mymodel.gguf # specific model
wani --download llama-3.2-3b # download from HF
wani --list # list models
wani --prompt "Hello" --cloud groq # single prompt
Slash Commands
| Command | Action |
|---|---|
/model |
list & load local models |
/download |
download from HuggingFace |
/connect |
manage cloud connections |
/system <text> |
set system prompt |
/edit <file> |
open file in editor |
/run <cmd> |
run shell command |
/read <file> |
load file into context |
/write <file> |
save response to file |
/stats |
hardware + session stats |
/clear |
clear history |
//cmd |
shell shortcut |
Environment Variables
export GROQ_API_KEY=gsk_...
export HF_TOKEN=hf_...
export ANTHROPIC_API_KEY=sk-ant-...
export GEMINI_API_KEY=...
export OPENAI_API_KEY=sk-...
Oppo A6x / 4GB Devices
Auto-detected. Optimal settings applied: 4 threads, Q8 KV cache, mmap on, Flash Attention on.
Recommended models:
wani --download llama-3.2-1b— 700MBwani --download llama-3.2-3b— 1.8GB ← best for 4GB
by Zain Ali · MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wfw_ai-3.1.0.tar.gz
(21.2 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
wfw_ai-3.1.0-py3-none-any.whl
(24.1 kB
view details)
File details
Details for the file wfw_ai-3.1.0.tar.gz.
File metadata
- Download URL: wfw_ai-3.1.0.tar.gz
- Upload date:
- Size: 21.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-requests/2.34.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
85604054c810990119ee51b73fb92e55780a8f723212e6dfb77bf644256497a6
|
|
| MD5 |
53aa2ab618576ddb6abbdde4fcbd42e1
|
|
| BLAKE2b-256 |
5fe2f5472e3f618cb0891d512b523afae93798f9b03ab23eb74d09209cba5489
|
File details
Details for the file wfw_ai-3.1.0-py3-none-any.whl.
File metadata
- Download URL: wfw_ai-3.1.0-py3-none-any.whl
- Upload date:
- Size: 24.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-requests/2.34.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1edb45ac6300e32afdcceb18dd60f6074025190d5d9a89c9992209de06b55e1b
|
|
| MD5 |
6f9bf15a1f7aa5866a0bb29562eb0cb5
|
|
| BLAKE2b-256 |
d808f35ec019a84c400f5086a6bc7b1e930df971dcc6d9a899cf776f74703f95
|