Skip to main content

WANI — Local & Cloud AI inference. Slash commands, tools, zero bloat.

Project description

wfw-ai · WANI v3.1

Local and cloud AI inference. Slash commands. Zero bloat.

pip install wfw-ai
wani

Features

  • Local GGUF models via llama-cpp-python
  • Cloud: Groq (free) · OpenAI · Anthropic · Gemini · HuggingFace · Ollama
  • Auto hardware detection (Oppo A6x, Snapdragon, desktop)
  • KV cache quantization — 50% less RAM
  • Flash Attention + mmap
  • HuggingFace direct download with resume
  • Slash commands: /model /download /connect /system /edit /run /read /write /stats
  • Shell shortcut: //ls -la

Install

pip install wfw-ai                        # core (cloud only)
pip install "wfw-ai[local]"               # + local GGUF support
pip install "wfw-ai[all]"                 # everything

Usage

wani                                      # auto mode
wani --cloud groq --key gsk_...          # groq cloud
wani --model mymodel.gguf                # specific model
wani --download llama-3.2-3b             # download from HF
wani --list                              # list models
wani --prompt "Hello" --cloud groq       # single prompt

Slash Commands

Command Action
/model list & load local models
/download download from HuggingFace
/connect manage cloud connections
/system <text> set system prompt
/edit <file> open file in editor
/run <cmd> run shell command
/read <file> load file into context
/write <file> save response to file
/stats hardware + session stats
/clear clear history
//cmd shell shortcut

Environment Variables

export GROQ_API_KEY=gsk_...
export HF_TOKEN=hf_...
export ANTHROPIC_API_KEY=sk-ant-...
export GEMINI_API_KEY=...
export OPENAI_API_KEY=sk-...

Oppo A6x / 4GB Devices

Auto-detected. Optimal settings applied: 4 threads, Q8 KV cache, mmap on, Flash Attention on.

Recommended models:

  • wani --download llama-3.2-1b — 700MB
  • wani --download llama-3.2-3b — 1.8GB ← best for 4GB

by Zain Ali · MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wfw_ai-3.1.0.tar.gz (21.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wfw_ai-3.1.0-py3-none-any.whl (24.1 kB view details)

Uploaded Python 3

File details

Details for the file wfw_ai-3.1.0.tar.gz.

File metadata

  • Download URL: wfw_ai-3.1.0.tar.gz
  • Upload date:
  • Size: 21.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.34.2

File hashes

Hashes for wfw_ai-3.1.0.tar.gz
Algorithm Hash digest
SHA256 85604054c810990119ee51b73fb92e55780a8f723212e6dfb77bf644256497a6
MD5 53aa2ab618576ddb6abbdde4fcbd42e1
BLAKE2b-256 5fe2f5472e3f618cb0891d512b523afae93798f9b03ab23eb74d09209cba5489

See more details on using hashes here.

File details

Details for the file wfw_ai-3.1.0-py3-none-any.whl.

File metadata

  • Download URL: wfw_ai-3.1.0-py3-none-any.whl
  • Upload date:
  • Size: 24.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.34.2

File hashes

Hashes for wfw_ai-3.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1edb45ac6300e32afdcceb18dd60f6074025190d5d9a89c9992209de06b55e1b
MD5 6f9bf15a1f7aa5866a0bb29562eb0cb5
BLAKE2b-256 d808f35ec019a84c400f5086a6bc7b1e930df971dcc6d9a899cf776f74703f95

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page