Skip to main content

WANI — Wave-Attractor Neural Inference Framework

Project description

WANI — Wave-Attractor Neural Inference

Powered by Zain Ali

Run any GGUF model locally. Drop model in folder, run one command.


Folder Structure

WANI/
├── wani.py          ← Main tool (run this)
├── requirements.txt
├── models/          ← Put your .gguf files HERE
│   └── (empty)
├── logs/
│   └── wani.log     ← Auto-created
└── core/
    ├── config.py
    ├── scanner.py
    ├── loader.py
    └── __init__.py

Setup (3 steps)

Step 1 — Install Python 3.8+

python --version

Step 2 — Install backend

# CPU only (works on all systems including ARM/Android)
pip install llama-cpp-python

# OR with GPU support (Vulkan — works on Adreno 610/650)
CMAKE_ARGS="-DLLAMA_VULKAN=on" pip install llama-cpp-python --force-reinstall

Step 3 — Add your model

Download any GGUF model from:
  https://huggingface.co/models?search=gguf

Put the .gguf file inside:  WANI/models/

Example models to try:
  • Llama-3.2-1B-Instruct-Q4_K_M.gguf    (~700 MB)
  • Llama-3.2-3B-Instruct-Q4_K_M.gguf    (~1.8 GB)
  • Mistral-7B-Instruct-v0.3.Q4_K_M.gguf (~4.1 GB)
  • Phi-3-mini-4k-instruct-Q4_K_M.gguf   (~2.2 GB)

Usage

# Interactive mode (auto-scan + pick model)
python wani.py

# List all models
python wani.py --list

# Load specific model, start chat
python wani.py --model Llama-3.2-1B-Instruct-Q4_K_M.gguf --chat

# Single prompt
python wani.py --model yourmodel.gguf --prompt "Pakistan ki capital kya hai?"

# With custom settings
python wani.py --model yourmodel.gguf --threads 4 --ctx 4096 --temp 0.8

Options

Flag Default Description
--model FILE auto-pick Model filename from models/
--prompt TEXT Single prompt, then exit
--chat Force chat mode
--ctx N 2048 Context window (tokens)
--threads N 4 CPU threads
--gpu N 0 GPU offload layers (0=CPU)
--temp F 0.7 Temperature (creativity)
--max N 512 Max tokens to generate
--list List models and exit
--install Show install instructions

Recommended Models by RAM

RAM Model Download
1 GB Llama-3.2-1B Q4 HuggingFace
2 GB Phi-3-mini Q4 HuggingFace
3 GB Llama-3.2-3B Q4 HuggingFace
4 GB Mistral-7B Q4 HuggingFace
8 GB LLaMA-13B Q4 HuggingFace

Snapdragon 685 Tips

# Use 4 big cores only
python wani.py --threads 4

# Small context saves RAM
python wani.py --ctx 1024

# 1B or 3B models run best
# Use Q4_K_M quantization (best quality/size ratio)

Chat Commands

While in chat mode:

  • stats — show session stats (TPS, tokens generated)
  • clear — reset conversation history
  • help — show commands
  • quit — exit

WANI Framework v1.0.0 — Powered by Zain Ali

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wfw_ai-1.0.0.tar.gz (11.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wfw_ai-1.0.0-py3-none-any.whl (14.7 kB view details)

Uploaded Python 3

File details

Details for the file wfw_ai-1.0.0.tar.gz.

File metadata

  • Download URL: wfw_ai-1.0.0.tar.gz
  • Upload date:
  • Size: 11.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.34.2

File hashes

Hashes for wfw_ai-1.0.0.tar.gz
Algorithm Hash digest
SHA256 43300f70ac52a9cb26b10c6c3239c71e620f3d04568c24eaf510ae0f6b26ca09
MD5 8a5abdaf4b8750418e933f2f7a467820
BLAKE2b-256 3f4e847aaba51998383cf44e56dfc1e423e7f8c0a6bebf9e0297da0a78e5e645

See more details on using hashes here.

File details

Details for the file wfw_ai-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: wfw_ai-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 14.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.34.2

File hashes

Hashes for wfw_ai-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fa07c6ba60179da8287b7fc477bd253c6105b7658558930480b8fa51f17c4b20
MD5 a6c1168b7d5dc018fdb0f002fc252975
BLAKE2b-256 ea4b63a35d08fbfc5621b942ee59b0e5996467d1694da76a21906bb16329367a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page