Self-evolving AI agents via LoRA: just talk to your agent, it learns.

Project description

🦎 EvoClaw

Just talk to your agent; it learns and EVOLVES.

No GPU Required · Fully Async · Skill Evolution

EvoClaw turns live conversations into continuous training data, automatically.
Works with any OpenAI-compatible API. Uses free Groq for PRM scoring. Trains with Tinker cloud LoRA.


🔥 What is EvoClaw?

EvoClaw wraps your existing AI agent behind an OpenAI-compatible proxy. On every conversation turn:

  1. The response is scored by a PRM (Process Reward Model) via Groq
  2. Skills are extracted from high-quality responses and stored
  3. Stored skills are injected into future prompts (immediate improvement, no retraining needed)
  4. Failed turns trigger automatic skill evolution via an LLM
  5. All turns feed Tinker LoRA training (GRPO or OPD)

After every batch_size samples, updated weights are saved to Tinker with no service interruption.
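
The per-turn flow can be pictured roughly like this (a conceptual sketch only; the names, data structures, and selection logic below are illustrative assumptions, not EvoClaw's actual internals):

PRM_THRESHOLD = 0.65
skill_bank = []        # learned skills, injected into later prompts
training_buffer = []   # samples queued for the next Tinker train step

def handle_turn(user_msg, call_upstream, score_with_prm, evolve_skill):
    # 3. inject recently learned skills into the system prompt
    system = "Apply these learned skills:\n" + "\n".join(skill_bank[-5:])
    reply = call_upstream(system, user_msg)        # forward to the upstream API
    score = score_with_prm(user_msg, reply)        # 1. PRM score via Groq
    if score >= PRM_THRESHOLD:
        skill_bank.append(f"Good pattern: {reply[:80]}")    # 2. extract and store
    else:
        skill_bank.append(evolve_skill(user_msg, reply))    # 4. evolve a corrective skill
    training_buffer.append({"user": user_msg, "assistant": reply, "reward": score})  # 5. feed training
    return reply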


🚀 Quick Start

pip install evoclaw

evoclaw init   # enter your Groq + Tinker API keys
evoclaw start  # proxy starts on localhost:8080

Then point your existing OpenAI client at EvoClaw:

from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="any-string",  # Not checked by proxy
)

# Just use it normally; EvoClaw learns in the background
response = client.chat.completions.create(
    model="llama-3.3-70b-versatile",
    messages=[{"role": "user", "content": "Explain impermanent loss"}]
)

That's it. Start chatting. EvoClaw learns automatically.


🤖 Key Features

Skill Injection

At every turn, the most relevant learned skills are injected into the system prompt.
Immediate behavior improvement, no waiting for retraining.
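
A rough picture of what injection amounts to (the keyword-overlap ranking here is a stand-in; EvoClaw's actual relevance selection may differ):

def inject_skills(messages, skills, k=3):
    # Rank learned skills by naive keyword overlap with the user's messages
    user_text = " ".join(m["content"] for m in messages if m["role"] == "user").lower()
    ranked = sorted(skills, key=lambda s: -sum(w in user_text for w in s.lower().split()))
    skill_block = "\n".join(f"- {s}" for s in ranked[:k])
    # Prepend the top-k skills as a system message
    system = {"role": "system", "content": "Apply these learned skills:\n" + skill_block}
    return [system] + messages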

Skill Evolution

When the agent fails (low PRM score), EvoClaw uses an LLM to generate a new skill
that would have prevented the failure. Over time, the skill bank grows smarter.
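
For illustration, skill evolution can be thought of as one extra LLM call, reusing the OpenAI-compatible client from the Quick Start (the prompt wording and model choice below are assumptions, not EvoClaw's exact prompt):

def evolve_skill(client, user_msg, bad_reply):
    # Ask an LLM to write the instruction that would have prevented the failure
    prompt = (
        "The assistant answered poorly.\n"
        f"User: {user_msg}\nAssistant: {bad_reply}\n"
        "Write one short, general instruction (a skill) that would have "
        "prevented this failure. Reply with the instruction only."
    )
    out = client.chat.completions.create(
        model="llama-3.3-70b-versatile",
        messages=[{"role": "user", "content": prompt}],
    )
    return out.choices[0].message.content.strip()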

Tinker LoRA Training

All conversations feed into online LoRA training via Tinker.
No GPU required. Updated weights are hot-swapped with no downtime.

Two Learning Modes

  • GRPO: Reinforcement learning from implicit conversation rewards
  • OPD: On-policy distillation from high-quality responses
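
To give a feel for the difference (a simplified sketch, not EvoClaw's trainer code): GRPO turns the PRM scores of a batch into group-relative advantages, while OPD keeps high-scoring turns as supervised targets.

def grpo_advantages(rewards):
    # Group-relative advantages: centre each reward on the batch mean
    # (real GRPO typically also divides by the group's standard deviation).
    mean = sum(rewards) / len(rewards)
    return [r - mean for r in rewards]

def opd_targets(samples, threshold=0.65):
    # On-policy distillation: keep only high-scoring turns as supervised targets.
    return [s for s in samples if s["reward"] >= threshold]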

Works with Any Provider

Unlike MetaClaw (OpenClaw + Kimi-2.5 only), EvoClaw works with:

  • Groq (free, recommended)
  • OpenAI
  • Anthropic
  • Any OpenAI-compatible endpoint

โš™๏ธ Configuration

All settings in EvoClawConfig:

Field                   Default               Description
model_name              Qwen/Qwen3-4B         Tinker base model
lora_rank               32                    LoRA rank
batch_size              32                    Samples before train step
loss_fn                 importance_sampling   grpo / opd / cross_entropy
use_prm                 True                  PRM scoring
prm_threshold           0.65                  Min score to learn from
use_skills              True                  Skill injection
enable_skill_evolution  True                  Auto-generate skills from failures
proxy_port              8080                  Proxy listen port
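
For example, the defaults above map to a config like this (assuming the fields are plain keyword arguments and that EvoClawConfig is importable from the package root):

from evoclaw import EvoClawConfig  # import path assumed from the package name

config = EvoClawConfig(
    model_name="Qwen/Qwen3-4B",        # Tinker base model
    lora_rank=32,
    batch_size=32,                     # samples before each train step
    loss_fn="importance_sampling",     # or "grpo" / "opd" / "cross_entropy"
    use_prm=True,
    prm_threshold=0.65,                # minimum PRM score to learn from
    use_skills=True,
    enable_skill_evolution=True,
    proxy_port=8080,
)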

💪 Skill Packs

Pre-built skills for common domains:

config = EvoClawConfig(
    skill_packs=["general", "coding", "crypto", "defi", "security", "agentic"]
)

🔄 Training Loop Example

python examples/run_conversation_rl.py           # GRPO mode
python examples/run_conversation_rl.py --mode opd  # OPD mode
python examples/run_conversation_rl.py --no-train  # Skill injection only

Train from your own conversation file:

evoclaw train --file conversations.jsonl
# Format: {"user": "...", "assistant": "..."}
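
For instance, a compatible file can be written like this (the example turns are placeholders):

import json

turns = [
    {"user": "Explain impermanent loss", "assistant": "Impermanent loss is ..."},
    {"user": "How do I hedge it?", "assistant": "One common approach is ..."},
]
# One JSON object per line, matching the {"user": ..., "assistant": ...} format
with open("conversations.jsonl", "w") as f:
    for turn in turns:
        f.write(json.dumps(turn) + "\n")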

📊 Monitor Progress

evoclaw status        # Skills + trainer status
evoclaw skills        # List all learned skills  
evoclaw skills --category crypto  # Filter by category

๐Ÿ—๏ธ Architecture

User/Agent
    │
    ▼
┌─────────────────────────────────┐
│  EvoClaw Proxy (localhost:8080) │
│  - Inject skills into prompt    │
│  - Forward to upstream API      │
│  - Score response async (Groq)  │
│  - Evolve skills on failure     │
│  - Feed samples to Tinker       │
└─────────────────────────────────┘
    │              │
    ▼              ▼
Groq API     Tinker LoRA
(responses)  (training)

📄 License

MIT

Acknowledgements

Built on top of MetaClaw, Tinker, and Groq.


Download files

Download the file for your platform.

Source Distribution

evoclaw-0.2.0.tar.gz (25.6 kB)

Uploaded Source

Built Distribution


evoclaw-0.2.0-py3-none-any.whl (28.0 kB)

Uploaded Python 3

File details

Details for the file evoclaw-0.2.0.tar.gz.

File metadata

  • Download URL: evoclaw-0.2.0.tar.gz
  • Upload date:
  • Size: 25.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for evoclaw-0.2.0.tar.gz
Algorithm Hash digest
SHA256 efe4d21eea4d46c7313e3a8c18f73394cab02b29de557787c143a48a894063cc
MD5 052e6e57bfef70574cf8bf26766753e8
BLAKE2b-256 147d174acf000e802d9c4b593fb90b1caac8508a6f1311e26ef8fda94ad3a023


File details

Details for the file evoclaw-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: evoclaw-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 28.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for evoclaw-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4159875b5735cd60bcbc8194041dbab3ebfe9d8f26bd9a0762de00f45245280f
MD5 c05125fc0dfba5a4c555391c6391e32d
BLAKE2b-256 f2f90f7044ae877b3cbb7833ca33529732b6ece63811f5d7aa29b4bca4a74e11

