Skip to main content

An open stack to build, RL-post-train, and deploy frontier robot policies.

Project description

phylo

An open stack to build, RL-post-train, and deploy frontier robot policies — from pretraining to the real world.

phylo is the unified framework behind the WoRV flagship robot-foundation-model effort. It does not reimplement training or inference; it borrows mature components (HuggingFace Trainer, vLLM, LeRobot data, the vla-eval benchmark harness, starVLA model code) and unifies them behind one light surface.

Keystone: one serving stack is used identically for RL rollout, evaluation, and deployment — parity-tested, so what you train against is what you ship.

Architecture

A dual-system policy (PonderPounce lineage):

  • cortex — System-2 Foundation Cortex: a reusable, shared "brain" that reads context (human demonstration, memory, rules, current observation) and emits action-relevant cognition tokens.
  • reflex — System-1 execution layer: turns cognition tokens into a specific embodiment's actions. Small, swappable per robot. Adapted via SFT + Reflex RL (RLT-style actor-critic on the frozen cortex's cognition tokens).
  • platform — shared infra: the parity serving stack (rollout = eval = deploy), data/checkpoint registries, vla-eval integration.

Training recipe (stages)

[0] action-prior warm-up → [1] cortex pretrain (web-VLM co-training + cross-embodiment robot data) → [2] reflex SFT → [3] reflex RL (+ real-world online)

VLM-handling is a config dimension, not a fixed default — independent tune_vision / tune_llm / tune_projector flags + a Knowledge-Insulation (stop-gradient) option + per-stage freeze transitions; the default follows the imported model's published recipe.

Status

Early scaffold. The Reflex RL prototype (RLT on sim) is validated end-to-end on vla-eval (SimplerEnv × Qwen3-GR00T) — the server lives at vla-evaluation-harness/src/vla_eval/model_servers/starvla_rlt.py and will be folded into reflex/.

Borrows (heavy internals)

HuggingFace Trainer/Transformers · vLLM · LeRobot dataset format · vla-eval (benchmark harness + serving protocol) · starVLA (QwenGR00T model code).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phylo-0.0.0.tar.gz (2.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

phylo-0.0.0-py3-none-any.whl (3.5 kB view details)

Uploaded Python 3

File details

Details for the file phylo-0.0.0.tar.gz.

File metadata

  • Download URL: phylo-0.0.0.tar.gz
  • Upload date:
  • Size: 2.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.24 {"installer":{"name":"uv","version":"0.11.24","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for phylo-0.0.0.tar.gz
Algorithm Hash digest
SHA256 ade49178838a48b071148b285744354646c05ad3d11096ea664c7a756a75d1a3
MD5 894f2737e4d53ecab186820a64a52397
BLAKE2b-256 7699f8e2bad4f15254d2e8a456783825698503d8ea79a837218d8c9b5bdade8e

See more details on using hashes here.

File details

Details for the file phylo-0.0.0-py3-none-any.whl.

File metadata

  • Download URL: phylo-0.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.24 {"installer":{"name":"uv","version":"0.11.24","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"22.04","id":"jammy","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for phylo-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e7b8e11ca3be39a63a59af677d0480355445f8fe8fc30e482744879197fc9ca8
MD5 2efdf6f316a72cc426ca9b9e6e31f645
BLAKE2b-256 2e9e2e17788f05be9fa83f1dfb8faf63d347b9834b7e99dd7fd5b856cc355483

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page