Modular VLA and Environment interfaces.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

VLAgents

VLAgents is a python library that allows to separate next action prediction from policy networks from action execution in simulated or real environments. It defines an interface for policies and for environments. The policies run independent in their own virtual environment, potentially on a different computer, and can be queried for an action (in principle similar to the chatgpt api).

Why is this useful?

Separation of dependencies by using two different python environments: Some times dependencies contradict e.g. pytorch and jax
Some robot hardware requires a real time linux kernel which does not easily allow you to use an Nvidia GPU.
Separate deployment and model code

This library is a byproduct of the Refined Policy Distillation (RPD) paper which distilled VLAs into expert policies using Reinforcement Learning. The work also includes a section on related engineering challenges regarding jax and pytorch.

Installation

Pip Installation (Recommended)

pip install vlagents

Local Installation

git clone https://https://github.com/RobotControlStack/vlagents.git
cd vlagents
pip install -ve .

Environment and Policy Installation

On top of vlagents you can then install a simulation environment where the agent acts. We currently the following environments:

In order to avoid dependency conflicts, use a second conda/pip environment to install your policy. We currently support the following policies:

LeRobot

pip install 'lerobot[all]'

Octo

To use Octo as an agent/policy you need to create a new conda environment:

conda create -n octo python=3.10
conda activate octo
conda install nvidia/label/cuda-11.8.0::cuda --no-channel-priority
conda install conda-forge::cudnn=8.9
# octo dependencies
pip install git+https://github.com/octo-models/octo.git@241fb3514b7c40957a86d869fecb7c7fc353f540
pip install -r vlagents/utils/fixed_octo_requirements.txt
# for gpu support:
pip install --upgrade "jax[cuda11_pip]==0.4.20" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

Verify that the jax installation was successful and that jax finds your gpu. Open a python shell in the same conda env and type

from jax.lib import xla_bridge
# this should output "gpu" if the gpu installation was successful
print(xla_bridge.get_backend().platform)

Install the vlagents library on top:

pip install git+https://github.com/juelg/vlagents.git

For more details, see the Octo github page.

Troubleshooting

If pip complains about dependency issues than it might have happened that torch somehow slipped in. Check if you have any torch packages installed by

pip freeze | grep torch
# if any, uninstall them e.g.
pip uninstall arm_pytorch_utilities
pip uninstall pytorch-seed
pip uninstall pytorch_kinematics

OpenVLA

To use OpenVLA, create a new conda environment:

conda create -n openvla python=3.10 -y
conda activate openvla
conda install pytorch torchvision torchaudio pytorch-cuda=12.4 -c pytorch -c nvidia -y

Install flash attention:

pip install packaging ninja
ninja --version; echo $?  # Verify Ninja --> should return exit code "0"
pip install "flash-attn==2.5.5" --no-build-isolation
# if you run into issues try `pip cache remove flash_attn` first

Install OpenVLA

pip install git+https://github.com/openvla/openvla.git@46b752f477cc5773cc1234b2e82c0e2130e4e890

Install the vlagents library on top:

pip install git+https://github.com/juelg/vlagents.git

For more details, see the OpenVLA github page.

OpenPi / Pi0

To use OpenPi, create a new conda environment:

conda create -n openpi python=3.11 -y
conda activate openpi

Clone the repo and install it.

git clone --recurse-submodules git@github.com:Physical-Intelligence/openpi.git
# Or if you already cloned the repo:
git submodule update --init --recursive
# install dependencies
GIT_LFS_SKIP_SMUDGE=1 uv sync
GIT_LFS_SKIP_SMUDGE=1 uv pip install -e .

For more details see openpi's github.

vjEPA2-ac

To use VJEPA2-AC, create a new conda environment:

conda create -n vjepa2 python=3.12
conda activate vjepa2

Clone the repo and install it.

git clone git@github.com:facebookresearch/vjepa2.git
cd vjepa2
pip install -e .

pip install git+https://github.com/juelg/vlagents.git
pip install -ve .

Diffusion Policy

Currently located on the branch diffusion_policy.

Usage

To start an vlagents server use the start-server command where kwargs is a dictionary of the constructor arguments of the policy you want to start e.g.

# lerobot act (n_action_steps is the executed horizon of the action chunk)
python -m vlagents start-server lerobot --port 8080 --host 0.0.0.0 --kwargs '{"policy_name": "act", "checkpoint_path": "<path to pretrained_model>", "n_action_steps": 1}'

# lerobot pi05
python -m vlagents start-server lerobot --port 20000 --host 0.0.0.0 --kwargs '{"policy_name": "pi05", "checkpoint_path": "<path to pretrained_model>", "n_action_steps": 1}'

# lerobot xvla
uv run python -m vlagents start-server lerobot --port 20000 --host 0.0.0.0 --kwargs '{"policy_name": "xvla", "checkpoint_path": "<path to pretrained_model>", "n_action_steps": 1, "rename_map": {"head": "image", "left_wrist": "image2", "right_wrist": "image3"}}'


# octo
python -m vlagents start-server octo --host localhost --port 8080 --kwargs '{"checkpoint_path": "hf://Juelg/octo-base-1.5-finetuned-maniskill", "checkpoint_step": None, "horizon": 1, "unnorm_key": []}'

# openvla
python -m vlagents start-server openvla --host localhost --port 8080 --kwargs '{"checkpoint_path": "Juelg/openvla-7b-finetuned-maniskill", "device": "cuda:0", "attn_implementation": "flash_attention_2", "unnorm_key": "maniskill_human:7.0.0", "checkpoint_step": 40000}'

# openpi
python -m vlagents start-server openpi --port=8080 --host=localhost --kwargs='{"checkpoint_path": "<path to checkpoint>/{checkpoint_step}", "model_name": "pi0_rcs", "checkpoint_step": <checkpoint_step>}' # leave "{checkpoint_step}" it will be replaced, "model_name" is the key for the training config

# vjepa2-ac
python -m vlagents start-server vjepa --port=20997 --host=0.0.0.0 --kwargs='{"cfg_path": "configs/inference/vjepa2-ac-vitg/<your_config>.yaml", "model_name": "vjepa2_ac_vit_giant", "default_checkpoint_path": "../.cache/torch/hub/checkpoints/vjepa2-ac-vitg.pt"}'

There is also the run-eval-during-training command to evaluate a model during training, so a single checkpoint. The run-eval-post-training command evaluates a range of checkpoints in parallel. In both cases environment and arguments as well as policy and arguments and wandb config for logging can be passed as CLI arguments.

Adding your own environment

from vlagents.evaluator_envs import EvaluatorEnv, Obs, Act
from typing import Any

class YourEnv(EvaluatorEnv):

    def translate_obs(self, obs: dict[str, Any]) -> Obs:
        # translated your observation
        return Obs()

    def step(self, action: Act) -> tuple[Obs, float, bool, bool, dict]:
        # step your env
        obs, reward, success, truncated, info = self.env.step(action)
        return self.translate_obs(obs), reward, success, truncated, info

    def reset(self, seed: int | None = None, options: dict[str, Any] | None = None) -> tuple[Obs, dict[str, Any]]:
        obs, info = self.env.reset()
        return self.translate_obs(obs), info

    @property
    def language_instruction(self) -> str:
        # return task instruction
        return "pick up the cube"

    @staticmethod
    def do_import():
        # do imports required by your env
        import libero

EvaluatorEnv.register("your-env-id", YourEnv)

Adding your own policy

from vlagents.policies import Agent, AGENTS
from vlagents.evaluator_envs import Obs, Act
from typing import Any
import numpy as np

class YourAgent(Agent):
    def initialize(self):
        # heavy initialization, e.g. loading models
        pass

    def act(self, obs: Obs) -> Act:
        # your forward pass
        return Act(action=np.zeros(7, dtype=np.float32), done=False, info={})

    def reset(self, obs: Obs, instruction: Any, **kwargs) -> dict[str, Any]:
        # reset model if it has state and return info dict
        return {}

    def close(self, *args, **kwargs):
        pass
AGENTS["your-agent-id"] = YourAgent

Contribution

New Policy

In order to extend the library with a new policy network, extend the Agent class in policies.py. It is important to only invoke policy specific imports in the class functions, as each policy can have its own dependencies.

New Environment

In order to extend the library with a new agent environment, extend the EvaluatorEnv class in evaluator_envs.py.

Developer Tools

Install the following dev dependencies:

pip install 'pip>=25.1'
pip install --group dev

The following dev tools are provided:

# format the code
make format

# lint the code
make lint

# run tests
make test

Citation

If you find the agent useful for your work, please consider citing the original works behind it:

@inproceedings{juelg2025refinedpolicydistillationvla,
    title={{Refined Policy Distillation}: {F}rom {VLA} Generalists to {RL} Experts}, 
    author={Tobias J{\"u}lg and Wolfram Burgard and Florian Walter},
    year={2025},
    booktitle={Proc.~of the IEEE/RSJ Int.~Conf.~on Intelligent Robots and Systems (IROS)}
}
@misc{juelg2026vlagentspolicyserverefficient,
      title={VLAgents: A Policy Server for Efficient VLA Inference}, 
      author={Tobias J{\"u}lg and Khaled Gamal and Nisarga Nilavadi and Pierre Krack and Seongjin Bien and Michael Krawez and Florian Walter and Wolfram Burgard},
      year={2026},
      howpublished={\url{https://arxiv.org/abs/2601.11250}}
}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Jobi

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.0

Jun 20, 2026

0.1.0

Jan 16, 2026

0.0.1

Jan 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vlagents-0.2.0.tar.gz (37.4 kB view details)

Uploaded Jun 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vlagents-0.2.0-py3-none-any.whl (36.0 kB view details)

Uploaded Jun 20, 2026 Python 3

File details

Details for the file vlagents-0.2.0.tar.gz.

File metadata

Download URL: vlagents-0.2.0.tar.gz
Upload date: Jun 20, 2026
Size: 37.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vlagents-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`70fccaff809f587b6f950cfbed5a77eb713399c5771c759b0e7f107b04ee6f51`
MD5	`1fa7d7ee5bb5addf73516692dcbc876b`
BLAKE2b-256	`5505686d44503432ec70443f58941dae07bfe8f1dc6a63a79c5f200600d6b88e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vlagents-0.2.0.tar.gz:

Publisher: release.yaml on RobotControlStack/vlagents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vlagents-0.2.0.tar.gz
- Subject digest: 70fccaff809f587b6f950cfbed5a77eb713399c5771c759b0e7f107b04ee6f51
- Sigstore transparency entry: 1879460474
- Sigstore integration time: Jun 20, 2026
Source repository:
- Permalink: RobotControlStack/vlagents@f66cb39db5d03cbe8775035501d9c7e32bae5af9
- Branch / Tag: refs/heads/master
- Owner: https://github.com/RobotControlStack
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@f66cb39db5d03cbe8775035501d9c7e32bae5af9
- Trigger Event: workflow_dispatch

File details

Details for the file vlagents-0.2.0-py3-none-any.whl.

File metadata

Download URL: vlagents-0.2.0-py3-none-any.whl
Upload date: Jun 20, 2026
Size: 36.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vlagents-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`054c47f682c69cfa44c634ddc2499e6e560030481897d3b83f794c1caed1c0de`
MD5	`777844ed7d6a164dd2174402ea9d2c43`
BLAKE2b-256	`d7005f1cf0ef4016eece9cc22af792714bb9222d4ac4875955e1594a5d4394be`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vlagents-0.2.0-py3-none-any.whl:

Publisher: release.yaml on RobotControlStack/vlagents

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vlagents-0.2.0-py3-none-any.whl
- Subject digest: 054c47f682c69cfa44c634ddc2499e6e560030481897d3b83f794c1caed1c0de
- Sigstore transparency entry: 1879460517
- Sigstore integration time: Jun 20, 2026
Source repository:
- Permalink: RobotControlStack/vlagents@f66cb39db5d03cbe8775035501d9c7e32bae5af9
- Branch / Tag: refs/heads/master
- Owner: https://github.com/RobotControlStack
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@f66cb39db5d03cbe8775035501d9c7e32bae5af9
- Trigger Event: workflow_dispatch

vlagents 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

VLAgents

Installation

Pip Installation (Recommended)

Local Installation

Environment and Policy Installation

LeRobot

Octo

Troubleshooting

OpenVLA

OpenPi / Pi0

vjEPA2-ac

Diffusion Policy

Usage

Adding your own environment

Adding your own policy

Contribution

New Policy

New Environment

Developer Tools

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance