Mimic Video
Project description
Mimic Video (wip)
Implementation of Mimic-Video, Video-Action Models for Generalizable Robot Control Beyond VLAs
Appreciation
- Pranoy for submitting a pull request for proprioception masking
Install
$ pip install mimic-video
Usage
import torch
# video wrapper
# but will be agnostic to the model
from mimic_video.cosmos_predict import CosmosPredictWrapper
video_wrapper = CosmosPredictWrapper(
extract_layer = 1,
random_weights = True,
tiny = True
)
# mimic video
from mimic_video import MimicVideo
model = MimicVideo(512, video_wrapper)
# states
video = torch.rand(2, 3, 3, 32, 32)
joint_state = torch.randn(2, 32)
# action
actions = torch.randn(2, 32, 20)
# training
loss = model(
prompts = [
'put the package on the conveyer belt',
'pass the butter'
],
video = video,
actions = actions,
joint_state = joint_state
)
loss.backward()
# inference
actions = model.sample(
prompts = 'peel the orange',
video = video[:1],
joint_state = joint_state[:1]
)
assert actions.shape == (1, 32, 20)
Contributing
First make sure pytest and test dependencies are installed with
$ pip install '.[test]'
Then add your test to tests/test_mimic_video.py and run
$ pytest tests
That's it
Citations
@inproceedings{Pai2025mimicvideoVM,
title = {mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs},
author = {Jonas Pai and Liam Achenbach and Victoriano Montesinos and Benedek Forrai and Oier Mees and Elvis Nava},
year = {2025},
url = {https://api.semanticscholar.org/CorpusID:283920528}
}
@misc{black2025trainingtimeactionconditioningefficient,
title = {Training-Time Action Conditioning for Efficient Real-Time Chunking},
author = {Kevin Black and Allen Z. Ren and Michael Equi and Sergey Levine},
year = {2025},
eprint = {2512.05964},
archivePrefix = {arXiv},
primaryClass = {cs.RO},
url = {https://arxiv.org/abs/2512.05964},
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
mimic_video-0.0.28.tar.gz
(780.5 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mimic_video-0.0.28.tar.gz.
File metadata
- Download URL: mimic_video-0.0.28.tar.gz
- Upload date:
- Size: 780.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7326cc581f544dcff7a0dec368f1ef328e61a4ece22802f68e6ceb90b6422e8a
|
|
| MD5 |
41f3aad29a4d702267bb688914a617e0
|
|
| BLAKE2b-256 |
49da87ce25f544e9e8f17003f046fe0e7c0d87dee3866dc45b5942e1c02b706e
|
File details
Details for the file mimic_video-0.0.28-py3-none-any.whl.
File metadata
- Download URL: mimic_video-0.0.28-py3-none-any.whl
- Upload date:
- Size: 12.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
52befa89706073a0745cc87336614fadb50e3ae53d0be4ed361e367a96f10441
|
|
| MD5 |
d9e36a47bf758246b492ce1b71d4b5a8
|
|
| BLAKE2b-256 |
8789c09a249149f4a77eb95b7e71ee2144bcfe2f45ac4da0ad284fe7482d0fc7
|