rtx - Pytorch
Project description
RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Here we implement both model architectures, RTX-1 and RTX-2
Appreciation
- Lucidrains
- Agorians
Install
pip install rtx-torch
Usage
- RTX1 Usage takes in text and videos
import torch
from rtx.rtx1 import RTX1
model = RTX1()
video = torch.randn(2, 3, 6, 224, 224)
instructions = ["bring me that apple sitting on the table", "please pass the butter"]
# compute the train logits
train_logits = model.train(video, instructions)
# set the model to evaluation mode
model.model.eval()
# compute the eval logits with a conditional scale of 3
eval_logits = model.run(video, instructions, cond_scale=3.0)
print(eval_logits.shape)
- RTX-2 takes in images and text and interleaves them to form multi-modal sentences:
import torch
from rtx import RTX2
# usage
img = torch.randn(1, 3, 256, 256)
text = torch.randint(0, 20000, (1, 1024))
model = RTX2()
output = model(img, text)
print(output)
License
MIT
Citations
Todo
- Integrate Efficient net with RT-1 and RT-2
- create training script for both models
- Provide a table of all the datasets
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
rtx_torch-0.0.5.tar.gz
(11.5 kB
view hashes)
Built Distribution
rtx_torch-0.0.5-py3-none-any.whl
(11.7 kB
view hashes)
Close
Hashes for rtx_torch-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fbd6a08cd2c1518f014981a2864190649174ef01f0acd1cbf67d535c5f9ee3d7 |
|
MD5 | 3b4e86f26a70770705aa783d8675164f |
|
BLAKE2b-256 | 29410717b45ace3f1eb25f3d987446e10551d537caa7fa96222915deb5ad2ab6 |