Skip to main content

A fork of GraspGen for EDEN project

Project description

GraspGen: A Diffusion-based Framework for 6-DOF Grasping

⚠️ LICENSE WARNING: This project is a derivative of GraspGen and is licensed under the NVIDIA Source Code License. It is restricted to Non-Commercial Research and Evaluation purposes only. Commercial use is strictly prohibited.

Installation

pip install eden_grasp_gen

NOTE: When compiling pointnet2_ops, if you are facing issues such as finding CUDA runtime headers or missing C++ compiler, try to manually set the following before installing:

export CC=/usr/bin/g++ && export CXX=/usr/bin/g++ && export CUDAHOSTCXX=/usr/bin/g++ && export TORCH_CUDA_ARCH_LIST="8.6"

Download Checkpoints

The checkpoints can be downloaded from HuggingFace:

grasp_gen download

Inference Demos

We have added scripts for visualizing grasp predictions on real world point clouds using the models. The sample dataset is in the models repository in the sample_data folder. Please see the script args for use. For plotting just the topk grasps (used on the real robot, k=100 by default) pass in the --return_topk flag. To visualize for different grippers, modify the --gripper_config argument.

Prerequisites

  1. Dataset: Please download checkpoints first - this will be the <path_to_models_repo> below.
  2. MeshCat: All the examples below are visualized on MeshCat in a browser. You can start a MeshCat server in a new terminal (in any environment, install with pip install meshcat) with the following command: meshcat-server. You can also just run a dedicated docker container in the background bash docker/run_meshcat.sh. Navigate to the corresponding url on the browser (it should be printed when you start the server) - the results will be visualized here.
  3. Docker: The first argument is the path to where you have locally cloned the GraspGen repository (always required). Use --models flag for the models directory. These will be mounted at /code and /models paths inside the container respectively.
# For inference only
bash docker/run.sh <path_to_graspgen_code> --models <path_to_models_repo>

Predicting grasps for segmented object point clouds

cd /code/ && python scripts/demo_object_pc.py --sample_data_dir /models/sample_data/real_object_pc --gripper_config /models/checkpoints/graspgen_robotiq_2f_140.yml

Predicting grasps for object meshes

cd /code/ && python scripts/demo_object_mesh.py --mesh_file /models/sample_data/meshes/box.obj --mesh_scale 1.0 --gripper_config /models/checkpoints/graspgen_robotiq_2f_140.yml

[Advanced] Predicting grasps for objects from scene point clouds

cd /code/ && python scripts/demo_scene_pc.py --sample_data_dir /models/sample_data/real_scene_pc --gripper_config /models/checkpoints/graspgen_robotiq_2f_140.yml

[Advanced] Predicting grasps for objects from scene point clouds, with collision checking

If you would like to filter the inferred grasps based on collisions, use the --filter_collisions flag. This uses a simple point cloud based collision checker. On the real robot, we suggest using NVBlox. The grasp is in collision if it is red, and green if collision free. See scripts/demo_collision_free_grasps.py if you want to pass in your own scene using the depth and segmentation image as commandline arguments.

cd /code/ && python scripts/demo_scene_pc.py --filter_collisions --sample_data_dir /models/sample_data/real_scene_pc --gripper_config /models/checkpoints/graspgen_franka_panda.yml

Note: At the time of release of this repo, the suction checkpoint was not trained with on-generator training, hence may not output the best grasp scores.

Dataset

There are two datasets to download:

  1. Grasp Dataset: This can be cloned from HuggingFace. Where you clone this would be <path_to_grasp_dataset>.
git clone https://huggingface.co/datasets/nvidia/PhysicalAI-Robotics-GraspGen
  1. Object Dataset: We have included a script for downloading the object dataset below. We recommend running this inside the docker container (the simplify arg would not work otherwise). You'll have to specify the directory to save the dataset <path_to_object_dataset>. We have only tested the training with simplified meshes (hence the --simplify arg), which was crucial to increasing rendering and simulation speed. This script may take a few hours to complete downloading and will take up a lot of CPU. If you are running inside a docker container, you'll need to mount a location to save this data.

First start the docker:

# For Dataset download only
mkdir -p <object_dataset>
bash docker/run.sh <path_to_graspgen_code> --grasp_dataset <path_to_grasp_dataset> --object_dataset <path_to_object_dataset>
cd /code && python scripts/download_objects.py --uuid_list /grasp_dataset/splits/franka_panda/ --output_dir /object_dataset --simplify

In total, we release over 57 million grasps, computed for a subset of 8515 objects from the Objaverse XL (LVIS) dataset. These grasps are specific to three grippers: Franka Panda, the Robotiq-2f-140 industrial gripper, and a single-contact suction gripper (30mm radius).

Training with Existing Datasets

This section covers training on existing pre-generated datasets for the three grippers. For a more detailed tutorial on generating your own dataset as well as training a model, please see TUTORIAL.md

Prerequisites

  1. Dataset: Please see the Dataset section and download the grasp and object datasets first.
  2. Path setup: You will need to find out the following paths for the next step
  • <path_to_graspgen_code>: Local path to where the graspgen repo was cloned
  • <path_to_grasp_dataset>: Local path to where grasp dataset was cloned.
  • <path_to_object_dataset>: Local path to where object dataset was downloaded.
  • <path_to_results>: Local path to where training logs and cache would be saved.
  1. Docker: Start the docker container with the correct paths:
# For training only.
mkdir -p <path_to_results>
bash docker/run.sh <path_to_graspgen_code> --grasp_dataset <path_to_grasp_dataset> --object_dataset <path_to_object_dataset> --results <path_to_results>

See the training scripts in runs/. For each gripper, there are models to train separately - the generator (diffusion model) as well as the discriminator.

# Example usage for training the generator
cd /code && bash runs/train_graspgen_robotiq_2f_140_gen.sh

# Example usage for training the discriminator
cd /code && bash runs/train_graspgen_robotiq_2f_140_dis.sh

Things to note regarding training:

  • The experiments in this paper were run with 8 X A100 machines. We have tested this on V100, A100, H100 and L40s
  • Dataset Caching: Before starting the actual training, the script builds a cache of the dataset and saves to a hdf5 .h5 file in the specified cache directory. Both caching and training is handled in the same train_graspgen.py script with same arguments. If a cache does not exist or is incomplete, the script will build the cache and will automatically continue with training once caching is complete. If the cache already exists, the script will immediately start the training.
  • On-Generator training: On-Generator training is not released for the discriminator training (yet). It will be released when the data generation repo is released. This is needed for the best performance and scoring of the predicted grasps (see paper).

Important Training Arguments

  • NGPU: Set to the number of GPUs you have for training
  • LOG_DIR: Specifies where the tensorboard logs, checkpoints and console logs are saved to
  • NWORKERS: A rule of thumb is to set to a non-zero number, roughly Number of CPU of Cores / Number of GPUs
  • NUM_REDUNDANT_DATAPOINTS: parameter controls the redundancy (of camera viewpoints) in cache building. The higher the number, the better the domain randomization and sim2real transfer. Default is 7. There will be a OOM error if it is too high.
  • debug mode: To run the job on a single GPU job and 1 worker, you can set the arg train.debug=True

Monitoring Training and Estimates:

  • Generator: Grasp reconstruction error reconstruction/error_trans_l2 on the validation set should converge to a few cm; this run will take at least 3K epochs to converge; on a 8 X A100 node, it takes about 40 hrs for 3K epochs
  • Discriminator: Validation AP score should be > 0.8 and bce_topk loss should go down; this run will take at least 3K epochs to converge; on a 8 X A100 node, it takes about 90 hrs for 3K epochs

Training & Data Generation (for new objects and grippers)

Please see TUTORIAL.md for a detailed walkthrough of training a model from scratch, including grasp data generation. Currently, we include an example for suction grippers. See the GraspDataGen package for Isaac Lab based grasp data generation for pinch grippers.

GraspGen Conventions

Please see the following files for documentation on the formats we adopted:

See the GraspDataGen package for Isaac Lab based grasp data generation in the above format.

FAQ

How do I train for a new gripper?

Please let us know what gripper you are interested in this short survey.

For optimal performance on a new gripper, we recommend re-training the model with our specified training recipie. You will need following to achieve that:

Please see TUTORIAL.md for a detailed walkthrough of training a model from scratch, including grasp data generation. See GraspDataGen for data generation.

My gripper is very similar to one of the existing grippers. Could I re-target model for my gripper?

In most cases, we recommend re-training a new model specific to your gripper as the physics would have changed.

If your gripper is antipodal and has a similar stroke length (i.e. width) to one of the existing grippers (Franka/Robotiq), feel free to re-target the model. You may have to apply a offset along the z direction import trimesh.transformations as tra; new_grasp = grasp @ tra.translation_matrix([0,0,-Z_OFFSET]) to align the base link frames of both grippers.

If you are using a single-cup suction gripper, you could retarget our suction model trained for a 30mm suction seal. You could rescale the object point cloud/mesh input before inference import trimesh.transformations as tra; mat = tra.scale_matrix(r/0.030) where r is the radius of the suction cup for your gripper.

How do I finetune on new object dataset?

The graspgen model is meant to generalize zero-shot to unknown objects. If you would like to further finetune the model on a new object/grasp dataset combination or train on a larger dataset, you will need to 1) pass in the pretrained checkpoint in the train.checkpoint argument in the train script and 2) change the paths to the new grasp/object dataset. Please check the GRASP_DATASET_FORMAT.md convention.

Why is my train script hanging/getting killed without any errors?

Make sure your docker container has sufficient CPU, swap and GPU memory. Please post a github issue otherwise.

How do I run this on the robot?

You will need instance segmentation (e.g. SAM2) and motion planning (e.g. cuRobo) to run this model. More details can be found in the experiments section of the paper.

You did not include the gripper I have/want with your dataset!

Sorry we missed your gripper! Please consider completing this quick survey to describe your gripper. You can optionally leave a your URDF.

How do I report a bug or ask more detailed questions?

Please post a github issue and we will follow up! Or feel free to email us.

License

License Copyright © 2025, NVIDIA Corporation & affiliates. All rights reserved.

For business inquiries, please submit the form NVIDIA Research Licensing.

Citation

If you found this work to be useful, please considering citing:

@article{murali2025graspgen,
  title={GraspGen: A Diffusion-based Framework for 6-DOF Grasping with On-Generator Training},
  author={Murali, Adithyavairavan and Sundaralingam, Balakumar and Chao, Yu-Wei and Yamada, Jun and Yuan, Wentao and Carlson, Mark and Ramos, Fabio and Birchfield, Stan and Fox, Dieter and Eppner, Clemens},
  journal={arXiv preprint arXiv:2507.13097},
  url={https://arxiv.org/abs/2507.13097},
  year={2025},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

eden_grasp_gen-0.1.5.tar.gz (5.0 MB view details)

Uploaded Source

File details

Details for the file eden_grasp_gen-0.1.5.tar.gz.

File metadata

  • Download URL: eden_grasp_gen-0.1.5.tar.gz
  • Upload date:
  • Size: 5.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.0

File hashes

Hashes for eden_grasp_gen-0.1.5.tar.gz
Algorithm Hash digest
SHA256 e6634dac12160efafdc92af96f8aa46065e97569e2f20a0c44dd75ab66db0743
MD5 d3d58fd9b8a5b9341043d42af7523e77
BLAKE2b-256 2e7c635ddff8f722c682061fac6b6875e95d78b31379ce887a0b19f67bcc13bc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page