A reinforcement learning framework for training humanoid robot controllers using Isaac Lab, featuring DeepONet, Transformer, and MLP architectures for sim-to-real transfer.
GapONet implements a comprehensive training and evaluation framework for humanoid robot control with a focus on sim-to-real transfer. It supports multiple neural network architectures (DeepONet, Transformer, MLP) and provides environments for training and testing on various humanoid robot platforms.
- Isaac Sim 4.5.0+ (installed separately)
- Python 3.10+
- CUDA-capable GPU with appropriate drivers
- Isaac Lab (see Isaac Lab documentation for installation)
Before installation, download the required assets:

- Robot Assets: Download `sim2real_assets` and place the files in `gaponet/source/sim2real_assets/sim2real_assets/`.
- Test Data: A sample test data file is provided; use it as a template for the format of test and training data. Place the files in `gaponet/source/sim2real/sim2real/motions/motion_amass/edited_27dof/`.
- Checkpoint: A sample checkpoint is provided; use it as a template for the checkpoint format. Place the files in `gaponet/model/`.
Use the setup script to automatically create the conda environment and install all dependencies:

```bash
# Clone the repository
git clone git@github.com:jiemingcui/gaponet.git
cd gaponet

# Run the setup script (creates the 'gapo' environment by default)
./setup.sh

# Install Isaac Sim
pip install "isaacsim[all,extscache]==4.5.0" --extra-index-url https://pypi.nvidia.com

# Install Isaac Lab
./isaaclab.sh --install
```

- Train with the DeepONet architecture on the operator environment:
```bash
python scripts/rsl_rl/train.py --task Isaac-Humanoid-Operator-Delta-Action \
    --num_envs=4080 --max_iterations 100000 --experiment_name Sim2Real \
    --letter amass --run_name delta_action_mlp_payload --device cuda env.mode=train --headless
```

- Evaluate a trained model:
```bash
python scripts/rsl_rl/play.py --task Isaac-Humanoid-Operator-Delta-Action \
    --model ./model/model_17950.pt --num_envs 20 --headless
```

- Export a checkpoint to JIT format (for lightweight inference without Isaac Sim):
```bash
python scripts/rsl_rl/inference_jit.py \
    --export \
    --checkpoint ./model/model_17950.pt \
    --task Isaac-Humanoid-Operator-Delta-Action \
    --output ./model/policy.pt \
    --device cuda:0 \
    --num_envs 20
```

This script exports a trained checkpoint to JIT/TorchScript format so the model can later run without Isaac Sim. Note: the export step itself still requires Isaac Sim to be initialized.
- Run lightweight inference and evaluation (no Isaac Sim required):
```bash
python scripts/rsl_rl/deploy.py \
    --model ./model/policy.pt \
    --test_data ./source/sim2real/sim2real/tasks/humanoid_operator/motions/motion_amass/edited_27dof/test.npz
```

This script performs inference on test data and computes evaluation metrics:
- Large Gap Ratio: Ratio of joint position errors >= 0.5 rad
- Gap IQR: Interquartile range of joint position errors
- Gap Range: Range (max - min) of joint position errors
Results are grouped by payload mass and displayed in tables. This script does not require Isaac Sim and can be run on any machine with PyTorch.
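The three gap metrics above can be computed directly from per-joint position errors. A minimal NumPy sketch (the function name and array shapes are illustrative, not the framework's actual API):

```python
import numpy as np

def gap_metrics(gaps, threshold=0.5):
    """Compute the three gap metrics from absolute joint position errors (rad).

    gaps: array of shape (num_steps, num_joints) -- |q_measured - q_target|.
    """
    gaps = np.abs(np.asarray(gaps)).ravel()
    q1, q3 = np.percentile(gaps, [25, 75])
    return {
        "large_gap_ratio": float(np.mean(gaps >= threshold)),  # fraction of errors >= threshold
        "gap_iqr": float(q3 - q1),                             # interquartile range
        "gap_range": float(gaps.max() - gaps.min()),           # max - min
    }

# Example: 4 timesteps x 2 joints of synthetic errors
errors = np.array([[0.1, 0.6], [0.2, 0.7], [0.1, 0.3], [0.0, 0.4]])
m = gap_metrics(errors)  # large_gap_ratio = 0.25 (2 of 8 errors >= 0.5)
```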
To add support for a new robot, follow these steps:

1. Create a new task directory in `source/sim2real/sim2real/tasks/`:
   - Create a new folder (e.g., `humanoid_your_task/`)
   - Copy and modify files from `humanoid_operator/` or `humanoid_amass/` as reference
   - Implement your environment class (e.g., `your_robot_env.py`)
   - Create an environment configuration (e.g., `your_robot_env_cfg.py`)

2. Register the task in `source/sim2real/sim2real/tasks/humanoid_your_task/__init__.py`:
   - Use `gym.register()` to register your environment
   - Reference existing registrations in `humanoid_operator/__init__.py`

3. Create a robot configuration in `source/sim2real_assets/sim2real_assets/robots/`:
   - Create a Python file (e.g., `your_robot.py`)
   - Define the robot configuration using `ArticulationCfg`
   - Add joint and body name dictionaries

4. Add robot assets:
   - Place URDF files in `source/sim2real_assets/sim2real_assets/urdfs/`
   - Place USD files in `source/sim2real_assets/sim2real_assets/usds/` (if using USD)
   - Create versions with and without payloads if needed

5. Prepare motion data:
   - Format your data according to the test data template
   - Save as `.npz` files with the required keys (see the Motion Data section)
   - Place them in the appropriate motion directory

6. Configure agent settings:
   - Create or modify the agent config in `source/sim2real/sim2real/tasks/your_robot/agents/`
   - Choose an appropriate method (DeepONet, Transformer, or MLP)
   - Set network parameters based on your robot's DOF count

7. Start training:
   - Use the registered task name in the training commands
   - Adjust `num_envs` and other hyperparameters as needed
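Step 3 might look roughly like the following. Since `ArticulationCfg` comes from Isaac Lab and is not available here, this sketch uses a plain dataclass stand-in; every name in it (the USD path, joint names, body names) is hypothetical:

```python
from dataclasses import dataclass, field

# Stand-in for Isaac Lab's ArticulationCfg -- in the real your_robot.py you
# would import the actual class from Isaac Lab instead of defining your own.
@dataclass
class RobotCfg:
    usd_path: str
    joint_names: dict = field(default_factory=dict)
    body_names: dict = field(default_factory=dict)

# Hypothetical robot definition with joint/body name dictionaries
YOUR_ROBOT_CFG = RobotCfg(
    usd_path="source/sim2real_assets/sim2real_assets/usds/your_robot.usd",
    joint_names={"left_shoulder_pitch": 0, "left_shoulder_roll": 1},
    body_names={"left_hand": 0},
)
```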
The DeepONet architecture uses a branch-trunk network structure:
- Branch Network: Processes sensor data at multiple resolutions
- Trunk Network: Processes action targets and payload information
- Fusion: Combines branch and trunk outputs for action prediction
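The branch-trunk combination can be illustrated with a tiny NumPy sketch. All dimensions below are invented for the example, and the fusion shown is the classic DeepONet elementwise-product-and-sum; the framework's actual heads and sizes may differ:

```python
import numpy as np

rng = np.random.default_rng(0)

def make_mlp(dims):
    """Random small-weight layers (W, b) for each consecutive dim pair."""
    return [(rng.normal(size=(i, o)) * 0.1, np.zeros(o))
            for i, o in zip(dims[:-1], dims[1:])]

def mlp(x, layers):
    """Tiny MLP: tanh hidden layers, linear output layer."""
    for W, b in layers[:-1]:
        x = np.tanh(x @ W + b)
    W, b = layers[-1]
    return x @ W + b

latent = 32
branch = make_mlp([48, 64, latent])  # sensor observations -> latent embedding
trunk = make_mlp([30, 64, latent])   # action targets + payload -> latent embedding

sensor_obs = rng.normal(size=48)
target_and_payload = rng.normal(size=30)

# Fusion: elementwise product of the two embeddings, summed to a scalar.
# Per-joint action outputs would add a final linear head on top.
b = mlp(sensor_obs, branch)
t = mlp(target_and_payload, trunk)
out = np.sum(b * t)
```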
Transformer-based architecture with:
- Multi-head self-attention
- Position-wise feed-forward networks
- Separate actor and critic transformers
Standard multi-layer perceptron with:
- Configurable hidden dimensions
- History buffer for temporal information
- Action and value heads
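The history buffer that feeds the MLP temporal information can be sketched as a fixed-length queue of past observations, flattened into one input vector (a minimal stand-in, not the framework's actual class):

```python
from collections import deque
import numpy as np

class HistoryBuffer:
    """Keeps the last `length` observations and returns them flattened."""
    def __init__(self, length, obs_dim):
        # Pre-fill with zeros so the flattened size is constant from step one
        self.buf = deque([np.zeros(obs_dim)] * length, maxlen=length)

    def push(self, obs):
        self.buf.append(np.asarray(obs))
        return np.concatenate(self.buf)  # ordered oldest -> newest

hist = HistoryBuffer(length=4, obs_dim=3)
flat = hist.push(np.ones(3))  # shape (12,): three zero frames + the new one
```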
Operator environment for training with variable payloads and sensor configurations:
- Supports multiple sensor positions
- Handles wrist and hand payloads
- Computes equivalent torques using Pinocchio
- Sub-environment structure for efficient training
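In the framework, Pinocchio computes the equivalent torques; the underlying relation is the Jacobian-transpose mapping tau = J^T F from the payload's weight to joint torques. A hedged NumPy illustration with a made-up Jacobian:

```python
import numpy as np

def payload_torques(jacobian, payload_mass, g=9.81):
    """Joint torques equivalent to a payload's weight at its attachment point.

    jacobian: (3, num_joints) positional Jacobian of the attachment point.
    Returns tau = J^T @ F, where F is the gravity force on the payload.
    """
    force = np.array([0.0, 0.0, -payload_mass * g])  # weight in the world frame
    return jacobian.T @ force

# Made-up 3x2 Jacobian for a 2-joint arm
J = np.array([[0.0, 0.0],
              [0.0, 0.0],
              [0.5, 0.2]])
tau = payload_torques(J, payload_mass=1.0)  # approximately [-4.905, -1.962]
```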
AMASS motion tracking environment:
- Loads motion data from AMASS dataset
- Tracks reference motions
- Supports history-based observations
- Computes tracking rewards
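Tracking rewards of this kind are commonly an exponentiated tracking error; the sketch below shows that general form, with a kernel width chosen for illustration rather than the framework's actual value:

```python
import numpy as np

def tracking_reward(q, q_ref, sigma=0.25):
    """Exponentiated squared tracking error: 1.0 at perfect tracking,
    near 0 when far from the reference. `sigma` is illustrative only."""
    err = np.sum((q - q_ref) ** 2)
    return float(np.exp(-err / sigma**2))

r = tracking_reward(np.zeros(27), np.zeros(27))  # perfect tracking -> 1.0
```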
Key parameters in environment configs:
- `mode`: "train" or "play"
- `num_envs`: Number of parallel environments
- `episode_length_s`: Episode length in seconds
- `max_payload_mass`: Maximum payload mass for training
- `num_sensor_positions`: Number of sensor configurations
Network-specific parameters:
- `branch_input_dims`: Input dimensions for branch networks
- `trunk_input_dim`: Input dimension for trunk network
- `hidden_dims`: Hidden layer dimensions
- `model_history_length`: Length of history buffer
Motion data should be provided in NumPy `.npz` format with the following keys:

- `real_dof_positions`: Joint positions
- `real_dof_velocities`: Joint velocities
- `real_dof_positions_cmd`: Target joint positions
- `real_dof_torques`: Joint torques
- `joint_sequence`: List of joint names for delta actions
- `payloads`: Payload masses (optional)
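A motion file with these keys can be written with `np.savez`. The array shapes below (timesteps x 27 DOFs) are assumptions for illustration; follow the provided test data sample for the authoritative format:

```python
import numpy as np

T, D = 500, 27  # timesteps, DOF count -- shapes are assumptions
np.savez(
    "motion_sample.npz",
    real_dof_positions=np.zeros((T, D)),
    real_dof_velocities=np.zeros((T, D)),
    real_dof_positions_cmd=np.zeros((T, D)),
    real_dof_torques=np.zeros((T, D)),
    joint_sequence=np.array(["joint_%02d" % i for i in range(D)]),
    payloads=np.zeros(T),  # optional key
)

data = np.load("motion_sample.npz")  # keys load back as arrays
```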
The framework computes several metrics during evaluation:
- MPJAE: Mean Per-Joint Angle Error (in degrees)
- Large Gap Ratio: Ratio of gaps >= robot_threshold rad
- Gap IQR: Interquartile range of gaps
- Gap Range: Range of gaps
- Upper Body Joint Area: Area under error curve
- EEF Error: End-effector position error
Results are saved as CSV files and visualization plots.
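The MPJAE metric and the CSV output can be sketched as follows; the function name and file layout are illustrative, not the framework's exact implementation:

```python
import csv
import numpy as np

def mpjae_deg(q_pred, q_ref):
    """Mean Per-Joint Angle Error in degrees, averaged over time per joint."""
    return np.degrees(np.mean(np.abs(q_pred - q_ref), axis=0))

q_ref = np.zeros((100, 3))
q_pred = np.full((100, 3), np.pi / 180)  # constant 1-degree offset per joint
per_joint = mpjae_deg(q_pred, q_ref)     # -> roughly [1.0, 1.0, 1.0]

# Save per-joint results as CSV, mirroring the framework's CSV output
with open("mpjae.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["joint", "mpjae_deg"])
    for i, v in enumerate(per_joint):
        writer.writerow([f"joint_{i}", f"{v:.3f}"])
```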