¹Reichman University, ²Technion — Israel Institute of Technology, ³The University of Texas at Austin
Official implementation of TWISTED-RL, a hierarchical framework for robotic knot-tying that combines a high-level topological planner with low-level reinforcement learning policies. The framework enables robots to learn complex knot-tying skills without human demonstrations.
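The hierarchy can be pictured as a planner proposing topological sub-goals that goal-conditioned RL skills then execute. The sketch below is purely conceptual; every name in it (planner, policies, env, skill_id) is hypothetical and does not correspond to this repository's modules:

# Conceptual sketch only: all names here are hypothetical, not this repo's API.
def tie_knot(env, planner, policies, max_steps=200):
    """Alternate between high-level topological planning and low-level RL control."""
    obs = env.reset()
    # High level: plan a sequence of topological sub-goals toward the target knot.
    subgoals = planner.plan(start=env.topological_state(), goal=env.target_knot())
    for subgoal in subgoals:
        policy = policies[subgoal.skill_id]            # pick the matching RL skill
        for _ in range(max_steps):
            action = policy.act(obs, subgoal)          # low level: goal-conditioned action
            obs, _, _, info = env.step(action)
            if info["topological_state"] == subgoal:   # sub-goal reached, move on
                break
    return env.topological_state() == env.target_knot()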
This project uses Conda for environment management.
# Clone the repository
git clone https://github.com/guyfreund/twisted_rl.git
cd twisted_rl
# Create and activate conda environment
conda env create -f server_environment_39_18_04_2025.yml
conda activate twisted_rl

- Run all scripts with the root dir .../twisted_rl as the working directory (see the sketch after this list).
- It is recommended to run the problem set file to create all relevant problems:

python exploration/mdp/graph/problem_set.py
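As noted above, scripts expect the repository root as the working directory. A minimal sketch for launching a script that way from Python; the repository path below is a placeholder for your clone location:

import subprocess

REPO_ROOT = "/path/to/twisted_rl"  # placeholder: replace with your clone location

# Run a repository script with the repo root as the working directory.
subprocess.run(
    ["python", "exploration/mdp/graph/problem_set.py"],
    cwd=REPO_ROOT,
    check=True,
)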
TWISTED-RL is trained hierarchically across levels of increasing complexity.
To train the TWISTED-RL-C variant, run the following:

python exploration/rl/cleanrl_scripts/sac_algorithm.py \
    --problem G0

python exploration/rl/cleanrl_scripts/sac_algorithm.py \
    --problem G1 \
    --replay_buffer_files_path PATH_TO_G0

python exploration/rl/cleanrl_scripts/sac_algorithm.py \
    --problem G2 \
    --replay_buffer_files_path PATH_TO_G1

Edit the following to adjust training configuration:

- exploration/rl/config/sac.yml for training parameters.
- exploration/rl/environment/exploration_env.yaml for environment parameters.
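The curriculum can also be scripted end to end. A minimal sketch, assuming the three commands above and that PATH_TO_G0 and PATH_TO_G1 are the replay-buffer directories produced by the preceding stage (left as placeholders here):

import subprocess

SCRIPT = "exploration/rl/cleanrl_scripts/sac_algorithm.py"

# Placeholder replay-buffer locations; point these at the buffers produced by
# the previous stage (PATH_TO_G0, PATH_TO_G1 in the commands above).
stages = [
    ("G0", None),
    ("G1", "PATH_TO_G0"),
    ("G2", "PATH_TO_G1"),
]

for problem, buffer_path in stages:
    cmd = ["python", SCRIPT, "--problem", problem]
    if buffer_path is not None:
        cmd += ["--replay_buffer_files_path", buffer_path]
    subprocess.run(cmd, check=True)  # train each level before moving to the next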
To run evaluation of the full TWISTED-RL system:
python system_flow/evaluation_automation.py \
    --ablation C

Edit system_flow/config/twisted_evaluation.yml to adjust evaluation configuration:
LOW_LEVEL:
  RL:
    agents:
      G0: path/to/G0.pt
      G1: path/to/G1.pt
      G2: path/to/G2.pt

EVALUATION:
  states_type: 3-eval  # or easy, medium, hard, 4-eval
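Before a long evaluation run it can help to sanity-check this config. A minimal sketch, assuming the YAML layout shown above and that the agent entries are filesystem paths to checkpoints (requires PyYAML):

import os
import yaml

CONFIG = "system_flow/config/twisted_evaluation.yml"

with open(CONFIG) as f:
    cfg = yaml.safe_load(f)

# Check that every low-level RL agent checkpoint actually exists on disk.
agents = cfg["LOW_LEVEL"]["RL"]["agents"]
for problem, checkpoint in agents.items():
    status = "ok" if os.path.isfile(checkpoint) else "MISSING"
    print(f"{problem}: {checkpoint} [{status}]")

print("states_type:", cfg["EVALUATION"]["states_type"])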
Repository layout:

twisted_rl/
├── exploration/
│ ├── goal_selector/ # Goal selection strategies
│ ├── initial_state_selector/ # Initial state selection strategies
│ ├── mdp/ # MDP definitions
│ ├── preprocessing/ # Data processing
│ ├── reachable_configurations/ # Reachability graph
│ ├── rl/
│ │ ├── cleanrl_scripts/ # Training scripts
│ │ ├── config/ # Training configs
│ │ ├── environment/ # Custom environments
│ │ ├── replay_buffer/ # Replay buffers
│ │ └── test_scripts/ # Evaluation scripts
│ └── utils/ # Helper functions
├── mujoco_infra/ # MuJoCo simulation wrappers and assets
├── system_flow/
│ ├── config/ # System-level configs
│ ├── high_level_class/ # High-level planners
│ ├── low_level_class/ # Low-level planners
│ └── metrics/ # Evaluation metrics metadata
If you find TWISTED-RL useful in your research, please consider citing our work:
@inproceedings{freund2026twistedrl,
  title     = {TWISTED-RL: Hierarchical Skilled Agents for Knot-Tying without Human Demonstrations},
  author    = {Freund, Guy and Jurgenson, Tom and Sudry, Matan and Karpas, Erez},
  booktitle = {Proceedings of the IEEE International Conference on Robotics and Automation (ICRA)},
  year      = {2026},
  note      = {To appear}
}

This repository is licensed under the MIT License. See the LICENSE file for details.