Clean single-file implementations of offline RL algorithms in JAX
reinforcement-learning flax cql single-file jax awac iql offline-rl offline-reinforcement-learning d4rl decision-transformer td3bc
Updated Nov 24, 2025 - Python