machinestein

Arip Asadulaev (machinestein)

Research Scientist

Zero-Shot-Off-Policy-Learning (Public)

Official PyTorch implementation of "Zero-Shot Off-Policy Learning"

Jupyter Notebook, 14 stars

Deep-Improvement-Supervision (Public)

Official PyTorch implementation of "Your Latent Reasoning Is Secretly Policy Improvement Operator"

Python, 11 stars

Y-Shaped-Generative-Flows (Public)

Official PyTorch implementation of "Y-Shaped Generative Flows"

Jupyter Notebook, 8 stars

Partial-Policy-Learning (Public)

Official JAX implementation of "Rethinking Optimal Transport in Offline Reinforcement Learning" (NeurIPS 2024)

Python, 5 stars, 1 fork

General-Cost-Neural-Optimal-Transport (Public)

Official PyTorch implementation of "Neural Optimal Transport with General Cost Functionals" (ICLR 2024)

Jupyter Notebook, 24 stars, 1 fork