CS Master’s Student & AI/ML Research Assistant
-
Swiss Federal Institute of Technology in Lausanne (EPFL)
- Lausanne, Switzerland
- johnny1188.github.io
Highlights
- Pro
Pinned Loading
-
learning-to-optimize
learning-to-optimize PublicCode for our paper "Investigation into the Training Dynamics of Learned Optimizers" (AAAI 2024 & ICAART 2024).
Jupyter Notebook
-
rl-memory
rl-memory PublicCode for our paper "Reverse-Engineering Memory in DreamerV3: From Sparse Representations to Functional Circuits" (NeurIPS 2025, Spotlight at Mech Interp Workshop).
Python
-
fractional-learning-to-optimize
fractional-learning-to-optimize PublicCode for our paper "Enhancing Fractional Gradient Descent with Learned Optimizers".
-
in-context-learning
in-context-learning PublicPaper reproduction: "General-Purpose In-Context Learning by Meta-Learning Transformers".
-
deep-q-learning-driver
deep-q-learning-driver PublicImplementation of Deep reinforcement learning (Q-learning) in a self-made game environment of a highway driver.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

