John Luoyu Zhou

Hello! I'm a Computer Science Ph.D. student at UCLA, advised by Jonathan Kao. I received my B.S., also in Computer Science, from Columbia University, where I was advised by Liam Paninski in the Center for Theoretical Neuroscience.

CV  /  Twitter  /  GitHub  /  Google Scholar

profile photo

Current Interests

I am focused on developing reinforcement learning algorithms that are able to reason through complex, long-horizon tasks by recursively decomposing them into simpler subproblems. To this end, I work on hierarchical methods that operate over (sub)goal-conditioned policies, action chunks, and options.

I spent my undergraduate studies and the first part of my Ph.D. studying neuroscience, investigating how the brain allocates control between model-free and model-based decision-making strategies and developing neural decoding applications.

Publications

project image

Flattening Hierarchies with Policy Bootstrapping


John L. Zhou and Jonathan C. Kao
Neural Information Processing Systems (NeurIPS), 2025 [Spotlight, top 3.2%]
project page / paper / code

We present Subgoal Advantage-Weighted Policy Bootstrapping (SAW), an offline goal-conditioned RL algorithm that achieves state-of-the-art performance on complex, long-horizon tasks without needing hierarchical policies or generative subgoal models.

project image

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents


John L. Zhou, Weizhe Hong, and Jonathan C. Kao
Neural Information Processing Systems (NeurIPS), 2024
paper / code

We propose a form of intrinsic motivation to reciprocate the influence of other agents’ actions on one’s own return, and show that this is sufficient to encourage mutual cooperation from other, self-interested agents in sequential social dilemmas.



Previous Research

project image

Neuroscience Cloud Analysis As a Service


Taiga Abe et al. (Co-author)
Neuron, 2022
paper / code

An open-source, drag-and-drop platform that uses cloud resources to run modern neuroscience data analyses.

project image

Partitioning variability in animal behavioral videos using semi-supervised variational autoencoders


Matthew R Whiteway et al. (Co-author)
PLOS Computational Biology, 2021
paper / code

A semi-supervised framework that combines the output of supervised pose estimation algorithms with unsupervised dimensionality reduction methods to produce interpretable, low-dimensional representations of behavioral videos.



Writing

project image

A Full Dive


John L. Zhou
Knowing Neurons, 2023
article

An article that takes a dive into the current state of brain-computer interfaces and open challenges in neural decoding and encoding. Inspired by my research as well as Sword Art Online, one of my favorite childhood anime series.


Template