A Computational Model of the Hippocampus Supports Exploratory Behaviour in Reinforcement Learning Agents

Sabrina Du; Adel Halawa; Aleksei Efremov; Adrien Peyrache; Daniel Levenstein; Blake Richards

doi:10.26443/msurj.v21i2.437

Vol. 21 No. 2 (2026), OSE Undergraduate Science Showcase Proceedings

Vol. 21 No. 2 (2026)

A Computational Model of the Hippocampus Supports Exploratory Behaviour in Reinforcement Learning Agents

OSE Undergraduate Science Showcase Proceedings

https://doi.org/10.26443/msurj.v21i2.437

Published 2026-04-10

Sabrina Du⁺⁻
Adel Halawa⁺⁻
Aleksei Efremov⁺⁻
Adrien Peyrache⁺⁻
Daniel Levenstein⁺⁻
Blake Richards⁺⁻

Sabrina Du

Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC, Canada

Adel Halawa

Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC, Canada

Aleksei Efremov

Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC, Canada

Adrien Peyrache

Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC, Canada

Daniel Levenstein

Department of Neuroscience, Yale School of Medicine, New Haven, Connecticut, United States of America

Blake Richards

Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC, Canada

PDF

Keywords

Hippocampus
Cognitive maps
Intrinsic reinforcement learning
Curiosity-driven exploration

How to Cite

Du, S., Halawa, A., Efremov, A., Peyrache, A., Levenstein, D., & Richards, B. (2026). A Computational Model of the Hippocampus Supports Exploratory Behaviour in Reinforcement Learning Agents. McGill Science Undergraduate Research Journal, 21(2). https://doi.org/10.26443/msurj.v21i2.437

Abstract

The hippocampus supports spatial navigation, memory, and planning through the formation of a cognitive map: a structured environmental representation reflected in its neural activity. These neural dynamics can be modeled computationally using recurrent neural networks (RNNs). However, these RNNs are typically trained via reinforcement learning (RL) using external rewards, failing to capture the intrinsic drive of freely exploring organisms in the absence of external rewards. Instead, reward-free RL models, which rely on internal environmental representations, are better candidates to study novelty-seeking and exploratory behaviour. This study aims to investigate whether an RNN exhibiting hippocampal-like activity builds spatial representations sufficient to support exploratory behavior in reward-free RL agents. We leveraged an existing RNN trained for sensory sequence prediction, which exhibits hippocampal-like activity patterns, and used its prediction error as the intrinsic reward to train an Actor-Critic agent. We used a Novel Object Recognition task to quantify its preference for novel stimuli. The RL agent occupied the region of interest (defined as a 3-unit radius around the novel object) significantly more often than a random agent (29.3% ± 3.4% vs. 11.3% ± 1.1% [mean ± SEM], Welch's t-test, p < 0.01), across multiple episodes and novel object locations. The RL agent’s performance was also measured in a multi-room environment. It spent 8.2% ± 0.2% of its time in the most distant and novel room, a significant increase over the 0.023% ± 0.017% of the random control (p < 0.05). This work demonstrates that hippocampal-like representations can support exploratory behavior, and can help us investigate how cognitive maps guide exploration and navigation.

https://doi.org/10.26443/msurj.v21i2.437

PDF

This work is licensed under a Creative Commons Attribution 4.0 International License.

Downloads

Download data is not yet available.

A Computational Model of the Hippocampus Supports Exploratory Behaviour in Reinforcement Learning Agents

Keywords

How to Cite

Download Citation

Abstract

Downloads