Modeling social conventions with Sequential Episodic Control

Enllaç permanent

Descripció

  • Resum

    Computational models can help bring light to the underlying cognitive mechanisms responsible for the emergence of conventions in human societies. In order to pro- vide meaningful theoretical insight, these models of behavior should aim to resemble human-like performance. Deep Reinforcement Learning (deep RL) algorithms fail to do so; these techniques are data inefficient and require several training instances to approximate the speed of human learning. Episodic Reinforcement Learning (ERL) algorithms, for their part, seek to improve deep RL algorithms by implementing memory buffers that can counter the sample inefficiency problem. Nonetheless, this approach also falls short because it considers memories as isolated, discrete events. On the other hand, episodic control models provide a model-free, non-parametric ap- proach capable of rapid learning, thus resembling more closely human performance. These algorithms introduce a fast memory system inspired by the hippocampus that allows them to search for a solution without having to make strong assump- tions about the world. The present work will test the adequacy of one particular episodic control algorithm, the Sequential Episodic Control (SEC) model to sim- ulate human behavioral data in a repeated coordination game. This project will compare the modeled and behavioral data concerning efficiency, fairness, and sta- bility measures to evaluate the model’s performance. Finally, given the structure of the model, this project will examine the potential theoretical implications of human conventionalization, as well as the limitations and future work on this approach.
  • Descripció

    Treball fi de màster de: Master in Cognitive Systems and Interactive Media
    Directors: Ismael T. Freire, Adrián Fernández Amil
  • Mostra el registre complet