Optimal control using sparse-matrix belief propagation

dc.contributor.authorIribarne, Albert
dc.date.accessioned2019-10-29T10:53:24Z
dc.date.available2019-10-29T10:53:24Z
dc.date.issued2019
dc.descriptionTreball fi de màster de: Master in Intelligent Interactive Systemsca
dc.descriptionTutor: Vicenç Gómez Cerdà
dc.description.abstractThe optimal control framework is a mathematical formulation by means of which many decision making problems can be represented and solved by finding optimal policies or controls. We consider the class of optimal control problems that can be formulated as a probabilistic inference on a graphical model, known as Kullback- Leibler (KL) control problems. In particular, we look at the recent progress on exploiting parallelisation facilitated by the graphics processing units (GPU) to solve such inference tasks, considering the recently introduced sparse-matrix belief propagation framework [1]. The sparse-matrix belief propagation algorithm was reported to deliver significant improvements in performance with respect to traditional loopy belief propagation, when tested on grid Markov random fields. We develop our approach in the context of the KL-stag hunt game, a multi-agent, grid-like game which shows two different behavior regimes [2]. We first describe how to transform the original problem into a pairwise Markov random field, amenable to inference using sparse-matrix belief propagation and, second, we perform an experimental evaluation. Our results show that the use of GPUs can bring notable performance improvements to the optimal control computations in the class of KL control problems. However, our results also suggest that the improvements of sparse-matrix belief propagation may be limited by the concrete form of the Markov random field factors, specially on models with high sparsity within a factor, and variables with high cardinality.ca
dc.format.mimetypeapplication/pdf*
dc.identifier.urihttp://hdl.handle.net/10230/42542
dc.language.isoengca
dc.rightsAtribución-NoComercial-SinDerivadas 3.0 España*
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/*
dc.subject.keywordOptimal control
dc.subject.keywordGraphical model
dc.subject.keywordApproximate inference
dc.subject.keywordSparse matrix
dc.subject.keywordBelief propagation
dc.subject.keywordGPU
dc.subject.otherIntel·ligència artificial
dc.titleOptimal control using sparse-matrix belief propagationca
dc.typeinfo:eu-repo/semantics/masterThesisca

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
Iribarne_2019.pdf
Mida:
673.29 KB
Format:
Adobe Portable Document Format
Descripció:

Llicència

Drets