Thompson sampling for prediction with expert advice

dc.contributor.authorSánchez Pérez de Amézaga, Claudio
dc.date.accessioned2021-12-15T12:25:34Z
dc.date.available2021-12-15T12:25:34Z
dc.date.issued2021-09
dc.descriptionTreball fi de màster de: Master in Intelligent Interactive Systemsca
dc.descriptionTutor: Gergely Neu
dc.description.abstractWe study Thompson Sampling for prediction with expert advice. With Follow the leader, and Follow the perturbed leader strategies, we present relevant results in order to proceed with Thompson Sampling Algorithm. Using a similar strategy used for studying Follow the perturbed leader, we decompose the regret in three terms. We study the expressions of choosing an expert from a set of experts. Here we show some interesting equivalences between the probability at time t, and the probability of a cheating forecaster which can see in the future t + 1. Finally, we present some experimental cases xing a nal time T. We analyze how the model selects the expert with the best performance. We obtain strong evidence to bound the diference between the cheating forecaster and the true one, following an exponential growth similar to T.ca
dc.format.mimetypeapplication/pdf*
dc.identifier.urihttp://hdl.handle.net/10230/49223
dc.language.isoengca
dc.rightsAttribution-NonCommercial- NoDerivs 3.0 Spainca
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/3.0/es/ca
dc.subject.otherAlgorismes
dc.titleThompson sampling for prediction with expert adviceca
dc.typeinfo:eu-repo/semantics/masterThesisca

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TFM_Claudio_Sanchez_Perez_de_Amezaga.pdf
Size:
664.13 KB
Format:
Adobe Portable Document Format
Description:

License

Rights