Imitation learning and policy representation for constrained reinforcement learning

dc.contributor.author: Caicoya Ros, Ana
dc.date.accessioned: 2024-11-05T16:27:08Z
dc.date.available: 2024-11-05T16:27:08Z
dc.date.issued: 2024
dc.description: Master's thesis for: Master in Intelligent Interactive Systems
dc.description: Tutor: Miguel Calvo-Fullana
dc.description.abstract: This thesis explores the integration of imitation learning and policy representation within constrained reinforcement learning (CRL) to improve decision-making in environments with stringent limitations. Reinforcement Learning (RL) is a machine learning paradigm that trains agents to make decisions by maximizing cumulative reward. Real-world scenarios, however, often impose additional constraints, such as safety and regulatory requirements, and CRL is needed to ensure these constraints are respected while performance is optimized. The research addresses the challenges of solving CRL problems using Generative Adversarial Imitation Learning (GAIL) within the framework of Constrained Markov Decision Processes (CMDPs), which provide a mathematical structure for incorporating constraints into the RL process. The methodology has two phases. The first phase uses the state-augmented CRL algorithm to obtain policies that satisfy the constraints in an augmented state space that incorporates the dual variables. The second phase uses GAIL to map these policies back to the original state space, leveraging imitation learning techniques to ensure robust performance. Numerical results from simulations demonstrate that this approach achieves constraint satisfaction while maintaining high performance. The findings indicate that integrating CRL with imitation learning can lead to significant improvements in policy robustness and compliance with constraints. This research contributes to the broader field of machine learning by providing new insights and methods for developing constrained intelligent systems.
dc.format.mimetype: application/pdf
dc.identifier.uri: http://hdl.handle.net/10230/68440
dc.language.iso: eng
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.accessRights: info:eu-repo/semantics/openAccess
dc.rights.uri: https://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject.other: Machine learning
dc.title: Imitation learning and policy representation for constrained reinforcement learning
dc.type: info:eu-repo/semantics/masterThesis

Files

Original bundle

Name: Caicoya_2024.pdf
Size: 1.12 MB
Format: Adobe Portable Document Format