2nd collaboration workshop on Reinforcement Learning for Autonomous Accelerators (RL4AA'24)

Name: 2nd collaboration workshop on Reinforcement Learning for Autonomous Accelerators (RL4AA'24)
Start: 2024-02-05T08:05:00+01:00
End: 2024-02-07T16:30:00+01:00
Location: Universität Salzburg (Paris-Lodron-Universität)

Feb 5 – 7, 2024

Universität Salzburg (Paris-Lodron-Universität)

Europe/Berlin timezone

Registration and call for abstracts extended to 5 January

Contact

simon.hirlaender@plus.ac.at

The Geometry of Reinforcement Learning: Insights from the Dual Linear Program

Not scheduled

10m

Blue lecture hall (Universität Salzburg (Paris-Lodron-Universität))

Blue lecture hall

Universität Salzburg (Paris-Lodron-Universität)

Hellbrunnerstrasse 34 5020 Salzburg

Student Talk Student Session

Nikola Milosevic (Max Planck Institute for Human Cognitive and Brain Sciences)

Reinforcement Learning (RL) has become a cornerstone of machine learning, showcasing remarkable success in addressing real-world control problems and providing insights into cognitive processes in the brain. However, navigating the intricacies of modern RL proves challenging due to its numerous moving parts, escalating agent complexity, and the application of deep learning in a non-i.i.d. setting. The inherent challenge of intuitively reasoning about RL stems, in part, from its time-dependent and recursive nature. During this presentation, we explore the dual linear program and the intuitions it can offer. What traditionally serves as a theoretical construct for proving theorems emerges as a valuable tool for developing intuitions and facilitating the exploration of higher-level questions. We will focus on two practical demonstrations that underscore the significance of this perspective: 1) designing policy optimization algorithms and 2) pretraining RL agents. During the first half of this presentation, I will review the dual linear program and its geometry, aiming to uncover novel policy optimization strategies. In the second part, I will provide a preview of how the linear program can be generalized to convex MDPs, resulting in pretraining objectives similar to representation learning with the Variational Autoencoder.

Possible contributed talk	Yes
Are you a student?	Yes

Nikola Milosevic (Max Planck Institute for Human Cognitive and Brain Sciences) Johannes Müller (RWTH Aachen)

Dr Nico Scherf (MPI CBS) Semih Cayçı (RWTH Aachen)

There are no materials yet.

2nd collaboration workshop on Reinforcement Learning for Autonomous Accelerators (RL4AA'24)

Contact

The Geometry of Reinforcement Learning: Insights from the Dual Linear Program

Blue lecture hall

Universität Salzburg (Paris-Lodron-Universität)

Speaker

Description

Primary authors

Co-authors

Presentation materials

Choose timezone

2nd collaboration workshop on Reinforcement Learning for Autonomous Accelerators (RL4AA'24)

Contact

Speaker

Description

Primary authors

Co-authors

Presentation materials