Advertisement · 728 × 90

Posts by Raphaël Fonteneau

Post image

Today, we welcome Dr. Ir. @adrienbolland.bsky.social in the context of the Reinforcement Learning class for a lesson about policy gradient methods. Many thanks to Adrien for sharing his knowledge about these methods that have enabled many successful implementations! #ReinforcementLearning

1 month ago 3 1 0 0
Preview
Off-Policy Maximum Entropy RL with Future State and Action Visitation Measures We introduce a new maximum entropy reinforcement learning framework based on the distribution of states and actions visited by a policy. More precisely, an intrinsic reward function is added to the re...

Check our work on max entropy RL! We introduce an off-policy method to maximize the entropy of the future state-action visitation distribution, leading to policies that explore effectively and achieve high performance 🎯

Link 📑 arxiv.org/abs/2412.06655

#RL #MaxEntRL #Exploration

1 year ago 8 4 0 0