Diminishing Return of Value Expansion Methods
Authors: Daniel Palenicek, Michael Lutter, João Carvalho, Daniel Dennert, Faran Ahmad, and Jan Peters Fellow
pre-print -> arxiv.org/abs/2412.20537
#rl #reinforcement_learning #modelbased_rl #value_expension
2
0
0
1