A Multi-Fidelity Control Variate Approach for Policy Gradient Estimation
Xinjie Liu, Cyrus Neary, Kushagra Gupta et al.
Action editor: Adam White
https://openreview.net/forum?id=zAo0L7Dcqt
#reinforcement #reinforce #trained
1
0
0
0