Action Projection in Safe Reinforcement Learning: Policy vs Environment
Researchers compare Safe Environment RL (SE‑RL) and Safe Policy RL (SP‑RL), showing that action aliasing hurts SP‑RL more, but a penalty term narrows the gap. Read more: getnews.me/action-projection-in-saf... #saferl #actionaliasing
0
0
0
0