RL Boosts Masked Diffusion Language Models, Cutting Decoding Steps
EOSER and ASS decoding tricks plus the CJ‑GRPO RL algorithm let LLaDA‑8B‑Instruct keep accuracy while cutting inference steps. The study was submitted in September 2025. Read more: getnews.me/rl-boosts-masked-diffusi... #maskeddiffusion #llada8b
0
0
0
0