diffusionbased

1 Articles
Reinforcement learning boosts reasoning skills in new diffusion-based language model d1
Tech

Reinforcement learning boosts reasoning skills in new diffusion-based language model d1

Log Probability Estimation in diffu-GRPO. Credit: arXiv (2025). DOI: 10.48550/arxiv.2504.12216 A team of AI researchers at the University of California, Los Angeles, working...