DreamReasoner-8B: Block-Size Curriculum Learning for Diffusion Reasoning Models
Researchers developed DreamReasoner-8B, an open-source block diffusion model for long chain-of-thought (CoT) reasoning. They studied how block sizes during training and inference impact performance on long-CoT tasks. The study found that training with large block sizes results in poor reasoning, while smaller block sizes during training and inference improve performance. This informs best practices for scaling block diffusion models to complex reasoning tasks.
- DreamReasoner-8B is an open-source block diffusion model for long-CoT reasoning.
- Training block size significantly impacts long-CoT performance.
- Smaller block sizes during training and inference improve reasoning.