Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
DMax is a new dLLM paradigm achieving aggressive parallel decoding while preserving generation quality.
Official repository for the paper: Accelerating Diffusion LLMs via Adaptive Parallel Decoding
Code accompanying the paper "Layer Collapse in Diffusion Language Models" by Alexander Conzelmann, Albert Catalan-Tatjer, and Shiwei Liu (Tübingen AI Center / MPI for Intelligent Systems / ELLIS Institute Tübingen). Link: https://arxiv.org/abs/2605.06366