Efficiency: DPad-enhanced dLLMs achieve up to a 61.39× speedup over vanilla dLLM baselines. Accuracy: DPad-enhanced dLLMs achieve up to a +26.46% improvement over vanilla dLLM baselines. (Evaluation conducted on NVIDIA A100-PCIe-80GB GPUs.)
DMax is a new dLLM paradigm achieving aggressive parallel decoding while preserving generation quality.
Official repository for the paper "Accelerating Diffusion LLMs via Adaptive Parallel Decoding".
Code accompanying the paper "Layer Collapse in Diffusion Language Models" by Alexander Conzelmann, Albert Catalan-Tatjer, and Shiwei Liu (Tübingen AI Center / MPI for Intelligent Systems / ELLIS Institute Tübingen). Link: https://arxiv.org/abs/2605.06366