Efficiency: DPad-enhanced dLLMs achieve up to a 61.39× speedup over vanilla dLLM baselines. Accuracy: DPad-enhanced dLLMs achieve up to a +26.46% improvement over vanilla dLLM baselines. (Evaluation conducted on NVIDIA A100-PCIe-80GB GPUs).
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
Official repository for the paper: Accelerating Diffusion LLMs via Adaptive Parallel Decoding
Code accompanying the paper "Layer Collapse in Diffusion Language Models" by Alexander Conzelmann, Albert Catalan-Tatjer, and Shiwei Liu (Tübingen AI Center / MPI for Intelligent Systems / ELLIS Institute Tübingen). Link: https://arxiv.org/abs/2605.06366