DMax

DMax is a new dLLM paradigm achieving aggressive parallel decoding while preserving generation quality.

Features

Inference Optimization - Aggressive parallel decoding for diffusion LLMs.

Crys-Chen/DPad

Efficiency: DPad-enhanced dLLMs achieve up to a 61.39× speedup over vanilla dLLM baselines. Accuracy: DPad-enhanced dLLMs achieve up to a +26.46% improvement over vanilla dLLM baselines. (Evaluation conducted on NVIDIA A100-PCIe-80GB GPUs).

cychomatica/FreeDave

21View on GitHub

Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models

danielmisrael/apd

20View on GitHub

Official repository for the paper: Accelerating Diffusion LLMs via Adaptive Parallel Decoding

Conzel/super-outlier-dlm

1View on GitHub

Code accompanying the paper "Layer Collapse in Diffusion Language Models" by Alexander Conzelmann, Albert Catalan-Tatjer, and Shiwei Liu (Tübingen AI Center / MPI for Intelligent Systems / ELLIS Institute Tübingen). Link: https://arxiv.org/abs/2605.06366

Crys-Chen/DPad

63View on GitHub

cychomatica/FreeDave

21View on GitHub

Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models

danielmisrael/apd

20View on GitHub

Official repository for the paper: Accelerating Diffusion LLMs via Adaptive Parallel Decoding

Conzel/super-outlier-dlm

1View on GitHub

czg1225DMax

Features

Open-source alternatives to DMax

Crys-Chen/DPad

cychomatica/FreeDave

danielmisrael/apd

Conzel/super-outlier-dlm

Star history

Open-source alternatives to DMax

Crys-Chen/DPad

cychomatica/FreeDave

danielmisrael/apd

Conzel/super-outlier-dlm