PRISM is an efficient inference framework designed for Discrete Diffusion Language Models (dLLMs), focusing on a favorable performance-efficiency trade-off by matching Best-of-N performance with substantially fewer Function Evaluations (NFE).
Features
Inference Optimization - Test-time scaling via hierarchical search and self-verification.