Tencent’s WeDLM 8B: When Diffusion Models Beat Autoregressive LLMs at Their Own Game
Tencent’s diffusion-based language model runs inference 3-6× faster than Qwen3-8B served with vLLM on math reasoning, challenging the token-by-token generation paradigm that has dominated LLMs since GPT-2.