LLaDA2.0’s MoE-powered diffusion architecture challenges everything we know about local AI deployment
New framework turns BERT into a diffusion-based chatbot using discrete diffusion, and it’s rewriting how we think about language generation
It turns out that BERT’s masked language modeling objective looks suspiciously like a single step of discrete text diffusion.