Cerebras releases REAP-pruned GLM-4.6 variants at 25%, 30%, and 40% sparsity with FP8 quantization – but do they actually work?
REAP pruning outperforms merging in MoE models, enabling near-lossless compression of 480B-parameter giants down to sizes that fit on local hardware