The 0.6 Billion Parameter Insult: How Distilled Qwen3 Models Are Humiliating Frontier LLMs
Distilled Qwen3 models with 0.6B-8B parameters are beating GPT-5 and Claude on narrow tasks at 1/100th the cost. Here’s the systematic proof that bigger isn’t better.