1 article found
Unsloth’s aggressive 2-bit quantization slashes GLM-4.7 from 400GB to 134GB, forcing a reckoning with what ‘good enough’ means for frontier models