KV Cache Quantization Benchmarks: TurboQuant Is Overrated and KVarN Is the Real Deal
Deep benchmarks of Qwen 3.6 27B KV cache quantization methods reveal that TurboQuant’s glory days are behind it, while KVarN shifts the entire quality-per-memory curve.