KV cache quantization: what FP8/INT8 K and V actually buy you, and where they break

· Dev.to