≃
Calvin's Notebook
Home
About
Archive
Notes
Kv Cache Quantization
PolarQuant: Quantizing KV Caches with Polar Transformation
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate