≃
Calvin's Notebook
home
archive
notes
☀
🌙
PolarQuant: Quantizing KV Caches with Polar Transformation
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate