≃ Calvin's Notebook
home archive notes
  • PolarQuant: Quantizing KV Caches with Polar Transformation
  • QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
  • TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
Site proudly generated by Hakyll