cwbe coordinatez:
101
63533
63608
8771344
9305792
9307042

ABSOLUT
KYBERIA
permissions
you: r,
system: public
net: yes

neurons

stats|by_visit|by_K
source
tiamat
K|my_K|given_K
last
commanders
polls

total descendants::2
total children::1
5 ❤️


show[ 2 | 3] flat


Hm, zda sa ze toto je dost massive, clanok archivujem aj sem na kyberiu

https://arxiv.org/pdf/2504.19874

Vector quantization, a problem rooted in Shannon’s source coding theory, aims to quantize
high-dimensional Euclidean vectors while minimizing distortion in their geometric structure. We propose TurboQuant to address both mean-squared error (MSE) and inner product distor-
tion, overcoming limitations of existing methods that fail to achieve optimal distortion rates. Our data-oblivious algorithms, suitable for online applications, achieve near-optimal distortion rates (within a small constant factor) across all bit-widths and dimensions. TurboQuant achieves this by randomly rotating input vectors, inducing a concentrated Beta distribution on coordinates, and leveraging the near-independence property of distinct coordinates in high dimensions to simply apply optimal scalar quantizers per each coordinate. Recognizing that MSE-optimal quantizers introduce bias in inner product estimation, we propose a two-stage approach: applying an MSE quantizer followed by a 1-bit Quantized JL (QJL) transform on the residual, resulting in an unbiased inner product quantizer. We also provide a formal proof of the information-theoretic lower bounds on best achievable distortion rate by any vector quantizer, demonstrating that TurboQuant closely matches these bounds, differing only by a small constant (≈ 2.7) factor. Experimental results validate our theoretical findings, showing that for KV cache quantization, we achieve absolute quality neutrality with 3.5 bits per channel and marginal quality degradation with 2.5 bits per channel. Furthermore, in nearest neighbor search tasks, our method outperforms existing product quantization techniques in recall while reducing indexing time to virtually zero.


download here: turboquant paper



00000101000635330006360808771344093057920930704209307044
maniac
 maniac      02.04.2026 - 12:19:35 , level: 1, UP   NEW
pustite na to Johna Carmacka, to len potom bude Turbo :)

0000010100063533000636080877134409305792093070420930704409307045
just_minding_my_business
 just_minding_my_business      02.04.2026 - 12:21:06 , level: 2, UP   NEW
Carmack robil aplikovanú informatiku, nie vedu.