humanspiral@lemmy.ca to LocalLLaMA@sh.itjust.worksEnglish · 7 months agoautoround (optimized for intel but works on amd) integer quantization provides good CPU performance, and good accuracy benchmarks.github.comexternal-linkmessage-square1linkfedilinkarrow-up10arrow-down10
arrow-up10arrow-down1external-linkautoround (optimized for intel but works on amd) integer quantization provides good CPU performance, and good accuracy benchmarks.github.comhumanspiral@lemmy.ca to LocalLLaMA@sh.itjust.worksEnglish · 7 months agomessage-square1linkfedilink
minus-squarehendrik@palaver.p3x.delinkfedilinkEnglisharrow-up1·edit-27 months agoSo… Any context on how it compares to other quantization techniques? Is it faster or slower at similar accuracy?
So… Any context on how it compares to other quantization techniques? Is it faster or slower at similar accuracy?