So… Any context on how it compares to other quantization techniques? Is it faster or slower at similar accuracy?