What quantization were you using? I've been getting some weird results with 34b ...

gpjt on Feb 28, 2024 | parent | context | favorite | on: Ask HN: People who switched from GPT to their own ...

What quantization were you using? I've been getting some weird results with 34b quantized to 4 bits -- glitching, dropped tokens, generating Java rather than Python as requested. But 7b, even at 4 bits, works OK. Posted about it earlier on this evening: https://www.gilesthomas.com/2024/02/llm-quantisation-weirdne...