qrios 9 months ago | on: GGUF, the Long Way Around

Thank you for the reference to the CUDA file [1]. It's always nice to see how complex data structures are handled on GPUs. Does anyone have any idea what the bit patterns starting at line 1529 are for?

[1] https://github.com/ggerganov/llama.cpp/blob/master/ggml-cuda...

thrtythreeforty 9 months ago

Those have to do with dequantization, which involves table lookups and some adjusting math.
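
To make "table lookup plus adjusting math" concrete, here is a rough sketch of the general idea in Python. It is not the code from the linked file: the codebook values, the 4-bit packing, and the names `CODEBOOK`, `dequantize_block`, `scale`, and `offset` are all made up for illustration.

```python
# Hypothetical 16-entry codebook mapping each 4-bit code to a signed level.
# The values are invented for this example and are NOT taken from ggml-cuda.
CODEBOOK = [-8, -6, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5, 6, 8, 10, 12]


def dequantize_block(packed: bytes, scale: float, offset: float = 0.0) -> list[float]:
    """Unpack two 4-bit codes per byte, look each one up, then rescale.

    Assumed layout (illustrative only): low nibble first, then high nibble.
    """
    out = []
    for byte in packed:
        lo = byte & 0x0F          # low 4 bits select a codebook entry
        hi = (byte >> 4) & 0x0F   # high 4 bits select another entry
        # Table lookup, followed by the "adjusting math": scale and shift.
        out.append(CODEBOOK[lo] * scale + offset)
        out.append(CODEBOOK[hi] * scale + offset)
    return out


if __name__ == "__main__":
    # One byte 0x60 holds code 0 (low nibble) and code 6 (high nibble).
    print(dequantize_block(bytes([0x60]), scale=0.5))
```

In a real quantized format the table and the per-block scale would come from the format definition and the tensor data; the point here is only that fixed bit patterns in the source can serve as lookup tables indexed by the packed codes.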