Learning more LLM quantization #166
Replies: 2 comments
-
I developed all quantization types in
|
Beta Was this translation helpful? Give feedback.
-
Thank you for this humble response ! Now I understand it's doing inference on quantized weights. But I get lost trying to understand llama cpp codebase. how should I navigate this codebase ? Do I need to understand fundamentals concept on operating systems, comp.arch, memory-management e.t.c. ? I want to be a programmar like you. Sorry .. lots of questions all over the place :( |
Beta Was this translation helpful? Give feedback.
-
For beginners like me to ML, I wanted to learn what research papers guided the quantization implement in llama.
It might sound silly but we have separate tricks for quantization during training and during evaluation right ?
Beta Was this translation helpful? Give feedback.
All reactions