카테고리 없음

Quantize Llama models with GGUF and llama.cpp GGML vs. GPTQ vs. NF4

bryan9 2024. 2. 13. 12:00
반응형

https://mlabonne.github.io/blog/posts/Quantize_Llama_2_models_using_ggml.html

 

 

 

 

Quantize Llama models with GGUF and llama.cpp
GGML vs. GPTQ vs. NF4