Tags: dfriehs/llama.cpp

b2023

ggml : fix IQ3_XXS on Metal (ggml-org#5219)

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

b1878

llama : apply classifier-free guidance to logits directly (ggml-org#4951)

b1874

CUDA: faster dequantize kernels for Q4_0 and Q4_1 (ggml-org#4938)

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

b1863

ggerganov Georgi Gerganov
sync : ggml