Pull requests: ggml-org/llama.cpp

Pull requests list

fix: fail closed when grammar parsing fails (#19051)
#19349 opened Feb 5, 2026 by ingyukoh
opencl: add general Q6_K mm and Q4_K mv (labels: ggml, OpenCL)
#19347 opened Feb 4, 2026 by lhez Draft
Fix link failures in s390x (labels: ggml)
#19341 opened Feb 4, 2026 by WhyNotHugo
MSVC regex fix
#19340 opened Feb 4, 2026 by Iemand005
CUDA: Fix non-contig rope (labels: ggml, Nvidia GPU)
#19338 opened Feb 4, 2026 by ORippler
metal : skip loading all-zero mask (labels: Apple Metal, ggml)
#19337 opened Feb 4, 2026 by ggerganov
Add missing ggml.pc.in (labels: ggml)
#19334 opened Feb 4, 2026 by WhyNotHugo
vendor : update BoringSSL to 0.20260204.0
#19333 opened Feb 4, 2026 by angt
metal : add diag (labels: Apple Metal, ggml)
#19330 opened Feb 4, 2026 by ggerganov
scripts: update corpus of compare-logprobs (labels: python, script)
#19326 opened Feb 4, 2026 by ngxson
vulkan: optimized coopmat matmul perf for IntelGPU (labels: ggml, Vulkan)
#19320 opened Feb 4, 2026 by fish-jiang Draft
gguf-py: Bump sentencepiece version (labels: python)
#19319 opened Feb 4, 2026 by Ahajha
cleanup llama-quantize --help output (labels: examples)
#19317 opened Feb 4, 2026 by ddh0
[WebGPU] Plug memory leaks and free resources on shutdown (labels: ggml)
#19315 opened Feb 4, 2026 by nikhilJain17 Draft
ggml-webgpu: JIT compile binary operators and handle binding overlaps (labels: ggml)
#19310 opened Feb 4, 2026 by abhijitramesh
vulkan: make FA mask/softcap enables spec constants (labels: ggml, testing, Vulkan)
#19309 opened Feb 3, 2026 by jeffbolznv
sycl: add F16 support for GGML_OP_CEIL (labels: documentation, ggml, SYCL)
#19306 opened Feb 3, 2026 by NechamaKrashinski
vulkan: Set k_load_shmem to false when K is too large (labels: ggml, Vulkan)
#19301 opened Feb 3, 2026 by jeffbolznv
ci : add metal server workflows (labels: devops)
#19293 opened Feb 3, 2026 by ggerganov Draft
1 task
CANN: Multi-stream support (labels: Ascend NPU, ggml)
#19284 opened Feb 3, 2026 by hipudding Draft
Support Step3.5-Flash (labels: model, python)
#19283 opened Feb 3, 2026 by forforever73
[WIP] ggml-hexagon: convert f32 to f16 - fa opt part3 (labels: ggml)
#19282 opened Feb 3, 2026 by chraac Draft