Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

common : support manually triggering the reasoning budget end sequence testing Everything test related
#23949 opened May 31, 2026 by aldehir Contributor Loading…
[Models] Add support for Xiaomi MiMo-V2.5-ASR multimodal model examples python python script changes
#23946 opened May 31, 2026 by az2204Fwhs Loading…
Remove redundant CUDA copies after gated_delta_net. ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#23940 opened May 31, 2026 by gaugarg-nv Contributor Loading…
speculative : fix out-of-bounds read in ngram-map on prompt shrink
#23936 opened May 31, 2026 by o7si Contributor Loading…
cuda: reset cuda context after reading memory size ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#23935 opened May 31, 2026 by 0cc4m Contributor Loading…
ci: remove redundant or duplicate jobs devops improvements to build systems and github actions
#23927 opened May 31, 2026 by netrunnereve Collaborator Loading…
opencl: fix compiler warnings for non-adreno path ggml changes relating to the ggml tensor library for machine learning OpenCL Issues specific to the OpenCL backend
#23922 opened May 30, 2026 by lhez Contributor Draft
loader: increase async upload staging buffer to 4 MiB
#23915 opened May 30, 2026 by cl0ckt0wer Loading…
ci : disable ccache for msvc windows release jobs devops improvements to build systems and github actions
#23911 opened May 30, 2026 by ggerganov Member Loading…
build: Add vulkan building script
#23908 opened May 30, 2026 by sapbotgit Loading…
cuda: reserve space for quantize kv-cache at startup ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#23907 opened May 30, 2026 by am17an Contributor Loading…
fix: VMM pool cuMemSetAccess for ROCm gfx1151 APU ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#23900 opened May 30, 2026 by ricred Draft
vocab: add normalizer.lowercase support to WPM python python script changes
#23899 opened May 30, 2026 by o7si Contributor Loading…
docs: update HOWTO-add-model.md [no release] documentation Improvements or additions to documentation
#23883 opened May 29, 2026 by Xarbirus Contributor Loading…
metal: template GLU kernels to support f16/f32 Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#23882 opened May 29, 2026 by shrivasshankar Loading…
ui: PWA support devops improvements to build systems and github actions examples script Script related server/ui server
#23871 opened May 29, 2026 by allozaur Contributor Draft
ggml-hip: enable -ffast-math for HIP builds ggml changes relating to the ggml tensor library for machine learning
#23862 opened May 29, 2026 by a-huk Loading…
1 task done
chat: route LiquidAI LFM2.5 through specialized parser testing Everything test related
#23856 opened May 29, 2026 by mattngaw Loading…
agentic: question tool + shared plumbing examples python python script changes server/ui server
#23848 opened May 29, 2026 by LPFchan Loading…
ProTip! Adding no:label will show everything without a label.