-
Notifications
You must be signed in to change notification settings - Fork 14.8k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
vulkan: optimized coopmat matmul perf for IntelGPU
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19320
opened Feb 4, 2026 by
fish-jiang
•
Draft
gguf-py: Bump sentencepiece version
python
python script changes
#19319
opened Feb 4, 2026 by
Ahajha
Loading…
[WebGPU] Plug memory leaks and free resources on shutdown
ggml
changes relating to the ggml tensor library for machine learning
#19315
opened Feb 4, 2026 by
nikhilJain17
•
Draft
chore: update cpp-httplib version
python
python script changes
script
Script related
#19313
opened Feb 4, 2026 by
taronaeo
Loading…
server: make UI textarea fields resizable
examples
server
#19312
opened Feb 4, 2026 by
ssam18
Loading…
ggml-webgpu: JIT compile binary operators and handle binding overlaps
ggml
changes relating to the ggml tensor library for machine learning
#19310
opened Feb 4, 2026 by
abhijitramesh
Loading…
vulkan: make FA mask/softcap enables spec constants
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#19309
opened Feb 3, 2026 by
jeffbolznv
Loading…
sycl: add F16 support for GGML_OP_CEIL
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#19306
opened Feb 3, 2026 by
NechamaKrashinski
Loading…
vulkan: Set k_load_shmem to false when K is too large
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19301
opened Feb 3, 2026 by
jeffbolznv
Loading…
vulkan: fix non-contig rope
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19299
opened Feb 3, 2026 by
jeffbolznv
Loading…
tests : add non-cont, inplace rope tests
testing
Everything test related
#19296
opened Feb 3, 2026 by
ggerganov
Loading…
CANN: Multi-stream support
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
Support Step3.5-Flash
model
Model specific
python
python script changes
#19283
opened Feb 3, 2026 by
forforever73
Loading…
[WIP] ggml-hexagon: convert f32 to f16 - fa opt part3
ggml
changes relating to the ggml tensor library for machine learning
vulkan: Preprocess FA mask to detect all-neg-inf and all-zero.
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#19281
opened Feb 3, 2026 by
jeffbolznv
Loading…
fix: only reset LoRa configs when they have changed from previous batch
examples
server
#19280
opened Feb 3, 2026 by
agent-enemy-2
Loading…
fix: use physical cores for --threads auto-detect (#19110)
#19260
opened Feb 2, 2026 by
ingyukoh
Loading…
Add test for vk_buffer from host memory
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#19254
opened Feb 1, 2026 by
sredman
Loading…
[SYCL] fix segmentation fault on consumer CPUs without bfloat16 hardware
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#19247
opened Feb 1, 2026 by
sajonoso
Loading…
Feat: Adding token healing support for auto complete
examples
server
#19238
opened Feb 1, 2026 by
agent-enemy-2
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.