-
Notifications
You must be signed in to change notification settings - Fork 13.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Metal Pool 1D Kernel
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#16429
opened Oct 5, 2025 by
ThoreKoritzius
Loading…
fix: add generic fallback to detect trailing <think> tags in Jinja templates and handle forced-open reasoning blocks
testing
Everything test related
#16426
opened Oct 4, 2025 by
ServeurpersoCom
Loading…
ci : refactor sdk caching to minimize storage
devops
improvements to build systems and github actions
#16414
opened Oct 3, 2025 by
CISC
Loading…
server / ranking : add sorting and management of top_n
examples
server
#16403
opened Oct 3, 2025 by
YannFollet
Loading…
refactor: centralize CoT parsing in backend for streaming mode
examples
server
testing
Everything test related
#16394
opened Oct 2, 2025 by
ServeurpersoCom
Loading…
tests : add -INF blocks to the KQ mask in the FA tests
testing
Everything test related
#16380
opened Oct 2, 2025 by
ggerganov
Loading…
metal : index FA blocks
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16372
opened Oct 1, 2025 by
ggerganov
Loading…
model: EmbeddingGemma Adding Support for SentenceTransformers Dense Modules
python
python script changes
#16367
opened Oct 1, 2025 by
sfallah
Loading…
Add support to New feature or request
examples
server/webui
server
◁think▷...◁/think▷
format and DRY the thinking processing logic
enhancement
#16364
opened Sep 30, 2025 by
allozaur
Loading…
Add ARANGE Operator to SYCL Backend (Small & Focused Changes)
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16362
opened Sep 30, 2025 by
GittyBurstein
Loading…
feat: render user content as markdown option
examples
server
#16358
opened Sep 30, 2025 by
ServeurpersoCom
Loading…
SYCL SET operator optimized for F32 tensors
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16350
opened Sep 30, 2025 by
GittyBurstein
Loading…
Update build.md
documentation
Improvements or additions to documentation
#16346
opened Sep 30, 2025 by
refine360-debug
Loading…
ggml-cpu : inspect -march and -mcpu to found the CPU
ggml
changes relating to the ggml tensor library for machine learning
#16333
opened Sep 29, 2025 by
angt
Loading…
ggml : fix unaligned access in AMX code
ggml
changes relating to the ggml tensor library for machine learning
#16315
opened Sep 28, 2025 by
ggerganov
Loading…
ggml : remove SVE paths
ggml
changes relating to the ggml tensor library for machine learning
#16314
opened Sep 28, 2025 by
ggerganov
Loading…
Enable Intel AMX acceleration while in CPU/GPU hybrid with new "--amx" toggle.
examples
#16310
opened Sep 28, 2025 by
Gadflyii
Loading…
cuda : Disable host buffers on integrated GPUs (#15034)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16308
opened Sep 28, 2025 by
ai-fonsi
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.