Pull requests: ggml-org/llama.cpp
[CANN]feat: Increase the way memory allocation is managed
ggml
changes relating to the ggml tensor library for machine learning
#12875
opened Apr 10, 2025 by
bachelor-dou
Draft
llama-bench: enhance benchmark with improved token throughput measurements
examples
#12874
opened Apr 10, 2025 by
thevishalagarwal
SYCL: Support sycl_ext_oneapi_limited_graph
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
ggml-impl.h: Fix build issues on AArch64 with CUDA version 12
ggml
changes relating to the ggml tensor library for machine learning
#12872
opened Apr 10, 2025 by
zhengjun-xing
ggml : add SSE 4.2 variant for CPUs without AVX
ggml
changes relating to the ggml tensor library for machine learning
#12871
opened Apr 10, 2025 by
slaren
convert : proper tensor name mapping for llama4
python
python script changes
#12870
opened Apr 10, 2025 by
ngxson
clip : use smart pointer (⚠️ breaking change)
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
#12869
opened Apr 10, 2025 by
ngxson
opencl: fix incorrect local_size index in profiling log
ggml
changes relating to the ggml tensor library for machine learning
#12868
opened Apr 10, 2025 by
kimminsu38oo
glm4-4-0414 : add Glm4Model implementation for GLM-4-0414
examples
python
python script changes
server
#12867
opened Apr 10, 2025 by
zRzRzRzRzRzRzR
[CANN]Opt ROPE optimization
ggml
changes relating to the ggml tensor library for machine learning
#12865
opened Apr 10, 2025 by
noemotiovon
Replace freediskspace to free_disk_space in docker.yml
devops
improvements to build systems and github actions
#12861
opened Apr 10, 2025 by
yeahdongcn
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858
opened Apr 10, 2025 by
Alcpz
2 of 3 tasks
vulkan: use aligned loads for flash attention mask
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#12853
opened Apr 9, 2025 by
jeffbolznv
gguf-py: byteswapping improvements
python
python script changes
#12851
opened Apr 9, 2025 by
AlekseiNikiforovIBM
metal : add memory pool for temp allocs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
ggml: fixes #12846 compilation error
ggml
changes relating to the ggml tensor library for machine learning
#12848
opened Apr 9, 2025 by
taronaeo
Llama-3_1-Nemotron-Ultra-253B-v1 support
python
python script changes
#12843
opened Apr 9, 2025 by
ymcki
convert : write tensors in parallel
performance
Speed related topics
python
python script changes
#12837
opened Apr 8, 2025 by
compilade
3 of 6 tasks
llamax : add a possible implementation of a simple API for llama.cpp …
build
Compilation issues
#12835
opened Apr 8, 2025 by
cyrilleberger
Add AVX512 implementation of GEMM - q4kx8
ggml
changes relating to the ggml tensor library for machine learning
#12829
opened Apr 8, 2025 by
Srihari-mcw
ci: fix cross-compile sync issues
devops
improvements to build systems and github actions
#12804
opened Apr 7, 2025 by
bandoti