Tags: justinsb/llama.cpp

b4735

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CUDA: use async data loading for FlashAttention (ggml-org#11894)

* CUDA: use async data loading for FlashAttention

---------

Co-authored-by: Diego Devesa <[email protected]>

b4734

update release requirements (ggml-org#11897)

b4733

server : fix divide-by-zero in metrics reporting (ggml-org#11915)

b4732

vulkan: implement several ops relevant for ggml_opt (ggml-org#11769)

* vulkan: support memset_tensor

* vulkan: support GGML_OP_SUM

* vulkan: implement GGML_OP_ARGMAX

* vulkan: implement GGML_OP_SUB

* vulkan: implement GGML_OP_COUNT_EQUAL

* vulkan: implement GGML_OP_OPT_STEP_ADAMW

* vulkan: fix check_results RWKV_WKV6 crash and memory leaks

* vulkan: implement GGML_OP_REPEAT_BACK

* tests: remove invalid test-backend-ops REPEAT_BACK tests

* vulkan: fix COUNT_EQUAL memset using a fillBuffer command

b4731

server : bump httplib to 0.19.0 (ggml-org#11908)

b4730

common : Fix a typo in help (ggml-org#11899)

This patch fixes a typo in command help.
prefx -> prefix

Signed-off-by: Masanari Iida <[email protected]>

b4728

vulkan: support multi/vision rope, and noncontiguous rope (ggml-org#11902)

b4727

metal : fix the crash caused by the lack of residency set support on Intel Macs. (ggml-org#11904)

b4724

metal : optimize dequant q6_K kernel (ggml-org#11892)

b4722

repo : update links to new url (ggml-org#11886)

* repo : update links to new url

ggml-ci

* cont : more urls

ggml-ci