Tags: l29ah/llama.cpp

master-eeaa7b0

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
ggml : multi-thread ggml_rope() (~3-4 times faster on M1) (ggml-org#781)

master-986b6ce

ggml, llama : avoid heavy V transpose + improvements (ggml-org#775)

ggml :

- added ggml_view_3d()
- ggml_view_tensor() now inherits the stride too
- reimplement ggml_cpy() to account for dst stride
- no longer require tensor->data to be memory aligned

llama :

- compute RoPE on 32-bit tensors (should be more accurate)
- store RoPE-ed K in the KV cache
- store transposed V in the KV cache (significant speed-up)
- avoid unnecessary Q copy
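
To illustrate why a copy that respects the destination stride matters for keeping V transposed in the cache, here is a minimal Python sketch. It is not ggml code: the function name `strided_copy_2d`, the flat-list buffers, and the use of element strides (ggml's strides are in bytes) are assumptions made for illustration.

```python
def strided_copy_2d(src, dst, rows, cols, dst_offset, dst_row_stride, dst_col_stride):
    """Copy a contiguous row-major rows x cols block from src into the flat
    buffer dst, placing element (r, c) at
    dst_offset + r * dst_row_stride + c * dst_col_stride (strides in elements)."""
    for r in range(rows):
        for c in range(cols):
            dst[dst_offset + r * dst_row_stride + c * dst_col_stride] = src[r * cols + c]


# Hypothetical toy sizes: a cache with room for 4 tokens of 2 values each.
n_ctx, n_embd = 4, 2

# The cache holds V transposed: n_embd rows of n_ctx entries, row-major.
cache_vt = [0.0] * (n_embd * n_ctx)

# V for two new tokens, row-major: token 0 = [1, 2], token 1 = [3, 4].
new_v = [1.0, 2.0, 3.0, 4.0]
n_past = 0

# Stepping one token moves 1 element in the cache, stepping one embedding
# dimension moves n_ctx elements, so the write itself performs the transpose.
strided_copy_2d(new_v, cache_vt, rows=2, cols=n_embd,
                dst_offset=n_past, dst_row_stride=1, dst_col_stride=n_ctx)
```

In this layout the values of one embedding dimension across all cached tokens sit next to each other, so the transpose is paid once per written token instead of on every read of V.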

master-5a8c4f6

llama : define non-positive top_k; top_k range check (ggml-org#779)

* Define non-positive top_k; top_k range check

* minor : brackets

---------

Co-authored-by: Georgi Gerganov
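
The commit message does not spell out the chosen semantics. One plausible reading, sketched below in Python, is that a non-positive `top_k` means "keep the whole vocabulary" and that values larger than the vocabulary are clamped to it. Both rules, and the helper names `effective_top_k` and `top_k_filter`, are assumptions for illustration rather than the project's code.

```python
def effective_top_k(top_k, n_vocab):
    """Return the number of candidates to keep.
    Assumption: top_k <= 0 disables the filter, and top_k > n_vocab is clamped."""
    if top_k <= 0 or top_k > n_vocab:
        return n_vocab
    return top_k


def top_k_filter(logits, top_k):
    """Return the indices of the kept candidates, highest logit first."""
    k = effective_top_k(top_k, len(logits))
    order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    return order[:k]
```

For example, `top_k_filter([0.1, 2.0, 1.0], 2)` keeps indices 1 and 2, while `top_k_filter([0.1, 2.0, 1.0], 0)` keeps all three under the assumed rule.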

master-0c44427

make : missing host optimizations in CXXFLAGS (ggml-org#763)

master-cd7fa95

Define non-positive temperature behavior (ggml-org#720)
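
The title alone does not say which behavior was chosen. A common convention, assumed in the Python sketch below, is that a temperature of zero or less selects the highest-logit token deterministically, since dividing logits by a non-positive temperature is not meaningful. The function name `sample_token` and its `rng` parameter are hypothetical.

```python
import math
import random


def sample_token(logits, temp, rng=random):
    """Pick a token index from raw logits.
    Assumption: temp <= 0 means greedy selection (argmax), no randomness."""
    if temp <= 0:
        return max(range(len(logits)), key=lambda i: logits[i])

    # Otherwise scale by temperature and sample from the softmax distribution.
    scaled = [x / temp for x in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]
```

Passing a seeded `random.Random` instance as `rng` makes the sampling branch reproducible in tests.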

master-437e778

10+% performance improvement of ggml_vec_dot_q4_0 on AVX2 (ggml-org#654)

* Performance improvement of AVX2 code
* Fixed problem with MSVC compiler
* Reviewer comments: removed double semicolon, deleted empty line 1962

master-53dbba7

Windows: reactivate sigint handler after each Ctrl-C (ggml-org#736)

master-e986f94

Added API for getting/setting the kv_cache (ggml-org#685)

The API provides access methods for retrieving the current memory buffer of the kv_cache and its token count.
It also contains a method for setting the kv_cache from a memory buffer.

This makes it possible to load and save history. Maybe support a --cache-prompt parameter as well?
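
The load/save idea can be sketched in Python as follows. Everything here is hypothetical: `ToyContext` is a stand-in for a model context rather than a llama.cpp type, its method names only mirror the three operations described above (get the buffer, get the token count, set the buffer), and the file layout is an assumption.

```python
import io
import struct

HEADER = "<iQ"  # assumed layout: int32 token count, uint64 buffer size


class ToyContext:
    """Hypothetical stand-in for a context that owns a KV cache buffer."""

    def __init__(self, kv_size):
        self.kv = bytearray(kv_size)
        self.n_tokens = 0

    def get_kv_cache(self):
        return bytes(self.kv)

    def get_kv_cache_token_count(self):
        return self.n_tokens

    def set_kv_cache(self, buf, n_tokens):
        if len(buf) != len(self.kv):
            raise ValueError("buffer size does not match this context")
        self.kv[:] = buf
        self.n_tokens = n_tokens


def save_history(ctx, f):
    """Write the token count, the buffer size, and the raw buffer to f."""
    buf = ctx.get_kv_cache()
    f.write(struct.pack(HEADER, ctx.get_kv_cache_token_count(), len(buf)))
    f.write(buf)


def load_history(ctx, f):
    """Read back what save_history wrote and install it in ctx."""
    n_tokens, size = struct.unpack(HEADER, f.read(struct.calcsize(HEADER)))
    ctx.set_kv_cache(f.read(size), n_tokens)
```

A saved buffer is only meaningful for a context with the same cache size, which is why the sketch rejects a mismatched length instead of truncating or padding.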

Co-authored-by: Pavol Rusnak

master-c4f89d8

make : use -march=native -mtune=native on x86 (ggml-org#609)

master-c0bb1d3

ggml : change ne to int64_t (ggml-org#626)