Tags: l29ah/llama.cpp
ggml : multi-thread ggml_rope() (~3-4 times faster on M1) (ggml-org#781)
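A minimal sketch of the general row-partitioning pattern such a multi-threaded op can use (this is illustrative, not the actual ggml_rope() code; the function name and rotation are hypothetical):

```cpp
#include <cmath>
#include <thread>
#include <vector>

// Apply an in-place 2D rotation to each pair of columns in every row of a
// row-major matrix, splitting rows across n_threads workers. Thread t
// handles rows t, t + n_threads, t + 2*n_threads, ...
void rotate_rows(std::vector<float>& data, int n_rows, int n_cols,
                 float theta, int n_threads) {
    auto worker = [&](int tid) {
        for (int r = tid; r < n_rows; r += n_threads) {
            for (int c = 0; c + 1 < n_cols; c += 2) {
                const float x0 = data[r*n_cols + c];
                const float x1 = data[r*n_cols + c + 1];
                data[r*n_cols + c]     = x0*std::cos(theta) - x1*std::sin(theta);
                data[r*n_cols + c + 1] = x0*std::sin(theta) + x1*std::cos(theta);
            }
        }
    };
    std::vector<std::thread> pool;
    for (int t = 0; t < n_threads; ++t) pool.emplace_back(worker, t);
    for (auto& th : pool) th.join();
}
```

Because rows are independent, no synchronization beyond the final join is needed, which is why per-row ops like RoPE parallelize almost linearly.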
ggml, llama : avoid heavy V transpose + improvements (ggml-org#775)
ggml:
- added ggml_view_3d()
- ggml_view_tensor() now inherits the stride too
- reimplement ggml_cpy() to account for dst stride
- no longer require tensor->data to be memory aligned
llama:
- compute RoPE on 32-bit tensors (should be more accurate)
- store RoPE-ed K in the KV cache
- store transposed V in the KV cache (significant speed-up)
- avoid unnecessary Q copy
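Why storing V transposed speeds things up can be seen in a small sketch (illustrative only, not ggml code): in the attention output, each output dimension sums over all cached tokens, so if V is stored transposed ([n_embd][n_tokens]), that inner loop walks a contiguous run of floats instead of striding by n_embd per token.

```cpp
#include <cmath>
#include <vector>

// out[d] = sum_t p[t] * V[t][d], computed from a transposed V.
// With v_t laid out as [n_embd][n_tokens], the inner loop over t reads
// one contiguous row per output dimension -- cache- and SIMD-friendly.
std::vector<float> attn_out_vt(const std::vector<float>& p,    // [n_tokens] weights
                               const std::vector<float>& v_t,  // [n_embd][n_tokens]
                               int n_tokens, int n_embd) {
    std::vector<float> out(n_embd, 0.0f);
    for (int d = 0; d < n_embd; ++d) {
        const float* row = &v_t[d*n_tokens];  // contiguous run of n_tokens floats
        float sum = 0.0f;
        for (int t = 0; t < n_tokens; ++t) {
            sum += p[t]*row[t];
        }
        out[d] = sum;
    }
    return out;
}
```

Keeping V in this layout in the KV cache means the transpose is paid once at store time instead of on every attention evaluation.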
llama : define non-positive top_k; top_k range check (ggml-org#779)
- Define non-positive top_k; top_k range check
- minor : brackets
Co-authored-by: Georgi Gerganov <[email protected]>
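A hedged sketch of the clamping behavior this change introduces: a non-positive top_k is taken to mean "consider the whole vocabulary", and top_k is range-checked against the number of candidates (the function name here is illustrative, not llama.cpp's):

```cpp
#include <algorithm>

// Resolve a user-supplied top_k against the vocabulary size.
// top_k <= 0  -> keep every token (full vocabulary)
// top_k > n   -> clamped to n (range check)
int effective_top_k(int top_k, int n_vocab) {
    if (top_k <= 0) {
        return n_vocab;
    }
    return std::min(top_k, n_vocab);
}
```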
make : missing host optimizations in CXXFLAGS (ggml-org#763)
Define non-positive temperature behavior (ggml-org#720)
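A minimal sketch of one common convention for this, assumed here (names illustrative): a temperature <= 0 disables random sampling and deterministically picks the most likely token.

```cpp
#include <algorithm>
#include <iterator>
#include <vector>

// temp <= 0 -> greedy argmax over the logits (deterministic).
// temp > 0  -> logits would be scaled by 1/temp and then sampled
//              stochastically; that path is stubbed out in this sketch
//              and signalled with -1.
int sample_token(const std::vector<float>& logits, float temp) {
    if (temp <= 0.0f) {
        return (int)std::distance(
            logits.begin(),
            std::max_element(logits.begin(), logits.end()));
    }
    return -1;  // stochastic sampling omitted from this sketch
}
```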
10+% performance improvement of ggml_vec_dot_q4_0 on AVX2 (ggml-org#654)
- Performance improvement of AVX2 code
- Fixed problem with MSVC compiler
- Reviewer comments: removed double semicolon, deleted empty line 1962
Windows: reactivate SIGINT handler after each Ctrl-C (ggml-org#736)
Added API for getting/setting the kv_cache (ggml-org#685) The API provides access methods for retrieving the current memory buffer for the kv_cache and its token count. It also contains a method for setting the kv_cache from a memory buffer. This makes it possible to load/save history - maybe support a --cache-prompt parameter as well? Co-authored-by: Pavol Rusnak <[email protected]>
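A hypothetical sketch of the idea behind such an API (the real functions live in llama.cpp's C API; this struct and these names are illustrative only): expose the raw cache bytes plus the token count, so a caller can snapshot the state and restore it later to reload a conversation.

```cpp
#include <vector>

// Illustrative stand-in for the model's KV cache state.
struct KvCache {
    std::vector<unsigned char> buf;  // raw cache memory
    int n_tokens = 0;                // tokens currently stored
};

// "get": copy the cache out into a caller-owned snapshot.
KvCache get_kv_cache(const KvCache& cache) {
    return cache;
}

// "set": restore the cache from a previously taken snapshot.
void set_kv_cache(KvCache& cache, const KvCache& snapshot) {
    cache = snapshot;
}
```

Usage is the save/load-history pattern the commit describes: take a snapshot after processing a prompt, and restore it later instead of re-evaluating the prompt.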
make : use -march=native -mtune=native on x86 (ggml-org#609)