Skip to content

Tags: tslmy/llama.cpp

Tags

master-46088f7

Toggle master-46088f7's commit message
ggml : fix build with OpenBLAS (close ggml-org#2066)

master-79f634a

Toggle master-79f634a's commit message

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
embd-input : fix returning ptr to temporary

master-2f8cd97

Toggle master-2f8cd97's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
metal : release buffers when freeing metal context (ggml-org#2062)

master-0bc2cdf

Toggle master-0bc2cdf's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Better CUDA synchronization logic (ggml-org#2057)

master-b8c8dda

Toggle master-b8c8dda's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Use unsigned for random seed (ggml-org#2006)

* Use unsigned for random seed. Keep -1 as the value to use a time based seed.

Co-authored-by: Georgi Gerganov <[email protected]>

master-d3494bb

Toggle master-d3494bb's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
llama : replacing auto &kv with const auto &kv (ggml-org#2041)

* Replacing auto &kv with const auto &kv

* Create codacy.yml

* Delete codacy.yml

master-b922bc3

Toggle master-b922bc3's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
llama : remove shards weight file support (ggml-org#2000)

* Remove multiple shards

* Remove multiple file loaders

* Remove llama_load_tensor_shard class

* Simplify load logic

* Remove dead code guess_n_parts function

* Remove vocab_only from constructor of llama_model_loader

* Remove alignment_prevents_mmap which is not more needed.

* Remove useless check

master-6432aab

Toggle master-6432aab's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
cuda : fix missing const qualifier in casts (ggml-org#2027)

master-7f9753f

Toggle master-7f9753f's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
CUDA GPU acceleration for LoRAs + f16 models (ggml-org#1970)

master-5b351e9

Toggle master-5b351e9's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
cuda : remove nchannels_x argument from mul_mat_vec_nc_f16_f32 (ggml-…

…org#2028)

- Not used