Tags: tmm1/llama.cpp

master-d3494bb

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
llama : replacing auto &kv with const auto &kv (ggml-org#2041)

* Replacing auto &kv with const auto &kv

* Create codacy.yml

* Delete codacy.yml
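
The `const auto &kv` change above can be illustrated with a minimal sketch. The function and variable names here (`total_name_bytes`, `counts`) are hypothetical and do not come from llama.cpp; the point is only that, when iterating a non-const container, `const auto &` avoids copying each element and turns an accidental write through the loop variable into a compile error.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical helper, not taken from llama.cpp: groups names and sums the
// number of bytes contributed by each distinct name.
inline std::size_t total_name_bytes(const std::vector<std::string> & names) {
    std::map<std::string, std::size_t> counts; // non-const local container

    for (const auto & name : names) {
        counts[name] += name.size();
    }

    std::size_t total = 0;
    // With plain `auto & kv`, kv.second would be writable here because
    // `counts` is non-const. `const auto & kv` states that the loop only
    // reads, and the compiler rejects any write such as `kv.second = 0`.
    for (const auto & kv : counts) {
        total += kv.second;
    }
    return total;
}
```

The behavior of the loop is unchanged by the qualifier; it only tightens what the compiler will accept inside the loop body.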

master-b922bc3

llama : remove shards weight file support (ggml-org#2000)

* Remove multiple shards

* Remove multiple file loaders

* Remove llama_load_tensor_shard class

* Simplify load logic

* Remove dead code guess_n_parts function

* Remove vocab_only from constructor of llama_model_loader

* Remove alignment_prevents_mmap, which is no longer needed.

* Remove useless check

master-6432aab

cuda : fix missing const qualifier in casts (ggml-org#2027)

master-7f9753f

CUDA GPU acceleration for LoRAs + f16 models (ggml-org#1970)

master-5b351e9

cuda : remove nchannels_x argument from mul_mat_vec_nc_f16_f32 (ggml-org#2028)

- Not used

master-9d23589

fix pthreads setaffinity usage on android (ggml-org#2020)

master-0be54f7

baby-llama : fix build after ggml_rope change (ggml-org#2016)

master-eaa6ca5

ggml : increase max tensor name + clean up compiler warnings in train-text (ggml-org#1988)

* Clean up compiler warnings in train-text

Some brackets to disambiguate order of operations

* Increase GGML_MAX_NAME

Avoiding strncpy danger in train-text-from-scratch and reducing potential future name length issues
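
The `strncpy` danger mentioned above is that `strncpy` does not write a terminating NUL when the source is at least as long as the destination buffer. A minimal sketch of one common way to avoid it follows. The struct `named_item`, the function `set_name`, and the buffer size `kMaxName` are hypothetical stand-ins for illustration; they are not the actual ggml definitions, and this is not necessarily how the commit itself addressed the issue.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <cstring>

// Hypothetical buffer size for illustration; not the real GGML_MAX_NAME value.
constexpr std::size_t kMaxName = 16;

// Hypothetical stand-in for a structure that carries a fixed-size name.
struct named_item {
    char name[kMaxName];
};

// Copies src into item.name, truncating if necessary.
// Unlike strncpy, snprintf always NUL-terminates when the size is nonzero,
// so item.name is a valid C string even for over-long input.
inline void set_name(named_item & item, const char * src) {
    assert(src != nullptr);
    std::snprintf(item.name, sizeof(item.name), "%s", src);
}
```

With this approach an over-long name is silently truncated to `kMaxName - 1` characters, which is why enlarging the buffer also reduces how often truncation occurs.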

master-cbebf61

Fix assert when freeing an invalid CUDA pointer (ggml-org#2005)

Fix the assert by always initializing the extra structure.
CUDA error 1 at C:\GPT\llama.cpp\ggml-cuda.cu:2536: invalid argument

master-c824d2e

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
ggml : avoid conv 2d kernel round up