Skip to content

Tags: shibe2/llama.cpp

Tags

b1634

Toggle b1634's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
server : fix handling of characters that span multiple tokens when st…

…reaming (ggml-org#4446)

b1631

Toggle b1631's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
common : add `--version` option to show build info in CLI (ggml-org#4433

)

b1539

Toggle b1539's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
gguf-py : export chat templates (ggml-org#4125)

* gguf-py : export chat templates

* llama.cpp : escape new lines in gguf kv info prints

* gguf-py : bump version

* gguf-py : check chat_template type

* gguf-py : initialize chat_template

b1441

Toggle b1441's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
flake : update flake.lock for newer transformers version + provide ex…

…tra dev shell (ggml-org#3797)

* flake : update flake.lock for newer transformers version + provide extra dev shell with torch and transformers (for most convert-xxx.py scripts)

b1407

Toggle b1407's commit message
CLBlast: Add outer loops over src0 for broadcasting in mulmat

Reduce repeated dequantization of the same data.

b1401

Toggle b1401's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
multimodal : add BakLLaVA conversion support (ggml-org#3682)

b1400

Toggle b1400's commit message
CLBlast: Add outer loops over src0 for broadcasting in mulmat

Reduce repeated dequantization of the same data.

b1398

Toggle b1398's commit message

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
speculative : bug fixes

b1395

Toggle b1395's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
opencl : fix element-wise multiplication (ggml-org#3656)

b1391

Toggle b1391's commit message
CLBlast: Fix temporary buffer size for f16 conversion (wsize)

Fix buffer overflow.
Reduce the size to fit just one 2D slice.
Assert sufficient size.