Release notes from llama.cpp
Feed: https://github.com/jeromew/llama.cpp/releases (updated 2026-04-02T09:29:11Z)

- b8635 (2026-04-02T09:29:11Z, pwilkin): Relax prefill parser to allow space (#21240)
  - Relax prefill parser to allow space
  - Move changes from prefix() to parser generation
  - Only allow spaces when the next parser is not a pure content parser
- b8634 (2026-04-02T09:28:56Z, jesus-talavera-ibm): chat : add Granite 4.0 chat template with correct tool_call role mapp…
- b8631 (2026-04-02T07:39:00Z, ggerganov): sync : ggml
- b8629 (2026-04-02T07:08:32Z, arthw): sycl : fix llama_kv_cache hang when kv_cache is huge: 5GB (ggml-org#21283: https://github.com/ggml-org/llama.cpp/pull/21283)
- b8628 (2026-04-02T00:44:02Z, tboinovski1): hexagon : add cumsum op support (#21246)
  - hexagon : add cumsum op support
  - hexagon : enable DMA for cumsum op
  - Fix line endings
  - Co-authored-by: Max Krasnyansky <[email protected]>
- b8626 (2026-04-01T19:54:58Z, lhez): opencl: fix leak in Adreno q8_0 path (ggml-org#21212: https://github.com/ggml-org/llama.cpp/pull/21212)
- b8625 (2026-04-01T19:32:15Z, allozaur): server: Bypass API Key validation for WebUI static bundle assets (ggml-org#21…: https://github.com/ggml-org/llama.cpp/pull/21)
- b8624 (2026-04-01T19:28:19Z, JohannesGaessler): CUDA: fix FA kernel selection logic (ggml-org#21271: https://github.com/ggml-org/llama.cpp/pull/21271)
- b8611 (2026-04-01T08:10:25Z, ggerganov): ggml : fix RWKV ops thread assignment (ggml-org#21226: https://github.com/ggml-org/llama.cpp/pull/21226)
- b8610 (2026-04-01T08:10:03Z, taimur-10x): ggml-cpu: fix fallback for RVV kernels without zvfh (#21157)
  - ggml-cpu: refactor sgemm; fix RVV checks
  - ggml-cpu: refactor RVV kernels; set zvfbfwma default to off