Tags · anthony-maio/llama.cpp

This repository was archived by the owner on Apr 14, 2026. It is now read-only.

b8514

Merge branch 'ggml-org:master' into master

Mar 24, 2026
46dba9b
zip
tar.gz

b8497

docs: add dotsocr support communication drafts

Mar 23, 2026
ccdd1e5
zip
tar.gz

b8150

add dots.ocr (DotsOCRForCausalLM) GGUF conversion and inference support

Add full support for rednote-hilab/dots.ocr, a Qwen2 text backbone (1.7B)
with a modified Qwen2-VL vision encoder (1.2B, 42 layers).

Key architectural differences from Qwen2-VL:
Text model uses standard Qwen2 with 1D RoPE (not M-RoPE
Vision encoder uses RMSNorm (not LayerNorm), SiLU gated MLP,
  Conv2D patch embedding, no attention bias, no window attention
2D M-RoPE internal to vision only with sections [d/4, d/4, 0, 0]
Post-trunk RMSNorm before merger (unique to dots.ocr)
Distinct chat template: <|user|>...<|endofuser|> and
also Address PR Feedback

Feb 25, 2026
6202304
zip
tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

b8514

b8497

b8150

Tags: anthony-maio/llama.cpp

b8514

b8497

b8150