Skip to content
This repository was archived by the owner on Apr 14, 2026. It is now read-only.

Tags: anthony-maio/llama.cpp

Tags

b8514

Toggle b8514's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Merge branch 'ggml-org:master' into master

b8497

Toggle b8497's commit message
docs: add dotsocr support communication drafts

b8150

Toggle b8150's commit message
add dots.ocr (DotsOCRForCausalLM) GGUF conversion and inference support

Add full support for rednote-hilab/dots.ocr, a Qwen2 text backbone (1.7B)
with a modified Qwen2-VL vision encoder (1.2B, 42 layers).

Key architectural differences from Qwen2-VL:
Text model uses standard Qwen2 with 1D RoPE (not M-RoPE
Vision encoder uses RMSNorm (not LayerNorm), SiLU gated MLP,
  Conv2D patch embedding, no attention bias, no window attention
2D M-RoPE internal to vision only with sections [d/4, d/4, 0, 0]
Post-trunk RMSNorm before merger (unique to dots.ocr)
Distinct chat template: <|user|>...<|endofuser|> and
also Address PR Feedback