Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: ggml-org/llama.cpp
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: master
Choose a base ref
...
head repository: hellc/llama.cpp
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: master
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 2 commits
  • 3 files changed
  • 1 contributor

Commits on Feb 28, 2026

  1. feat: add native support for PPLX (Perplexity) Qwen3 embedding archit…

    …ecture
    
    This adds bidirectional attention (non-causal) support and pooling type loading for Qwen3-based embedding models from Perplexity AI.
    hellc committed Feb 28, 2026
    Configuration menu
    Copy the full SHA
    b0cb0ac View commit details
    Browse the repository at this point in the history
  2. feat: bit-perfect parity for Perplexity PPLXQwen3Model

    - Fix: disable causal mask for bidirectional attention in QWEN3 arch
    - Fix: inject ggml_l2_norm after MEAN pooling for bit-perfect similarity
    - Fix: export correct attention pooling metadata in conversion script
    hellc committed Feb 28, 2026
    Configuration menu
    Copy the full SHA
    e64249b View commit details
    Browse the repository at this point in the history
Loading