
Comparing changes

base repository: githubbadguy/llama-cpp-python
base: main
...
head repository: abetlen/llama-cpp-python
compare: main
  • 7 commits
  • 29 files changed
  • 2 contributors

Commits on Mar 22, 2026

  1. fix(ci): Rename huggingface-cli to hf (abetlen#2149)

    * Fix model download in test workflow
    
    * Use hf CLI in test workflow
    
    * Use hf CLI name in CI and docs
    
    * Reference PR in changelog
    abetlen authored Mar 22, 2026
    ca3b00a
  2. fix(ci): Fix macos tests, support both Intel and Apple Silicon testing (abetlen#2150)
    
    * fix(ci): use supported macos runner label
    
    * fix(ci): add apple silicon macos test coverage
    
    * fix(ci): run standard macos tests on apple silicon
    
    * fix(ci): simplify apple silicon macos install
    
    * fix(ci): disable ggml native on apple silicon runner
    
    * docs: update changelog for macos ci runner fix
    abetlen authored Mar 22, 2026
    9f661ff
  3. misc: Add Ruff formatting (abetlen#2148)

    * Add Ruff formatting and safe lint baseline
    
    * Update changelog for Ruff setup
    abetlen authored Mar 22, 2026
    a9b4a06

Commits on Mar 23, 2026

  1. feat: Update llama.cpp to ggml-org/llama.cpp@49bfdde (abetlen#2151)

    * Update llama.cpp and sync bindings
    
    * Clean up binding compatibility shims
    
    * Remove flash attention property shim
    
    * Remove mtmd verbosity shim
    
    * Add docstrings for new bindings
    
    * Format Ruff files and add changelog entry
    abetlen authored Mar 23, 2026
    18aa31e
  2. ci: add riscv64 wheel builds to release workflow (abetlen#2139)

    * ci: add riscv64 wheel builds to release workflow
    
    Add a build_wheels_riscv64 job mirroring the existing arm64 QEMU-based
    build. Uses cibuildwheel with QEMU emulation for linux/riscv64, targeting
    CPython 3.10-3.14 on manylinux.
    
    Closes abetlen#2138
    
    * ci: use cibuildwheel 3.1.2 for riscv64 wheels
    
    * docs: update changelog for riscv64 wheel PR
    
    ---------
    
    Co-authored-by: abetlen <[email protected]>
    gounthar and abetlen authored Mar 23, 2026
    e1f8ac0
  3. fix: Qwen 3.5 support (abetlen#2152)

    * fix: handle Qwen 3.5 hybrid prefix reuse
    
    * test: fix Qwen runtime unit mocks
    
    * test: drop Qwen runtime unit tests
    
    * docs: credit Qwen fix contributors in changelog
    
    * docs/tests: update default Qwen model to 3.5 0.8B
    
    * test: rebaseline Qwen 3.5 outputs
    
    * test: stabilize low-level Qwen sampling check
    
    * test: tighten Qwen 3.5 completion prompts
    abetlen authored Mar 23, 2026
    11e7a55
  4. a6b1807