Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: sakharovaan/llama-cpp-python
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: main
Choose a base ref
...
head repository: JamePeng/llama-cpp-python
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: main
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 16 commits
  • 51 files changed
  • 1 contributor

Commits on Apr 13, 2026

  1. Configuration menu
    Copy the full SHA
    4ec15ac View commit details
    Browse the repository at this point in the history
  2. docs: add audio processing recommendation to Gemma4ChatHandler

    - Recommend BF16 mmproj for Gemma4 E2B and E4B models.
    - Note known degraded audio performance with other quantizations.
    - Add reference link to the relevant llama.cpp PR/issue comment.
    
    Signed-off-by: JamePeng <[email protected]>
    JamePeng committed Apr 13, 2026
    Configuration menu
    Copy the full SHA
    5e6529e View commit details
    Browse the repository at this point in the history

Commits on Apr 14, 2026

  1. Configuration menu
    Copy the full SHA
    e70304b View commit details
    Browse the repository at this point in the history
  2. docs: add comprehensive omni multimodal example for Gemma-4

    - Wrapped the existing Qwen3-VL image loading example in a `<details>` block to improve README readability and save vertical space.
    - Introduced a complete, production-ready "Omni MultiModal" example demonstrating simultaneous Vision and Audio processing using the `Gemma4ChatHandler`.
    - Added a universal `build_media_payload` helper function to dynamically route and encode local files into OpenAI-compatible `image_url` and `input_audio` payload structures.
    - Added crucial documentation clarifying multimodal capability differences across Gemma-4 variants (E2B/E4B supporting full audio/vision vs. 31B/26BA4B supporting vision only).
    
    Signed-off-by: JamePeng <[email protected]>
    JamePeng committed Apr 14, 2026
    Configuration menu
    Copy the full SHA
    7115772 View commit details
    Browse the repository at this point in the history

Commits on Apr 15, 2026

  1. Configuration menu
    Copy the full SHA
    701195e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9d2b2cb View commit details
    Browse the repository at this point in the history

Commits on Apr 16, 2026

  1. Configuration menu
    Copy the full SHA
    3ee7ff8 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9e00017 View commit details
    Browse the repository at this point in the history
  3. feat: enhance Qwen35ChatHandler with preserve_thinking and Qwen3.6 te…

    …mplate fixes
    
    - Add `preserve_thinking` parameter to optionally retain `<think>` reasoning
      blocks across all historical conversational turns (defaults to False to save tokens).
    - Improve template robustness by adding an `is defined` safety check for `enable_thinking`.
    - Simplify JSON serialization logic for tool call arguments in the Jinja template.
    - Update class docstring to explicitly indicate support for Qwen 3.5 and Qwen 3.6 models.
    - Include `preserve_thinking` state in verbose processing logs.
    JamePeng committed Apr 16, 2026
    Configuration menu
    Copy the full SHA
    7820677 View commit details
    Browse the repository at this point in the history
  4. Bump version to 0.3.36

    Signed-off-by: JamePeng <[email protected]>
    JamePeng committed Apr 16, 2026
    Configuration menu
    Copy the full SHA
    b97cb63 View commit details
    Browse the repository at this point in the history

Commits on Apr 17, 2026

  1. Configuration menu
    Copy the full SHA
    7a19575 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    33ce052 View commit details
    Browse the repository at this point in the history

Commits on Apr 19, 2026

  1. Configuration menu
    Copy the full SHA
    3984ab5 View commit details
    Browse the repository at this point in the history
  2. docs: initialize LLM Wiki structure for better documentation maintenance

    - Create docs/wiki/ directory with full folder structure
    - Add SCHEMA.md, index.md and contributing guidelines
    - Set up core/, features/, examples/, types/ and subdirectories
    - Prepare for LLM-powered living documentation (Llama class, multi-modal chat handlers, vision/audio examples, etc.)
    - Include .gitkeep files to preserve empty directories
    
    This lays the foundation for a modern, maintainable wiki that will replace outdated static docs.
    Future commits will populate pages with up-to-date content generated from latest source code.
    
    Signed-off-by: JamePeng <[email protected]>
    JamePeng committed Apr 19, 2026
    Configuration menu
    Copy the full SHA
    15f8a36 View commit details
    Browse the repository at this point in the history

Commits on Apr 20, 2026

  1. Configuration menu
    Copy the full SHA
    e3b7ad6 View commit details
    Browse the repository at this point in the history

Commits on Apr 21, 2026

  1. Configuration menu
    Copy the full SHA
    8625836 View commit details
    Browse the repository at this point in the history
Loading