Tags: homer6/llama.cpp
llama : add model type detection for rwkv7 7B&14B (ggml-org#14816) Signed-off-by: Molly Sophia <[email protected]>
imatrix: add option to display importance score statistics for a given imatrix file (ggml-org#12718) * Add --show-statistics option * Add --show-statistics logic * Add tensor name parsing * Tidy output format * Fix typo in title * Improve tensor influence ranking * Add better statistics * Change statistics' sort order * Add Cosine Similarity * Add header search path * Change header search path to private * Add weighted statistics per layer * Update report title * Refactor compute_statistics out of main * Refactor compute_cossim out of load_imatrix * Refactor compute_statistics out of load_imatrix * Move imatrix statistics calculation into its own functions * Add checks and validations * Remove unnecessary include directory * Rename labels * Add m_stats getter and refactor compute_statistics out of load_imatrix * Refactor variable names * Minor cosmetic change * Retrigger checks (empty commit) * Rerun checks (empty commit) * Fix unnecessary type promotion Co-authored-by: compilade <[email protected]> * Reverting change to improve code readability * Rerun checks (empty commit) * Rerun checks (empty commit) * Rerun checks - third time's the Charm 🤞 (empty commit) * Minor cosmetic change * Update README * Fix typo * Update README * Rerun checks (empty commit) * Re-implement changes on top of ggml-org#9400 * Update README.md * Update README * Update README.md Co-authored-by: compilade <[email protected]> * Update README.md Co-authored-by: compilade <[email protected]> * Update README.md * Remove duplicate option in print_usage() * Update README.md * Update README.md Co-authored-by: compilade <[email protected]> * Update README.md Co-authored-by: compilade <[email protected]> * Remove input check * Remove commented out code --------- Co-authored-by: compilade <[email protected]>
Mtmd: add a way to select device for vision encoder (ggml-org#14236) * Mtmd: add a way to select device for vision encoder * simplify * format * Warn user if manual device selection failed * initialize backend to nullptr
cuda : implement bf16 cpy ops and enable bf16 cont (ggml-org#14763) * implement bf16 cpy ops and enable bf16 cont * deduplicate copy functions * deduplicate checks
server : allow setting `--reverse-prompt` arg (ggml-org#14799) Signed-off-by: Molly Sophia <[email protected]>
cuda: remove linking to cublasLt (ggml-org#14790) Signed-off-by: Xiaodong Ye <[email protected]>
opencl: add conv2d kernel (ggml-org#14403) * add conv2d kernel * fix trailing whitespace * whitespace fixe * handle f16 input and f16 kernel, more opt * resolve conflicts * use enqueue_ndrange_kernel
kleidiai: add support for get_rows (ggml-org#14676) * kleidiai: add support for get_rows * apply fixes based on code review * apply more fixes based on code review