
build: disable soname to reduce binary size#2177

Open
Bing-su wants to merge 1 commit into abetlen:main from Bing-su:fix/no-soname

Conversation


@Bing-su Bing-su commented Apr 9, 2026

Disable soname to reduce binary size.

As explained in PEP 778, the wheel format currently does not handle symbolic links: each symlink is replaced by a full copy of its target file. This causes the llama-cpp-python wheel to become larger than the original build output.

Setting the NO_SONAME flag prevents the creation of these symbolic links, which resolves the issue.
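As a minimal sketch of the mechanism: `NO_SONAME` is a standard CMake target property. The target name `llama` below is illustrative; the actual targets and where the property is applied in this PR may differ.

```cmake
# Hedged sketch: with a SONAME, CMake emits a versioned library plus
# symlinks (libllama.so -> libllama.so.1). Disabling the SONAME makes
# the build produce a single real file per library, so the wheel has
# no symlinks to materialize into duplicate copies.
set_target_properties(llama PROPERTIES NO_SONAME TRUE)
```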


https://github.com/Bing-su/llama-cpp-python/actions/runs/24200306300

Please also check the built files.

wheels-ubuntu-22.04.zip
wheels-windows-2022.zip

  • It looks like macOS requires a different configuration. However, I don't have an Apple machine to test it on...
    wheels-macos-15.zip

Comment on lines +46 to +47
RUNTIME_OUTPUT_DIRECTORY ${CMAKE_BINARY_DIR}
LIBRARY_OUTPUT_DIRECTORY ${CMAKE_BINARY_DIR}
Author

Without these two settings, the libraries referenced files under a bin/ folder that does not exist in the wheel layout:

❯ readelf -d dist/llama_cpp_python-0.3.20-py3-none-linux_x86_64/llama_cpp/lib/libggml.so

Dynamic section at offset 0xbd98 contains 29 entries:
  Tag        Type                         Name/Value
 0x0000000000000001 (NEEDED)             Shared library: [bin/libggml-cpu.so]
 0x0000000000000001 (NEEDED)             Shared library: [bin/libggml-base.so]
 0x0000000000000001 (NEEDED)             Shared library: [libstdc++.so.6]
 0x0000000000000001 (NEEDED)             Shared library: [libgcc_s.so.1]
 0x0000000000000001 (NEEDED)             Shared library: [libc.so.6]
 0x000000000000001d (RUNPATH)            Library runpath: [$ORIGIN]


❯ readelf -d dist/llama_cpp_python-0.3.20-py3-none-linux_x86_64/llama_cpp/lib/libllama.so

Dynamic section at offset 0x2dc128 contains 31 entries:
  Tag        Type                         Name/Value
 0x0000000000000001 (NEEDED)             Shared library: [bin/libggml.so]
 0x0000000000000001 (NEEDED)             Shared library: [bin/libggml-base.so]
 0x0000000000000001 (NEEDED)             Shared library: [libstdc++.so.6]
 0x0000000000000001 (NEEDED)             Shared library: [libm.so.6]
 0x0000000000000001 (NEEDED)             Shared library: [libgcc_s.so.1]
 0x0000000000000001 (NEEDED)             Shared library: [libc.so.6]
 0x0000000000000001 (NEEDED)             Shared library: [ld-linux-x86-64.so.2]
 0x000000000000001d (RUNPATH)            Library runpath: [$ORIGIN]

https://github.com/ggml-org/llama.cpp/blob/d6f3030047f85a98b009189e76f441fe818ea44d/CMakeLists.txt#L20-L21

So I overrode those settings from llama.cpp here.
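A sketch of what such an override can look like, assuming targets named `ggml` and `llama` (the actual target names and placement in this PR may differ):

```cmake
# Hedged sketch: place runtime and library artifacts directly in the
# build root instead of llama.cpp's default bin/ subdirectory. With
# NO_SONAME, the linker records the path it linked against, so keeping
# all libraries side by side avoids NEEDED entries like
# [bin/libggml.so] and lets the $ORIGIN runpath resolve them.
set_target_properties(ggml llama PROPERTIES
    RUNTIME_OUTPUT_DIRECTORY ${CMAKE_BINARY_DIR}
    LIBRARY_OUTPUT_DIRECTORY ${CMAKE_BINARY_DIR})
```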
