This is a Python package for writing binary files in the GGUF (GGML Universal File) format.
See convert-llama-hf-to-gguf.py for an example of its usage.
pip install gguf

Maintainers who participate in development of this package are advised to install it in editable mode:
cd /path/to/llama.cpp/gguf-py
pip install --editable .

Note: This may require upgrading your pip installation; the symptom is a message saying that editable installation currently requires setup.py.
In this case, upgrade pip to the latest version:
pip install --upgrade pip

There's a GitHub workflow to make a release automatically upon creation of tags in a specified format.
- Bump the version in pyproject.toml.
- Create a tag named gguf-vx.x.x, where x.x.x is the semantic version number.
  git tag -a gguf-v1.0.0 -m "Version 1.0 release"
- Push the tags.
  git push origin --tags

If you want to publish the package manually for any reason, you need to have twine and build installed:
pip install build twine

Then, follow these steps to release a new version:
- Bump the version in pyproject.toml.
- Build the package:
  python -m build
- Upload the generated distribution archives:
  python -m twine upload dist/*

- Add tests
- Include conversion scripts as command line entry points in this package.
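
The tag naming rule from the automatic release steps above (gguf-vx.x.x) is easy to get wrong by hand, so a small pre-tag check may help. The following is a minimal sketch, not part of this package: the helper names `version_from_tag` and `tag_matches_version` are hypothetical, and reading "x.x.x" as exactly three numeric parts is an assumption about what the workflow accepts.

```python
import re

# Assumed tag shape: "gguf-v" followed by MAJOR.MINOR.PATCH.
# The README only says "gguf-vx.x.x"; three numeric parts is an assumption.
TAG_PATTERN = re.compile(r"gguf-v(\d+)\.(\d+)\.(\d+)")


def version_from_tag(tag):
    """Return 'X.Y.Z' if tag looks like a release tag, otherwise None."""
    match = TAG_PATTERN.fullmatch(tag)
    if match is None:
        return None
    return ".".join(match.groups())


def tag_matches_version(tag, pyproject_version):
    """Check that the tag carries the same version that pyproject.toml declares."""
    return version_from_tag(tag) == pyproject_version


if __name__ == "__main__":
    print(version_from_tag("gguf-v1.0.0"))  # prints 1.0.0
    print(version_from_tag("v1.0.0"))       # prints None
```

Calling `tag_matches_version("gguf-v1.0.0", "1.0.0")` before running git tag would catch a tag that was created without bumping pyproject.toml first.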