Tags · embedl/flash-head

v0.1.9

Fix handling of lm head for Qwen3.5

Apr 21, 2026
40b146b
zip
tar.gz
Notes
Downloads

v0.1.8

Add FlashHeadQwen3_5 support

Apr 21, 2026
5e5704e
zip
tar.gz
Notes
Downloads

v0.1.7

Simplify loading, remove duplicate indices calc, fix prefill path

- Remove clustering_config.json validation from _get_centroids (rely on
  safetensors contents directly)
- Auto-detect n_clusters from centroids tensor shape instead of requiring
  it as a parameter
- Infer vocab_size/hidden_size from weight shape instead of config metadata
- Return indices from _get_cluster_logits to avoid recomputing them in
  get_next_token (removes duplicate index_select + flatten + unique)
- Fix prefill regression: only use FlashHead for single-token decode
  (shape[0] == 1); let vLLM handle prefill natively via compiled path
- Fix sampling softmax to slice [:, -1, :] before temperature scaling

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 14, 2026
77c36af
zip
tar.gz
Notes
Downloads

v0.1.6

Remove duplicate indices calc

Apr 14, 2026
69b4d8f
zip
tar.gz
Notes
Downloads

v0.1.5

Bump version to 0.1.5

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 14, 2026
4775805
zip
tar.gz
Notes
Downloads

v0.1.4

Bump version to 0.1.4

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 9, 2026
9ae0da0
zip
tar.gz
Notes
Downloads

v0.1.3

Bump version to 0.1.3, add HF collection link, remove Homepage URL

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 9, 2026
c9ad5df
zip
tar.gz
Notes
Downloads

v0.1.2

Bump version to 0.1.2, fix README images for PyPI

Use absolute URLs for images so they render on PyPI.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 9, 2026
bd3dd35
zip
tar.gz
Notes
Downloads

v0.1.1

Bump version to 0.1.1, add PyPI metadata and landing page

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 9, 2026
8c22613
zip
tar.gz
Notes
Downloads

v0.1.0

Fix pypi-publish permissions: add contents read for release download

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>

Apr 9, 2026
588f4d1
zip
tar.gz
Notes
Downloads

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.1.9

v0.1.8

v0.1.7

v0.1.6

v0.1.5

v0.1.4

v0.1.3

v0.1.2

v0.1.1

v0.1.0

Tags: embedl/flash-head