A Faster PyTorch Implementation of Multi-Head Self-Attention
Updated May 27, 2022 - Jupyter Notebook
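The core operation behind the repository above can be sketched as a single fused Q/K/V projection followed by scaled dot-product attention per head. This is an illustrative minimal implementation, not the repository's actual code; the function and weight names (`multi_head_self_attention`, `w_qkv`, `w_out`) are assumptions for the sketch.

```python
import torch
import torch.nn.functional as F

def multi_head_self_attention(x, w_qkv, w_out, n_heads):
    """Minimal batched multi-head self-attention sketch.

    x:     (batch, seq, d_model) input
    w_qkv: (d_model, 3 * d_model) fused Q/K/V projection
    w_out: (d_model, d_model) output projection
    """
    b, s, d = x.shape
    d_head = d // n_heads
    # One fused matmul for Q, K, V is the usual speed trick.
    qkv = x @ w_qkv                      # (b, s, 3 * d)
    q, k, v = qkv.chunk(3, dim=-1)
    # Split the model dimension into heads: (b, heads, seq, d_head).
    split = lambda t: t.view(b, s, n_heads, d_head).transpose(1, 2)
    q, k, v = split(q), split(k), split(v)
    scores = q @ k.transpose(-2, -1) / d_head ** 0.5   # (b, h, s, s)
    attn = F.softmax(scores, dim=-1)
    # Merge heads back and apply the output projection.
    out = (attn @ v).transpose(1, 2).reshape(b, s, d)
    return out @ w_out

torch.manual_seed(0)
x = torch.randn(2, 5, 16)
w_qkv = torch.randn(16, 48) / 16 ** 0.5
w_out = torch.randn(16, 16) / 16 ** 0.5
y = multi_head_self_attention(x, w_qkv, w_out, n_heads=4)
print(tuple(y.shape))  # (2, 5, 16)
```

In practice the fused projection and the head reshape are where most of the speed difference between naive and fast implementations comes from, since they replace three separate matmuls and per-head Python loops with two large batched operations.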
Official PyTorch Implementation of 'Entropy-Guided Attention for Private LLMs' (PPAI Workshop, AAAI 2025)
[IROS 2024] Language-driven Grasp Detection with Mask-guided Attention
CLI toolkit that ingests qk-sniffer dumps, measures per-head positional predictability and attention plasticity, and exports CSV stats plus ready-to-share plots.
This work proposes STAC, a novel framework for weakly supervised defect localization that leverages saliency-guided transformer attention and pixel-level contrastive learning to achieve precise defect maps using only image-level labels.
Investigating how formal constraints reorganize the internal routing geometry of Transformer attention graphs across model families.
Train your attention like a transformer trains its weights. Selective, sustained & N-back exercises grounded in the Q/K/V attention framework.
HopfieldLab — Interactive Modern Hopfield Network & Associative Memory Laboratory with 3D energy landscapes, attention-Hopfield bridge, phase transitions, and memory interference visualization
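The attention-Hopfield bridge mentioned above rests on the result that the modern Hopfield retrieval update is exactly softmax attention with the stored patterns serving as both keys and values. A minimal sketch of that update, assuming row-wise stored patterns and an inverse-temperature `beta` (names are illustrative, not from HopfieldLab):

```python
import torch
import torch.nn.functional as F

def hopfield_retrieve(patterns, query, beta=8.0, steps=3):
    """Modern Hopfield update: xi <- patterns^T softmax(beta * patterns @ xi).

    patterns: (n_mem, d) stored memories, one per row
    query:    (d,) possibly corrupted probe state
    This is the same computation as attention with query=xi and
    keys=values=patterns; a few iterations converge to a memory.
    """
    xi = query
    for _ in range(steps):
        xi = patterns.t() @ torch.softmax(beta * (patterns @ xi), dim=0)
    return xi

torch.manual_seed(0)
patterns = F.normalize(torch.randn(4, 32), dim=1)   # unit-norm memories
noisy = patterns[2] + 0.1 * torch.randn(32)          # corrupted copy of memory 2
recalled = hopfield_retrieve(patterns, noisy)
```

With a sufficiently large `beta`, the softmax concentrates on the best-matching pattern and the update snaps the noisy probe back onto the stored memory, which is the associative-recall behavior the 3D energy landscapes visualize.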