power-law-graph-attention

Here are 3 public repositories matching this topic...

burcgokden / LLM-from-Power-Law-Decoder-Representations

Implementation of PLDR-LLM: Large Language Model from Power Law Decoder Representations

nlp machine-learning natural-language-processing deep-learning tensorflow keras transformer dag directed-acyclic-graph large-language-models llm power-law-graph-attention pldr-llm power-law-decoder-representations

Updated Nov 1, 2024
Python

burcgokden / PLDR-LLM-with-KVG-cache

Star

Implementation of PLDR-LLM with KV-cache and G-cache in Pytorch for the paper titled "PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference"

nlp machine-learning natural-language-processing deep-learning pytorch transformer dag directed-acyclic-graph kv-cache large-language-models llm power-law-graph-attention pldr-llm power-law-decoder-representations g-cache kvg-cache

Updated Mar 19, 2025
Python

burcgokden / PLDR-LLM-Self-Organized-Criticality

Star

Code used in paper titled "PLDR-LLMs Reason at Self-Organized Criticality"

python transformers pytorch attention self-organized-criticality plga large-language-models llm power-law-graph-attention pldr-llm

Updated Mar 26, 2026
Python

Improve this page

Add a description, image, and links to the power-law-graph-attention topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the power-law-graph-attention topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly