Implementation of PLDR-LLM: Large Language Model from Power Law Decoder Representations
-
Updated
Nov 1, 2024 - Python
Implementation of PLDR-LLM: Large Language Model from Power Law Decoder Representations
Implementation of PLDR-LLM with KV-cache and G-cache in Pytorch for the paper titled "PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference"
Code used in paper titled "PLDR-LLMs Reason at Self-Organized Criticality"
Add a description, image, and links to the power-law-graph-attention topic page so that developers can more easily learn about it.
To associate your repository with the power-law-graph-attention topic, visit your repo's landing page and select "manage topics."