[ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and vision-language capabilities
Time series prediction using a decoder-only Transformer, including SwiGLU and RoPE (Rotary Positional Embedding)
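As a point of reference for this entry, here is a minimal, illustrative PyTorch sketch of RoPE; it is not taken from the repository above, and the tensor layout (batch, seq, heads, head_dim) is an assumption:

```python
import torch

def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Rotary Positional Embedding for x of shape (batch, seq, heads, head_dim)."""
    _, seq, _, d = x.shape
    half = d // 2
    # One rotation frequency per dimension pair, one angle per position
    freqs = base ** (-torch.arange(half, dtype=torch.float32) / half)
    angles = torch.arange(seq, dtype=torch.float32)[:, None] * freqs[None, :]
    cos = angles.cos()[None, :, None, :]  # broadcast over batch and heads
    sin = angles.sin()[None, :, None, :]
    x1, x2 = x[..., :half], x[..., half:]
    # Rotate each (x1, x2) pair by its position-dependent angle
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

q = torch.randn(2, 16, 4, 64)
print(apply_rope(q).shape)  # torch.Size([2, 16, 4, 64])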
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
Minimal decoder-only seq2seq pipeline with proper causal masking, teacher forcing, Ignite training loop, and checkpointed inference
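Causal masking is the defining trick of decoder-only models: position i may attend only to positions ≤ i. A generic sketch (not this repository's code; shapes are assumptions):

```python
import torch

def causal_mask(seq_len: int) -> torch.Tensor:
    """True marks future positions that attention must not see."""
    return torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)

def masked_attention(q, k, v):
    # q, k, v: (batch, heads, seq, head_dim)
    scores = q @ k.transpose(-2, -1) / q.size(-1) ** 0.5
    scores = scores.masked_fill(causal_mask(q.size(-2)), float("-inf"))
    return scores.softmax(dim=-1) @ v
```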
SAMPO: Scale-wise Autoregression with Motion Prompt for Generative World Models
ViAG: A Novel Framework for Fine-tuning Answer Generation Models Utilizing Encoder-Decoder and Decoder-only Transformer Architectures
A from-scratch implementation of a scaled-down GPT-2 model in PyTorch, trained on the Snappfood dataset for sentiment-controlled Persian text generation.
Developed and pre-trained a 20.39M-parameter Punjabi GPT-style base model from scratch, including corpus preparation, tokenizer training, benchmark evaluation, and text generation, using a cleaned Punjabi corpus and local Apple Silicon GPU acceleration.
Clean-room GPT-2/GPT-3 implementation: tokenizers, architecture blocks, training loop with AdamW + cosine decay, CLI scripts, inference tools, and pytest suite. Covers OpenWebText-10k & WikiText-103 workflows. Designed as an academic reference for understanding and scaling decoder-only transformers
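The AdamW-plus-cosine-decay recipe mentioned here is the standard one for GPT pretraining. A minimal sketch of the pattern, assuming a stand-in model and dummy loss rather than this repository's actual training loop:

```python
import torch

model = torch.nn.Linear(64, 64)  # stand-in for the GPT model
opt = torch.optim.AdamW(model.parameters(), lr=3e-4, weight_decay=0.1)
sched = torch.optim.lr_scheduler.CosineAnnealingLR(opt, T_max=1_000)

for step in range(1_000):
    loss = model(torch.randn(8, 64)).pow(2).mean()  # dummy loss
    opt.zero_grad()
    loss.backward()
    opt.step()
    sched.step()  # learning rate follows a cosine decay
```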
Chinese-to-English sequence transduction model. A rigorous from-scratch reproduction of the complete "Attention Is All You Need" pipeline for Chinese→English (Zh→En) machine translation.
Building a Transformer model from scratch, with variants such as Multi-Head Attention and Grouped Query Attention, on the books of Machado de Assis.
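Grouped Query Attention, named in the entry above, lets several query heads share one key/value head to cut KV-cache size. An illustrative sketch (not this repository's code; head counts and shapes are assumptions):

```python
import torch

def grouped_query_attention(q, k, v):
    """q: (batch, n_heads, seq, d); k, v: (batch, n_kv_heads, seq, d),
    where n_kv_heads divides n_heads."""
    group = q.size(1) // k.size(1)
    # Expand the shared K/V heads so every query head has a partner
    k = k.repeat_interleave(group, dim=1)
    v = v.repeat_interleave(group, dim=1)
    scores = q @ k.transpose(-2, -1) / q.size(-1) ** 0.5
    return scores.softmax(dim=-1) @ v

q = torch.randn(1, 8, 10, 32)      # 8 query heads
k = v = torch.randn(1, 2, 10, 32)  # 2 shared K/V heads
print(grouped_query_attention(q, k, v).shape)  # torch.Size([1, 8, 10, 32])
```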
A mini version of GPT implemented on Shakespeare text using BPE
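For context on the BPE mentioned above, one training step merges the most frequent adjacent token pair. A toy sketch of that step, assuming a plain list-of-strings token representation rather than this repository's implementation:

```python
from collections import Counter

def bpe_merge_step(tokens: list[str]) -> list[str]:
    """Merge the most frequent adjacent token pair (one BPE training step)."""
    pairs = Counter(zip(tokens, tokens[1:]))
    if not pairs:
        return tokens
    (a, b), _ = pairs.most_common(1)[0]
    merged, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == (a, b):
            merged.append(a + b)  # fuse the pair into one token
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged

tokens = list("banana banana")
for _ in range(3):
    tokens = bpe_merge_step(tokens)
print(tokens)  # progressively merged subword tokens
```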
A compilation of exercises for learning how to implement a Transformer model
A compact, readable GPT-style decoder-only Transformer implemented in pure PyTorch. The goal is to expose the essential architectural pieces with minimal scaffolding so you can train and tinker quickly.
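The "essential architectural pieces" such a repo exposes typically reduce to one block: masked self-attention plus an MLP, each wrapped in a pre-norm residual. A minimal sketch under those assumptions (not the repository's actual code):

```python
import torch
import torch.nn as nn

class DecoderBlock(nn.Module):
    """Pre-norm decoder block: masked self-attention followed by an MLP."""
    def __init__(self, d_model: int = 256, n_heads: int = 4):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        s = x.size(1)
        mask = torch.triu(torch.ones(s, s, dtype=torch.bool, device=x.device), 1)
        h = self.ln1(x)
        a, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        x = x + a                         # residual around attention
        return x + self.mlp(self.ln2(x))  # residual around MLP

block = DecoderBlock()
print(block(torch.randn(2, 16, 256)).shape)  # torch.Size([2, 16, 256])
```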
Implementation of the GPT-2 architecture using PyTorch, trained on the TinyStories dataset. Features custom training pipelines on Modal (cloud computing) and integration with the Hugging Face ecosystem.
Autoregressive text generation application using a decoder-only Transformer
A decoder-only Transformer with the simplest character-level tokenization, plus training and text generation.
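Character-level tokenization, as used in the entry above, needs only a sorted vocabulary of the characters seen in the corpus. A self-contained sketch with a stand-in corpus string (a real repo would read a file):

```python
text = "to be or not to be"  # stand-in corpus
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # char -> integer id
itos = {i: ch for ch, i in stoi.items()}      # integer id -> char

def encode(s: str) -> list[int]:
    return [stoi[c] for c in s]

def decode(ids: list[int]) -> str:
    return "".join(itos[i] for i in ids)

print(encode("to be"))           # [6, 4, 0, 1, 2]
print(decode(encode("to be")))   # "to be"
```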