This folder contains the training code for the Cross-Layer Transcoder (CLT) model, a sparse-autoencoder-style model that replaces the ESM-2 MLP blocks and reconstructs their outputs across layers.
- `clt_model.py`: CLT model architecture with top-k sparse activation and cross-layer decoding
- `clt_module.py`: PyTorch Lightning training module with loss functions and optimization
- `data_module.py`: data loading and preprocessing for protein sequences
- `run_clt.py`: main training script with argument parsing and logging
- `main.sh`: shell script for running training with default parameters
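For intuition, the top-k sparse activation used by the CLT keeps only the k largest feature activations per example and zeros the rest. A minimal NumPy sketch of this operation (illustrative only, not the actual `clt_model.py` implementation; function name and shapes are assumptions):

```python
import numpy as np

def topk_sparse(acts: np.ndarray, k: int) -> np.ndarray:
    """Keep the k largest activations in each row; zero out all others.

    Illustrative stand-in for the CLT's top-k activation; the real model
    applies this to per-token feature activations inside the network.
    """
    out = np.zeros_like(acts)
    # Indices of the k largest entries along the feature dimension
    idx = np.argpartition(acts, -k, axis=-1)[..., -k:]
    np.put_along_axis(out, idx, np.take_along_axis(acts, idx, axis=-1), axis=-1)
    return out

feats = np.array([[0.1, 0.9, 0.3, 0.7],
                  [0.5, 0.2, 0.8, 0.4]])
sparse = topk_sparse(feats, k=2)
# each row retains its two largest values; the others become 0
```

This kind of hard sparsity constraint fixes the number of active features per example, instead of tuning an L1 penalty to reach a target sparsity level.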
Run training with default settings:
```bash
./main.sh
```

Or customize parameters by editing the variables in `main.sh`. A trained CLT is available at: https://huggingface.co/anonymous-hf-user/ProtoMechModels/tree/main/CLT_L6_D3200