Skip to content

Pull requests: safety-research/crosscoder_emergent_misalignment

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Speed up training code with 4 major performance optimizations
#691 opened Jul 25, 2025 by superkaiba Collaborator Loading…
6 tasks done
Speed up top prompts and token processing analysis steps
#690 opened Jul 25, 2025 by superkaiba Collaborator Loading…
Add comprehensive support for Gemma models
#688 opened Jul 25, 2025 by superkaiba Collaborator Loading…
Add comprehensive perplexity analysis tool
#604 opened Jul 22, 2025 by superkaiba Collaborator Loading…
Add Matryoshka interpretability analysis standalone script
#571 opened Jul 21, 2025 by superkaiba Collaborator Loading…
Implement MatryoshkaSharedCrossCoder: Combine Matryoshka and Shared Features
#563 opened Jul 21, 2025 by superkaiba Collaborator Loading…
4 of 6 tasks
ProTip! Adding no:label will show everything without a label.