Repositories list Public safety-research/legibility’s past year of commit activity Jupyter Notebook
• • 00 forks • 00 stars • 00 issues • 00 pull requests • Updated Apr 19, 2026 Apr 19, 2026 Public safety-research/aligning-ai-orgs’s past year of commit activity Python
• 00 forks • 00 stars • 00 issues • 00 pull requests • Updated Apr 17, 2026 Apr 17, 2026 Public safety-research/introspection-mechanisms’s past year of commit activity Python
• 55 forks • 1717 stars • 00 issues • 00 pull requests • Updated Apr 16, 2026 Apr 16, 2026 Public safety-research/petri’s past year of commit activity Python
• • 149149 forks • 995995 stars • 33 issues • 44 pull requests • Updated Apr 16, 2026 Apr 16, 2026 Public safety-research/auditing-agents’s past year of commit activity Python
• 22 forks • 1313 stars • 11 issue • 22 pull requests • Updated Apr 14, 2026 Apr 14, 2026 Public safety-research/automated-w2s-research’s past year of commit activity Python
• 2929 forks • 136136 stars • 00 issues • 00 pull requests • Updated Apr 13, 2026 Apr 13, 2026 Public safety-research/crosscoder_emergent_misalignment’s past year of commit activity Python
• 00 forks • 55 stars • 5555 issues • 88 pull requests • Updated Apr 1, 2026 Apr 1, 2026 Public safety-research/agent-transcript-editor’s past year of commit activity Python
• 00 forks • 11 star • 00 issues • 00 pull requests • Updated Mar 31, 2026 Mar 31, 2026 Public safety-research/trusted-monitor’s past year of commit activity Python
• 00 forks • 11 star • 00 issues • 00 pull requests • Updated Mar 28, 2026 Mar 28, 2026 Public safety-research/safety-tooling’s past year of commit activity Python
• • 3636 forks • 115115 stars • 1313 issues • 1818 pull requests • Updated Mar 23, 2026 Mar 23, 2026 Public safety-research/PurpleLlama’s past year of commit activity Python
• • 723723 forks • 00 stars • 00 issues • 00 pull requests • Updated Feb 23, 2026 Feb 23, 2026 Public safety-research/bloom’s past year of commit activity Python
• • 162162 forks • 1.3k1.3k stars • 00 issues • 88 pull requests • Updated Feb 17, 2026 Feb 17, 2026 Public safety-research/casr’s past year of commit activity Rust
• • 3636 forks • 00 stars • 00 issues • 00 pull requests • Updated Feb 3, 2026 Feb 3, 2026 Public safety-research/assistant-axis’s past year of commit activity Jupyter Notebook
• 3636 forks • 127127 stars • 22 issues • 11 pull request • Updated Jan 20, 2026 Jan 20, 2026 Public safety-research/selective-gradient-masking’s past year of commit activity Python
• • 55 forks • 5151 stars • 00 issues • 00 pull requests • Updated Jan 11, 2026 Jan 11, 2026 Public safety-research/how-ai-impacts-skill-formation’s past year of commit activity Python
• 22 forks • 1313 stars • 00 issues • 11 pull request • Updated Jan 3, 2026 Jan 3, 2026 Public safety-research/A3’s past year of commit activity Python
• • 11 fork • 1414 stars • 00 issues • 00 pull requests • Updated Dec 29, 2025 Dec 29, 2025 Python
• • 22 forks • 2525 stars • 00 issues • 00 pull requests • Updated Dec 3, 2025 Dec 3, 2025 Public safety-research/impossiblebench’s past year of commit activity Python
• • 88 forks • 3737 stars • 00 issues • 00 pull requests • Updated Dec 1, 2025 Dec 1, 2025 Public safety-research/SCONE-bench’s past year of commit activity • 2929 forks • 177177 stars • 55 issues • 00 pull requests • Updated Nov 25, 2025 Nov 25, 2025 Public safety-research/unsupervised-truth-probes’s past year of commit activity Python
• 00 forks • 55 stars • 11 issue • 00 pull requests • Updated Nov 24, 2025 Nov 24, 2025 Public safety-research/ciphered-reasoning-llms’s past year of commit activity Jupyter Notebook
• • 1515 forks • 99 stars • 00 issues • 00 pull requests • Updated Nov 20, 2025 Nov 20, 2025 Jinja
• • 1515 forks • 2525 stars • 00 issues • 22 pull requests • Updated Nov 11, 2025 Nov 11, 2025 Public safety-research/weight-steering’s past year of commit activity Python
• 33 forks • 88 stars • 00 issues • 00 pull requests • Updated Nov 11, 2025 Nov 11, 2025 Public safety-research/misalignment-scraper’s past year of commit activity Python
• • 00 forks • 11 star • 00 issues • 00 pull requests • Updated Nov 1, 2025 Nov 1, 2025 Public safety-research/believe-it-or-not’s past year of commit activity Python
• • 44 forks • 1313 stars • 11 issue • 00 pull requests • Updated Oct 23, 2025 Oct 23, 2025 Public safety-research/science-synth-facts’s past year of commit activity Python
• 55 forks • 66 stars • 11 issue • 00 pull requests • Updated Oct 22, 2025 Oct 22, 2025 Public safety-research/finetuning-auditor’s past year of commit activity Python
• • 33 forks • 2020 stars • 00 issues • 00 pull requests • Updated Oct 21, 2025 Oct 21, 2025 Public safety-research/inoculation-prompting’s past year of commit activity Python
• 55 forks • 1010 stars • 00 issues • 00 pull requests • Updated Oct 13, 2025 Oct 13, 2025 Public safety-research/verl’s past year of commit activity Python
• • 3.7k3.7k forks • 44 stars • 00 issues • 11 pull request • Updated Oct 3, 2025 Oct 3, 2025 ProTip! When viewing an organization's repositories, you can use the
props. filter to filter by custom property.