tomMcGrath/tommcgrath.github.io

Work

I'm chief scientist and co-founder of Goodfire, an AI interpretability startup.

I was at DeepMind from 2019 to late 2023, where I worked on:

  • Interpretability for LLMs (e.g. the Hydra Effect, Copy Suppression) and AlphaZero.
  • Science of training data.
  • RLHF data quality and self-annotation.
  • Evaluation of generalist deep RL agents.

I did my PhD (thesis) at Imperial College with Nick Jones and Kevin Murphy.

Research

My papers are listed on my Google Scholar page. I have a list of research projects I'm interested in working on.

If there's something on there you're interested in collaborating on, please get in touch!

Writing

I have a Substack if you prefer to read there.

Contact me

Email is probably best, but you can reach me on Twitter or LinkedIn as well.
