NoahAmsel/PolarExpress

PolarExpress

This repo implements the PolarExpress method from our paper, The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm. For now, simply copy polar_express.py into your repo and call the PolarExpress function. The coefficients are generated up front. You can adjust the safety factors safety_factor_eps and cushion as you like, but the polynomial degree is currently fixed at 5. Note that if safety_factor_eps > 0, the method may not converge all the way to full precision, though this does not matter for deep learning applications.
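
To give a feel for what a degree-5 polynomial iteration for the polar factor looks like, here is a minimal NumPy sketch. It is not the code in polar_express.py: the helper name polar_factor_sketch is hypothetical, and the coefficients are the classical quintic Newton-Schulz ones, swapped in as an assumption in place of the optimized coefficients the repo generates. The safety factors safety_factor_eps and cushion are not modeled here.

```python
import numpy as np

# Classical quintic Newton-Schulz coefficients, p(x) = (15x - 10x^3 + 3x^5) / 8.
# Assumption for illustration only: these are NOT the optimized coefficients
# that polar_express.py generates.
A_COEF, B_COEF, C_COEF = 15.0 / 8.0, -10.0 / 8.0, 3.0 / 8.0


def polar_factor_sketch(G, steps=30):
    """Approximate the polar factor U @ Vt of G = U @ diag(s) @ Vt.

    Hypothetical helper: applies the same odd degree-5 polynomial to the
    singular values of X at every step, pushing each nonzero one toward 1.
    """
    X = np.asarray(G, dtype=np.float64)
    transposed = X.shape[0] < X.shape[1]
    if transposed:
        X = X.T  # work with a tall matrix so the Gram matrix X.T @ X is small
    # The Frobenius norm bounds the spectral norm, so after this scaling
    # every singular value lies in [0, 1].
    X = X / (np.linalg.norm(X) + 1e-12)
    eye = np.eye(X.shape[1])
    for _ in range(steps):
        S = X.T @ X  # Gram matrix
        X = X @ (A_COEF * eye + B_COEF * S + C_COEF * (S @ S))
    return X.T if transposed else X


# Usage: compare against the polar factor computed from an SVD.
G = np.array([[3.0, 1.0], [1.0, 2.0], [0.0, 1.0]])
U, _, Vt = np.linalg.svd(G, full_matrices=False)
Q = polar_factor_sketch(G)
max_error = np.abs(Q - U @ Vt).max()
```

The fixed coefficients above converge slowly when the input has very small singular values, which is why this sketch defaults to 30 steps. Choosing the coefficients of each step in advance, as the PolarExpress function does, is meant to address exactly that.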

If you wish to reproduce our experiments, see the polar branch of the GPT-opt repo.
